PRAI DE 2000-10009055 20000228

L3 ANSWER 38 OF 111 DISSABS COPYRIGHT (C) 2004 ProQuest Information and 

Learning Company; All Rights Reserved on STN 
AN 93:38034 DISSABS Order Number: AAR9318723 

TI ISOLATION AND CHARACTERIZATION OF A SODIUM- AND CHLORIDE-DEPENDENT GABA

TRANSPORTER CDNA FROM TORPEDO CALIFORNICA (SODIUM DEPENDENT) 
AU SWANSON, GEOFFREY TODD [PH.D.]; CHO, ARTHUR K. [advisor] 
CS UNIVERSITY OF CALIFORNIA, LOS ANGELES (0031) 

SO Dissertation Abstracts International, (1993) Vol. 54, No. 3B, p. 1245.

Order No.: AAR9318723. 159 pages. 
DT Dissertation 
FS DAI 
LA English 

ED Entered STN: 19930817 

Last Updated on STN: 19930817 

L3 ANSWER 39 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN
AN ADD50662 peptide DGENE

TI Novel isolated polynucleotide encoding ***human*** or mouse

***high*** ***affinity*** ***choline*** ***transporter***
polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 

IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC C-terminal peptide of ***human***, mouse and rat CHT.

L3 ANSWER 40 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN
AN ADD50647 protein DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC ***High*** - ***affinity*** ***choline*** ***transporter***

(CHT) associated protein sequence #1. 

L3 ANSWER 41 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 

AN ADD50643 protein DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***
polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 

IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 
CR N-PSDB: ADD50642 

DESC Rat ***high*** - ***affinity*** ***choline***
***transporter*** (rCHT).

L3 ANSWER 42 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN



AN ADD50648 protein DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC ***High*** - ***affinity*** ***choline*** ***transporter***

(CHT) associated protein sequence #2. 

L3 ANSWER 43 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50639 protein DGENE

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 
CR N-PSDB: ADD50638 

DESC ***Human*** ***high*** - ***affinity*** ***choline***

***transporter*** (hCHT).

L3 ANSWER 44 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50649 protein DGENE

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 
PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC ***High*** - ***affinity*** ***choline*** ***transporter***

(CHT) associated protein sequence #3. 

L3 ANSWER 45 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50645 protein DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S.

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 
DESC C. elegans CHO1 protein.

L3 ANSWER 46 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN



AN ADD50661 protein DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***
polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 

IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 
CR N-PSDB: ADD50660 

DESC Mouse ***high*** - ***affinity*** ***choline***
***transporter*** (mCHT) #2.

L3 ANSWER 47 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 

AN ADD50641 protein DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***
polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 

IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S.

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 
CR N-PSDB: ADD50640

DESC Mouse ***high*** - ***affinity*** ***choline***
***transporter*** (mCHT) #1. 

L3 ANSWER 48 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ABU08980 Protein DGENE

TI Novel isolated polynucleotide (I) that encodes ***high***

***affinity*** ***choline*** ***transporter*** protein, useful

for preventing, treating or ameliorating neurological and cognitive 
disorders such as Alzheimer's or Parkinson's disease - 

IN Wu D; Gu Y; Millard W J; He Y 

PA (UYFL) UNIV FLORIDA. 

PI US 6500643 B1 20021231 20p

AI US 2000-657252 20000907 

PRAI US 2000-657252 20000907 

DT Patent 

LA English 

OS 2003-361535 [34] 

CR N-PSDB: ABX94339

DESC ***Human*** choline acetyltransferase.

L3 ANSWER 49 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ABU08979 Protein DGENE

TI Novel isolated polynucleotide (I) that encodes ***high***

***affinity*** ***choline*** ***transporter*** protein, useful

for preventing, treating or ameliorating neurological and cognitive 

disorders such as Alzheimer's or Parkinson's disease 
IN Wu D; Gu Y; Millard W J; He Y 
PA (UYFL) UNIV FLORIDA. 

PI US 6500643 B1 20021231 20p

AI US 2000-657252 20000907 

PRAI US 2000-657252 20000907 

DT Patent 

LA English 

OS 2003-361535 [34] 

CR N-PSDB: ABX94338

DESC ***Human*** ***high*** ***affinity*** ***choline***

***transporter***, HACT.



L3 ANSWER 50 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN AAB74666 Protein DGENE



TI New rat and ***human*** spinal cord ***high*** ***affinity***

***choline*** ***transporters*** , useful in diagnosis of 

Alzheimer's disease and screening promoters as drugs for treating

Alzheimer's disease - 
IN Haga T; Okuda T 

PA (NISC-N) JAPAN SCI & TECHNOLOGY CORP. 

PI WO 2001016315 A1 20010308 90p
AI WO 2000-JP5545 20000818 
PRAI JP 1999-240642 19990827 
JP 1999-368991 19991227 
DT Patent 
LA Japanese 
OS 2001-226688 [23] 
CR N-PSDB: AAF81713 

DESC Mouse ***high*** ***affinity*** ***choline***

***transporter*** protein. 

L3 ANSWER 51 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN AAB74665 Protein DGENE 

TI New rat and ***human*** spinal cord ***high*** ***affinity***

***choline*** ***transporters*** , useful in diagnosis of 

Alzheimer's disease and screening promoters as drugs for treating 
Alzheimer's disease 
IN Haga T; Okuda T 

PA (NISC-N) JAPAN SCI & TECHNOLOGY CORP. 

PI WO 2001016315 A1 20010308 90p
AI WO 2000-JP5545 20000818 
PRAI JP 1999-240642 19990827 
JP 1999-368991 19991227 
DT Patent 
LA Japanese 
OS 2001-226688 [23] 
CR N-PSDB: AAF81712 

DESC ***Human*** ***high*** ***affinity*** ***choline***

***transporter*** protein. 

L3 ANSWER 52 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN
AN AAB74664 Protein DGENE 

TI New rat and ***human*** spinal cord ***high*** ***affinity***

***choline*** ***transporters*** , useful in diagnosis of 

Alzheimer's disease and screening promoters as drugs for treating 

Alzheimer's disease 
IN Haga T; Okuda T 

PA (NISC-N) JAPAN SCI & TECHNOLOGY CORP. 

PI WO 2001016315 A1 20010308 90p
AI WO 2000-JP5545 20000818 
PRAI JP 1999-240642 19990827 
JP 1999-368991 19991227 
DT Patent 
LA Japanese 
OS 2001-226688 [23] 
CR N-PSDB: AAF81711 

DESC Rat ***high*** ***affinity*** ***choline***

***transporter*** protein. 

L3 ANSWER 53 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN AAB74663 Protein DGENE 

TI New rat and ***human*** spinal cord ***high*** ***affinity***

***choline*** ***transporters*** , useful in diagnosis of 

Alzheimer's disease and screening promoters as drugs for treating 

Alzheimer's disease 
IN Haga T; Okuda T 

PA (NISC-N) JAPAN SCI & TECHNOLOGY CORP. 

PI WO 2001016315 A1 20010308 90p
AI WO 2000-JP5545 20000818 
PRAI JP 1999-240642 19990827 
JP 1999-368991 19991227 
DT Patent 
LA Japanese 
OS 2001-226688 [23] 
CR N-PSDB: AAF81710 

DESC C. elegans ***high*** ***affinity*** ***choline***

***transporter*** protein.

L3 ANSWER 54 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 



* * * 



TI Identifying genes having altered expression level in presence of non- 
conventional transmissible agent, e.g. prion, useful for diagnosis and 
drug screening. 

IN Mouthon F; Nouvel V; Deslys J P 

PA (COMS) COMMISSARIAT ENERGIE ATOMIQUE. 

PI FR 2839081 A1 20031031 100p

AI FR 2002-5392 20020429 

PRAI FR 2002-5392 20020429 

DT Patent 

LA French 

OS 2004-045747 [05] 

DESC NaCl dependent ***high*** ***affinity*** ***choline***

***transporter*** PCR primer, SEQ ID 125.

L3 ANSWER 55 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADF08397 DNA DGENE

TI Identifying genes having altered expression level in presence of non- 
conventional transmissible agent, e.g. prion, useful for diagnosis and 
drug screening. 

IN Mouthon F; Nouvel V; Deslys J P 

PA (COMS) COMMISSARIAT ENERGIE ATOMIQUE. 

PI FR 2839081 A1 20031031 100p

AI FR 2002-5392 20020429 

PRAI FR 2002-5392 20020429 

DT Patent 

LA French 

OS 2004-045747 [05] 

DESC NaCl dependent ***high*** ***affinity*** ***choline***

***transporter*** PCR primer, SEQ ID 126. 

L3 ANSWER 56 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN
AN ADD50653 DNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***
polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 

IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC PCR primer #2 for ***human*** ***high*** - ***affinity***

***choline*** ***transporter*** (hCHT) cDNA.

L3 ANSWER 57 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50650 DNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***
polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 

IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC BAC sequence #1 containing hCHT DNA. 

L3 ANSWER 58 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 

AN ADD50664 DNA DGENE

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***
polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 

IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S.



(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 

OS 2003-810914 [76]

DESC PCR primer #2 for mouse ***high*** - ***affinity*** ***choline***
***transporter*** (mCHT) cDNA.

L3 ANSWER 59 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50660 cDNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 
CR P-PSDB: ADD50661 

DESC cDNA encoding mouse ***high*** - ***affinity*** ***choline***
***transporter*** (mCHT) #2. 

L3 ANSWER 60 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN
AN ADD50646 DNA DGENE

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***
polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 

IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D.

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC ***High*** - ***affinity*** ***choline*** ***transporter***

(CHT) associated DNA sequence #2. 

L3 ANSWER 61 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50652 DNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 

OS 2003-810914 [76]

DESC PCR primer #1 for ***human*** ***high*** - ***affinity***

***choline*** ***transporter*** (hCHT) cDNA. 

L3 ANSWER 62 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50651 DNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***
polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 

IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D.



(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC BAC sequence #2 containing hCHT DNA.

L3 ANSWER 63 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50644 DNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***
polypeptide, useful in gene therapy to increase cholinergic function in
a cell of a patient suffering from Alzheimer's disease.

IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC ***High*** - ***affinity*** ***choline*** ***transporter***

(CHT) associated DNA sequence #1. 

L3 ANSWER 64 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50657 DNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC ***Human*** chromosome 2 marker D2S340. 

L3 ANSWER 65 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 

AN ADD50663 DNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***
polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 

IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC PCR primer #1 for mouse ***high*** - ***affinity*** ***choline***
***transporter*** (mCHT) cDNA.

L3 ANSWER 66 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50638 cDNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in
a cell of a patient suffering from Alzheimer's disease.

IN Blakely R D; Apparsundaram S; Ferguson S 

PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S.



PI US 2003114399 A1 20030619 74p

AI US 2001-911077 20010723 

PRAI US 2001-911077 20010723 

DT Patent 

LA English 

OS 2003-810914 [76] 

CR P-PSDB: ADD50639

DESC cDNA encoding ***human*** ***high*** - ***affinity***

***choline*** ***transporter*** (hCHT).

L3 ANSWER 67 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50658 DNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC ***Human*** chromosome 2 marker D2S176. 

L3 ANSWER 68 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50656 DNA DGENE

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 
a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 
(FERG-I) FERGUSON S.
PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC BAC sequence containing hCHT gene. 

L3 ANSWER 69 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 

AN ADD50655 DNA DGENE

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 

OS 2003-810914 [76]

DESC PCR primer #2 for ***human*** ***high*** - ***affinity***

***choline*** ***transporter*** (hCHT) gene. 

L3 ANSWER 70 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 

AN ADD50642 cDNA DGENE

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S. 

PI US 2003114399 A1 20030619 74p



AI US 2001-911077 20010723 

PRAI US 2001-911077 20010723 

DT Patent 

LA English 

OS 2003-810914 [76] 

CR P-PSDB: ADD50643 

DESC cDNA encoding rat ***high*** - ***affinity*** ***choline***
***transporter*** (rCHT).

L3 ANSWER 71 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50640 cDNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S.

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 
CR P-PSDB: ADD50641

DESC cDNA encoding mouse ***high*** - ***affinity*** ***choline***
***transporter*** (mCHT) #1. 

L3 ANSWER 72 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50654 DNA DGENE 

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S.

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC PCR primer #1 for ***human*** ***high*** - ***affinity***

***choline*** ***transporter*** (hCHT) gene. 

L3 ANSWER 73 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ADD50659 DNA DGENE

TI Novel isolated polynucleotide encoding ***human*** or mouse 

***high*** ***affinity*** ***choline*** ***transporter***

polypeptide, useful in gene therapy to increase cholinergic function in 

a cell of a patient suffering from Alzheimer's disease. 
IN Blakely R D; Apparsundaram S; Ferguson S 
PA (BLAK-I) BLAKELY R D. 

(APPA-I) APPARSUNDARAM S. 

(FERG-I) FERGUSON S.

PI US 2003114399 A1 20030619 74p
AI US 2001-911077 20010723 
PRAI US 2001-911077 20010723 
DT Patent 
LA English 
OS 2003-810914 [76] 

DESC ***Human*** chromosome 2 marker D2S1893. 

L3 ANSWER 74 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ABX94341 DNA DGENE 

TI Novel isolated polynucleotide (I) that encodes ***high*** 

***affinity*** ***choline*** ***transporter*** protein, useful

for preventing, treating or ameliorating neurological and cognitive

disorders such as Alzheimer's or Parkinson's disease 
IN Wu D; Gu Y; Millard W J; He Y 
PA (UYFL) UNIV FLORIDA. 

PI US 6500643 B1 20021231 20p
AI US 2000-657252 20000907



PRAI US 2000-657252 20000907 
DT Patent 
LA English 

OS 2003-361535 [34]

DESC ***Human*** ***high*** ***affinity*** ***choline***

***transporter***, HACT, PCR primer #2.

L3 ANSWER 75 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN ABX94340 DNA DGENE

TI Novel isolated polynucleotide (I) that encodes ***high*** 

***affinity*** ***choline*** ***transporter*** protein, useful

for preventing, treating or ameliorating neurological and cognitive 

disorders such as Alzheimer's or Parkinson's disease 
IN Wu D; Gu Y; Millard W J; He Y 
PA (UYFL) UNIV FLORIDA. 

PI US 6500643 B1 20021231 20p
AI US 2000-657252 20000907 
PRAI US 2000-657252 20000907 
DT Patent 
LA English 

OS 2003-361535 [34]

DESC ***Human*** ***high*** ***affinity*** ***choline***

***transporter***, HACT, PCR primer #1.

L3 ANSWER 76 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 

AN ABX94339 cDNA DGENE

TI Novel isolated polynucleotide (I) that encodes ***high*** 

***affinity*** ***choline*** ***transporter*** protein, useful

for preventing, treating or ameliorating neurological and cognitive 
disorders such as Alzheimer's or Parkinson's disease - 

IN Wu D; Gu Y; Millard W J; He Y 

PA (UYFL) UNIV FLORIDA. 

PI US 6500643 B1 20021231 20p

AI US 2000-657252 20000907 

PRAI US 2000-657252 20000907 

DT Patent 

LA English 

OS 2003-361535 [34] 

CR P-PSDB: ABU08980 

DESC ***Human*** cDNA encoding choline acetyltransferase.

L3 ANSWER 77 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 

AN ABX94338 cDNA DGENE

TI Novel isolated polynucleotide (I) that encodes ***high***

***affinity*** ***choline*** ***transporter*** protein, useful

for preventing, treating or ameliorating neurological and cognitive 
disorders such as Alzheimer's or Parkinson's disease - 
IN Wu D; Gu Y; Millard W J; He Y 
PA (UYFL) UNIV FLORIDA. 

PI US 6500643 B1 20021231 20p

AI US 2000-657252 20000907 

PRAI US 2000-657252 20000907 

DT Patent 

LA English 

OS 2003-361535 [34] 

CR P-PSDB: ABU08979

DESC ***Human*** cDNA encoding ***high*** ***affinity***

***choline*** ***transporter***, HACT.

L3 ANSWER 78 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN AAF81713 cDNA DGENE

TI New rat and ***human*** spinal cord ***high*** ***affinity***
***choline*** ***transporters*** , useful in diagnosis of 

Alzheimer's disease and screening promoters as drugs for treating 

Alzheimer's disease 
IN Haga T; Okuda T 

PA (NISC-N) JAPAN SCI & TECHNOLOGY CORP. 

PI WO 2001016315 A1 20010308 90p
AI WO 2000-JP5545 20000818 
PRAI JP 1999-240642 19990827 
JP 1999-368991 19991227 
DT Patent 
LA Japanese 
OS 2001-226688 [23] 
CR P-PSDB: AAB74666 



DESC Mouse ***high*** ***affinity*** ***choline***

***transporter*** protein encoding cDNA.



L3 ANSWER 79 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN AAF81712 cDNA DGENE 

TI New rat and ***human*** spinal cord ***high*** ***affinity***

***choline*** ***transporters***, useful in diagnosis of

Alzheimer's disease and screening promoters as drugs for treating 

Alzheimer's disease
IN Haga T; Okuda T 

PA (NISC-N) JAPAN SCI & TECHNOLOGY CORP. 

PI WO 2001016315 A1 20010308 90p
AI WO 2000-JP5545 20000818 
PRAI JP 1999-240642 19990827 
JP 1999-368991 19991227 
DT Patent 
LA Japanese 
OS 2001-226688 [23] 

CR P-PSDB: AAB74665

DESC ***Human*** ***high*** ***affinity*** ***choline***

***transporter*** protein encoding cDNA. 

L3 ANSWER 80 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN
AN AAF81711 cDNA DGENE

TI New rat and ***human*** spinal cord ***high*** ***affinity***

***choline*** ***transporters*** , useful in diagnosis of 

Alzheimer's disease and screening promoters as drugs for treating 

Alzheimer's disease 
IN Haga T; Okuda T 

PA (NISC-N) JAPAN SCI & TECHNOLOGY CORP. 

PI WO 2001016315 A1 20010308 90p
AI WO 2000-JP5545 20000818 
PRAI JP 1999-240642 19990827 
JP 1999-368991 19991227 
DT Patent 
LA Japanese 
OS 2001-226688 [23] 

CR P-PSDB: AAB74664

DESC Rat ***high*** ***affinity*** ***choline***

***transporter*** protein encoding cDNA. 

L3 ANSWER 81 OF 111 DGENE COPYRIGHT 2004 THOMSON DERWENT on STN 
AN AAF81710 cDNA DGENE

TI New rat and ***human*** spinal cord ***high*** ***affinity***

***choline*** ***transporters***, useful in diagnosis of

Alzheimer's disease and screening promoters as drugs for treating 

Alzheimer's disease 
IN Haga T; Okuda T 

PA (NISC-N) JAPAN SCI & TECHNOLOGY CORP. 

PI WO 2001016315 A1 20010308 90p
AI WO 2000-JP5545 20000818 
PRAI JP 1999-240642 19990827 
JP 1999-368991 19991227 
DT Patent 
LA Japanese 
OS 2001-226688 [23] 

CR P-PSDB: AAB74663

DESC C. elegans ***high*** ***affinity*** ***choline***

***transporter*** protein encoding cDNA.

L3 ANSWER 82 OF 111 EMBASE COPYRIGHT 2004 ELSEVIER INC. ALL RIGHTS 

RESERVED. on STN
AN 96156006 EMBASE 

DN 1996156006

TI ***High*** ***affinity*** ***choline*** ***transporter***

status in Alzheimer's disease tissue from rapid autopsy. 
AU Bissette G.; Seidler F.J.; Nemeroff C.B.; Slotkin T.A.
CS Duke University Medical Center, Durham, NC 27710, United States
SO Annals of the New York Academy of Sciences, (1996) 777/- (197-204).

ISSN: 0077-8923 CODEN: ANYAA 
CY United States 
DT Journal; Conference Article 

FS 005 General Pathology and Pathological Anatomy 

008 Neurology and Neurosurgery 
LA English 
SL English 



L3 ANSWER 83 OF 111 GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC):             BX294147 GenBank (R)
GenBank ACC. NO. (GBN):  BX294147 BX119912
GenBank VERSION (VER):   BX294147.1 GI:32445670
CAS REGISTRY NO. (RN):   543672-06-6
SEQUENCE LENGTH (SQL):   314450
MOLECULE TYPE (CI):      DNA; linear
DIVISION CODE (CI):      Bacteria
DATE (DATE):             11 Jul 2003
DEFINITION (DEF):        Pirellula sp. strain 1 complete genome; segment 15/24
KEYWORDS (ST):           complete genome
SOURCE:                  Pirellula sp.
ORGANISM (ORGN):         Pirellula sp.
                         Bacteria; Planctomycetes; Planctomycetacia;
                         Planctomycetales; Planctomycetaceae; Pirellula
NUCLEIC ACID COUNT (NA): 72076 a 92563 c 83065 g 66746 t
COMMENT:

This project was carried out by 

*Max Planck Institute for Molecular Genetics, Berlin, Germany; *Max 
Planck Institute for Marine Microbiology, Bremen, Germany; in the 

framework of the REGX-project, http://www.regx.de

Genome Center 

Center: Max Planck Institute for Molecular Genetics 
Center code: MPIMG 

Summary Statistics 

Sequencing vector: pUC19; 100% of reads 
Chemistry: Dye-terminator Big Dye; 100% of reads
Assembly program: Phrap; version 0.990329 
Consensus quality: 7142841 bases at least Q40 
Consensus quality: 7145138 bases at least Q30 
Consensus quality: 7145484 bases at least Q20 
Quality coverage: 8.03

This sequence was finished as follows unless otherwise noted: all 
regions were double stranded, sequenced with an alternate 
chemistry, or covered by high quality data (i.e., phred quality >=
30); an attempt was made to resolve all sequencing problems, such
as compressions and repeats; all regions were covered by at least
one plasmid sequence; assembly was additionally confirmed by long
range PCR and cosmid end sequences.

See http://www.micro-genomes.mpg.de/pirellula/ for more information 
including minimal tiling path from a set of 220 cosmids out of 
908. See the misc_feature tag below for the boundaries of the MTP 
cosmids. Annotation 

Center: Max Planck Institute for Marine Microbiology 
Celsiusstrasse 1, D-28359 Bremen, Germany. 

Center Code: MPIMM 

Email: fog@mpi-bremen.de

Phone: +49 (0)421 2028 938 Fax: +49 (0)421 2028 580 



Three different programs (Glimmer, Critica and Orpheus) were used 
for ORF-prediction. A nonredundant list of ORFs was generated by 
suitable parsing of the results. 

Automated annotation was done with the software package Pedant Pro 
(http://www.biomax.de). All ORF predictions and annotations were 
manually corrected by considering all results of the different 
tools applied. See http://www.regx.de for more information and 
access to supplementary information.

REFERENCE:           1 (bases 1 to 314450)
AUTHOR (AU):         Gloeckner, F.O.; Kube, M.; Bauer, M.; Teeling, H.;
                     Lombardot, T.; Ludwig, W.; Gade, D.; Beck, A.; Borzym, K.;
                     Heitmann, K.; Rabus, R.; Schlesner, H.; Amann, R.;
                     Reinhardt, R.
TITLE (TI):          Complete genome sequence of the marine planctomycete
                     Pirellula sp. strain 1
JOURNAL (SO):        Proc. Natl. Acad. Sci. U.S.A., 100 (14), 8298-8303
                     (2003)
OTHER SOURCE (OS):   CA 139:96134
REFERENCE:           2 (bases 1 to 314450)
AUTHOR (AU):         Kube, M.; Borzym, K.; Heitmann, K.; Klages, S.;
                     Marquardt, I.; Lehrack, S.; Beck, A.; Pawlik, R.;
                     Reinhardt, R.; Gloeckner, F.O.; Bauer, M.; Teeling, H.;
                     Schlesner, H.; Amann, R.
TITLE (TI):          Direct Submission
JOURNAL (SO):        Submitted (21-JAN-2003) Max Planck Institute for
                     Molecular Genetics, proScience Ihnestrasse 73, D-14195
                     Berlin, Germany Max Planck Institute for Marine
                     Microbiology Celsiusstrasse 1, D-28359 Bremen, Germany



FEATURES (FEAT) : 
Feature Key 



Location 



Qualifier 




1..314450



/organism="Pirellula sp."
/mol-type="genomic DNA"
/strain="1"
/db-xref="taxon:117"
/gene="cycA"
/locus-tag="RB7973"



II 



gene 



CDS 




/locus-tag="RB7973" 
/function="electron transport and
membrane-associated energy 
conservation" 

/note="PMID: 97428333 PMID: 
2986626 PMID: 9281430 PMID: 
11152119 best DB hits: BLAST: 
pir:A83363; probable cytochrome c 
precursor PA2266 [imported] -; 
E=5e-09 pir:E83075; probable 
cytochrome c PA4571 [imported] - 
Pseudomonas; E=7e-07 pir:CCTW5T; 
cytochrome c552 [validated] - 
Thermus aquaticus; E=9e-07 COG: 
PA2266; COG2010 Cytochrome c, 
mono- and diheme variants; E=5e-10 
PFAM: PF00034; Cytochrome c; 
E=0.042"
/codon-start=1
/transl-table=11
/product="probable cytochrome c
precursor"
/protein-id="CAD78402.1"
/db-xref="GI:32445671"
/translation="MNSLRRSLMLLSVLYEVTVP

VRDTPAENVCTFRHSTLGCFVCSQ 

FRWPAEQKLKLHSKIRQLRKRTMPQRITTVLQWS 

LFFLCAGVIPGIAQRADAQDAFEF 

QSNDWAIYGNGLADRMQHDPWVETFLQHQLKGL 

DVS FRNMSFSGDRVNQRPRNQGFT 

NDTEYLKHVAPNWFTF YGFNES FAGPEKAGEHR 

DELIKLVQRYTQVQKDDGKDLRFV 

LFSPIAYENTGDASLPDGAELNVNLAAYTEATRE 

AAEITGAKFVDLFSPTYQLFQSSS 

ERLTLNGVHLNEAGYKQLAGI ISQALLGEKPASD 

AELQDLYEAI EDKNWHWHNRYRAT 

DGNDIWGSRSTLTFVDGQSNADVLKHELVMLDVM 

TANRDKVIWAAADGRTLQADDSNV 

PPPVKVTSNIGGGSASSNAMKEGSVAYLSPEESL 

AKINVPEGYELNVFASEVQFPDLA 

NPVQMQVDAHGRLWVASWNTYPKWEPGKEMNDSL 

MILEDTDNDGKADVRKI FAHVHNP 

LGFEFWNGGVWTSGPDLLFLKDTDGDDKADVRY 

PILQGLGTSDTHHAAISINLVYGPDG 

GIYWQSGIFLVHNHETPWKQNLNIGASGMYRFDP 

LTFAITPHAGNSPNPHGTSFDYWG 

YCYASDGTGGRCYQVRPEGNGFKMHKLLEKEFRP 

VAANAILSSEHFPEELQNDILICN 

TIGFLGVKQYKLDREGDVEEEAAAEKTLKEETGP 

VKITRNGGLITVDHPALKDAKITG 

FKLSVNGRQQMNLSEVEVISGGRNIAKTAKLAQS 

SEYNNGTFPVQRLVDGDKGNFAHT 

SQQNNPWMRGDFPSPVQISEFKVWNRKGFEDRFN 

NGKIEFFDGDEWAAVDIKIVSAD 

QEEQHREFGEVWGTPGLELLNSDDRNFRPTDAW 

GEDGALYVSDWHNAI IGHMQHNIR 

DPNRDHAHGRI FRLTVKNRPLQKPVKIAGQS lEA 

LLENLKHPVNGVRHRTRVELTKHD 



RNGVKNEALLNQLLESDVRHAWA 

AKTVRHFWENVDTKGGSEFAAPAELEFVKFDPPK 

HLSGADRKTYELGATIYQRESHCA 

TCHMTHGKGTPNVYPPLVGS PWVNGS EDRLI KMA 

LHGVWGKMTVAGKT YD PARGVP PM 

TAFRSLLKDDEMAAVLTFVRNTWGNEASWSPEA 

VSRVREETKGRTTFYKPEEILELH 

PLEKELMDESVTPETEVFSNEELEKELLAASPAK 

LAQVALAKGNFQRGKRLFHESS7VA 

CFACHSPPAGTVRMGPDLEKATTKRSREELVDAL 

LRPSKLIDKDYAQVSVLTADGQIF 

TGI RVSENDDE I VLRNLAQPEP I T I SQ YDVEDVI 

ESEVSLMPENLMRQMKNRREFNDL 

LKYIIEVRKK" 

gene 4473..4817 /locus-tag="RB7978"
CDS 4473..4817 /locus-tag="RB7978"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78403.1"
/db-xref="GI:32445672"
/translation="MRQPTNTLRRSLNWGVKHRT
TVWTSRGLSFQLGTLPRVFKPWLN
LSCNGLVAFRVSGFLSIPYPAVPHNRSRSNAWPS
AQGVSSLLPLTLVLRPIVGAKVSS
GSEWMAWGPFL"

gene 4869..5462 /locus-tag="RB7980"
CDS 4869..5462 /locus-tag="RB7980"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78404.1"
/db-xref="GI:32445673"
/translation="MDHLKNDLALNQIIESCNSI
QTTVHKAQMLPVSRGETGPLRWRE
SCDKKIAIAGDILQFQLTLEMPQVTDECDWQLA
LSRSYGPNLMFDQIVSTHIDGKPA
RPTNHSMHRSLDLHFNAIQSGSTWIRYQVRVDR
TADPSIVALDNIDVRWAFRSQSKR
DAARKAHTASPGVSAPHVTGLVPAARAKASALTS
SW"

gene complement(5520..6023) /locus-tag="RB7983"
CDS complement(5520..6023) /locus-tag="RB7983"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-transmembrane prediction"
/protein-id="CAD78405.1"
/db-xref="GI:32445674"
/translation="MEYTMNFGTNDFGAKDFRRG
VLVATATLLSVAFFGGEAFGQCRS
RGGGPPSGSPTSFTSGSQYAQNPYGQNALVSSYP
QNNLIASQYQQQLLASQRQIASSA
YRNTQQQRLALMQSEADVQAYRLARAESKRAARA
ERIAARLREQGRSEDAYTLASVNVDKSPPRR"
gene 6183..6290 /locus-tag="RB7984"
CDS 6183..6290 /locus-tag="RB7984"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78406.1"
/db-xref="GI:32445675"
/translation="MSDVSPFGCSLTGGKWIRCP
SPNLSVTLKSSCHEF"
gene 6280..7695 /gene="ansA"
/locus-tag="RB7985"
CDS 6280..7695 /gene="ansA"
/locus-tag="RB7985"
/EC-number="3.5.1.1"
/function="degradation of
amino-acids of the
aspartate-group"
/note="PMID: 1391778 best DB hits:
BLAST: swissprot:P30364;
ASPG-LUPAN L-ASPARAGINASE

swissprot:Q9ZSD6; ASPG-LUPLU
L-ASPARAGINASE (L-ASPARAGINE;
E=3e-62 swissprot:P50288;
ASPG-LUPAL L-ASPARAGINASE
(L-ASPARAGINE; E=5e-62 COG: ybiK;
COG1446 Asparaginase; E=2e-56
PFAM: PF02530; Porin subfamily;
E=0.047 PF01112; Asparaginase;
E=1.9e-57"
/codon-start=1
/transl-table=11
/product="L-asparaginase"
/protein-id="CAD78407.1"
/db-xref="GI:32445676"
/translation="MNFKTTIIjAAAVAFAFVAPR
TIQSKEPAMKTKWAIVIHGGAGSS 
PAQLGDASSKQRTKGLQHALQTGRDMLADGATAM 
DTVEAVIRTLEDDPIFNAGRGSW 
TNEGRVEMDS S VMDGKTLACGAVAGVTRVKNP I S 
LARRVMTETKHVLLVGPGADEFAE 
TQQVPLVDPKYFLSQRDGDDASSIASATQDEDES 
HLGTVGCWLDSHGNLAAGTSTGG 
TANKLPGRVGDS P I VGAGTYAANGLCAVSGTGVG 
EEYIRNSVAYDIAAQMRYANQSLE 
SAVTDIMLNRLDPGVGGLIAVSQQGEIVMQHNTP 
GMSCAAADSTGRFETHLILDNGGA 
PAEATTKSNPSASDESKIRELlQQQASDWNAGNI 
DAFMKVYWKSDQLTFSSGGDVTRG 
FDATLQRYKERYPTSEKMGKLTFTDLEFLPLGNS 
AMQVLGVWKLDRMEPMEGKFTLVF 
RRFPEGWKIVHDHTSKSPDSQ"
/locus-tag="RB7987"
/locus-tag="RB7987"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78408.1"
/db-xref="GI:32445677"
/translation="MEDRPRPHLKVPRLAIGVND
RRSSKQKGNSLIRFHPARWDSSV 
TDQCHFNSPS IVTSMVFCFGKGFPTRVGLLTLAW 
VMS I STNAFAIDFNRDVRP ILSEN 
CFHCHGPNAESREADLRLDTQTGSRQELSGSPAV 
VPNEPDQSGLIERIFSDDPDSLMP 
PPESKRKLSAHQKFILKSWIAEGGDYEQHWAFIA 
PTQPSVPEIDQAASSSHPVDRFIW 
NRLHERGLEPSKTADRRTLIRRVSLDLTGLPPTP 
EQVNSFVNESALNAYERLVDRLLA 
SEQFGERWARPWLDLARYADSNGFQADQLRDSWA 
YRDWVINALNDDMPFDQFTIEQLA 
GDLLPNATIDQKSQQAS I APCRAMSKPAYTQKKT 
G" 

/locus-tag="RB7989"
/locus-tag="RB7989"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78409.1"
/db-xref="GI:32445678"
/translation="MPCNVEAGVHPEENRVNQVI

DRVNATATAWLGMT I ECAQCHDHK 

YDPVSQKEYYKLFAFFNNTPMEVENPSGKGVSFD 

FWGPKMDLPLPPDQASNRSRLQAE 

I ASLEAKRNAVLEAGDFENWTQQLS I ALDNQPEF 

KTLSIASVSSTGDEEMKILDDQSV 

LFTGKLPDKTVYEIHCDEIPADVIGWKIEALTHP 

SLPGTGPGRGDAVRSNFILSEFEV 

EIVSKEGSPTQRLDLHSASADYSQPAWNVANAID 

GDLKTGWAIASKFQQSHWAQFLLQ 

TPLRQSEDSTLRFRLEQRYGRGRTIGRVRLSALC 

GNPLAINVPEQVAKAAKKPAKDRS 

PSDFATLESHFEKTNPKARALQRKIQKLTKQLES 

TQPATTLVMVEMEEPRETFVMQRG 

NYLSPGDRVSPGTPMALHQMNEDLPKNRLGLARW 

LVDPANPLTARVTVNRWWAQLFGQ 



SMKHVLKTIVMSETYQQDSKVRAD 

LIEADPANKWYARGPCFRMSAEMIRDNALAVSGL 

LSAKMHGPPIMPHQPTGIWRQVGR 

NEPKWI AATDQDRYRRGVYWWRRAAPYPS FVNF 

DGPDRSSCWGRPRTNTPLQALTL 

LNDPAYVEMALALANRILCEHPDGSDSDRLSAAF 

EIVLSRRPAPQESARLMTFLDERK 

QHLHSNPELAKSLVREQSFPNNTVYPSTEELGAW 

FYVANVLLNLDETITKD"
/locus-tag="RB7991"
/locus-tag="RB7991"
/note="PMID: 11743194"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide prediction"
/protein-id="CAD78410.1"
/db-xref="GI:32445679"
/translation="MKTYRSKQRINVSAAAAJyiSA

NQTRRQLFQRAGMGLGGFALTSLF 

QSEQNAAFGQEKIGSAVGTHHPARAKSVIYIHMV 

GAPSHLDLFDYKPVLQERSGQLCP 

DEFFNGKQLAFIREQPKLFGTPTDQAFQFQRCGE 

S GAQ I SNLMPNLQTVADELCF I KT 

LHTDQFNHAPAQMCLLSGFERFGRPSIGSWVSYG 

IGSKNKNLPDFVTLITGQVLGAGN 

SAWGSGFLPTVHQGIEFRSQGDPVLFLSNPEGMR 

PSDRRKWDAVKDLNALRLDDVGD 

PEISTRISQYEMAYRMQTSVPELMNIDNETADVH 

QMYGTEPGKTSFANNCLLARRLVE 

RGVRFVQLFDQGWDHHGNIENRLAAKTKEVDQPI 

AAL I KDLKQRGMLDDTLWWAAEF 

GRTPMAQASNGGGTPTKVGRDHHKEAFTIWMAGG 

GTKRGHTHGATDELGYGIVENGVH 

IHDLNATILHLLGIDHQRLTFRYQGRQFRLTDVH 

GNWNDIIS" 

/gene="arsA"
/locus-tag="RB7995"
/gene="arsA"
/locus-tag="RB7995"
/EC-number="3.1.6.8"
/function="sulfatase"
/note="PMID: 20164110 PMID:
7910580 PMID: 1572648 best DB
hits: BLAST: pir:E64903;
arylsulfatase homolog b1498 -
Escherichia coli; E=2e-39
swissprot:P77318; YDEN-ECOLI
PUTATIVE SULFATASE YDEN PRECURSOR;
E=2e-39 gb:AAG56270.1; AE005355-4
(AE005355) putative sulfatase;
E=2e-39 COG: ydeN; COG3119
Arylsulfatase A and related
enzymes; E=2e-40 PFAM: PF00884;
Sulfatase; E=1.1e-59"
/codon-start=1
/transl-table=11
/product="arylsulfatase homolog
b1498"
/protein-id="CAD78411.1"
/db-xref="GI:32445680"
/translation="MGFGFDQLGKFESHGWLLVC

EPKEWESLTLCSPRRSYHGVESV 

ATLVPESHNHPAPPILDTLRIRMAKTIFCAKKIC 

RTWMVLFVIGAGTS IQAEASDRP 

NVLLILTDDQGWGDLAAHRNPKISTPTLDALANE 

S ARLDRF YVS PVCAPTRAALLTGR 

YPERSGVAGVTGRREVMRAEETTLAELYRSAGYA 

TGCFGKWHNGAQMPLHPNGQGFNE 

FFGFCGGHFNLYDDALLERNGTPVQTKGYITDVL 

TDAAVE F I QNHHDRP FF C YVP FNA 

PHGPFQVRRDLFDRYNDGS IDEKTAAVYAMVQNI 

DTNVSRLLKCLSDHSLDEETIWF 



misc-feature 12582 125
gene 14399..15436
CDS 14399..15436
gene 15441..16514
CDS 15441..16514
gene
CDS
WTGNIQPQSISQVAAHIDLLPTLM 
QWCDIPLPTKVPLDGRSLVELIRDGADPTLADRS 
ILTYRPNPMQLQKFGKAAVRTNTH 
RLTIEKSKASLFDMTTDAGQTTDIASSHPELTKQ 
LRSQIQKYVQEITPSITAIRPVPI 
DSMRSVYLPAVDAKLEGGVGFADGISWAHSWADR 
WTTTRDRITWPIEVKESGRYEWI 
HYVCDSDS AE I VLKAGDQEATTTLPRFTMKS WR 
PDLDSKATPRRMLTFQQHSLGWD 
LAAGVSTIALQRTDQAGAMIELNGLTITRQLP"
/note="cosmid pircos-c2h02/ cosmid
pircos-d2g12 joining point"
/locus-tag="RB7998"
/locus-tag="RB7998"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78412.1"
/db-xref="GI:32445681"
/translation="MALEFPELIESEAQLDALLA
RPSDELVKFMQQLDGDLI ILGIAG 
KMGVSLGQLAVAAIEKAGLQKNVLGVARFSDADA 
RARLES AGI KT IQCDLLNREAVAN 
LPSVPNVLFMAGRKFGTEGDEPLTWAMNTIVPAN 
VAHHFRDSNIVAFSTGCVYPLARL 
DQPPNEDTAPGPIGEYAQSCLGRERMFEYGSLEW 
GTRVCLYRLNYS IDLKYGVLHDIA 
TKIWNDQPVDNSVQAFNVIWQGDANQQALMCLGQ 
CTSPANVLNVTGPETLLTEDVAMQ 
LGELLDKPVRFTTTPGNASYLSDSSRATKLFGAP 
SVNAEQLIRWQAHWVKTGGRSLNK 
PTHFEVSDGAY" 
/locus-tag="RB7999"
/locus-tag="RB7999"
/note="best DB hits: PFAM:
PF00573; Ribosomal protein L4/L1
family; E=0.0042"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78413.1"
/db-xref="GI:32445682"
/translation="MQISDLSPTIRENVRRGWI
PAQPLALDADRQFDSRYQTALTRY 
YIDAGAGGIAVGVHSTQFAIRDPSIALYEPVLRL 
ASDVIDEYAGAQGRELFKVAGVCG 
KTKQAI CEAEFAVSQRYHACLLSLAALADES IDE 
LISHCREVAQVMPVIGFYLQPAVG 
GCVLPYEFWREFAEIENVIAIKLAPFNRYQTLDV 
VRAVCDADRDDAITLYTGNDDNI I 
ADLLTPYHIVTDHGEKRVRIRGGLLGHWCVWTRT 
AVELLDEIHAVLDRGGDIPSELLS 
RNIQVTDCNAAFFDAANGFAGCIPGIHEVLRRQG 
LLAGTWCLDPNEVLSSGQSKEIDR 
VIAAYPHLHDNAFVRENLSRWLR"
complement(15849..16541) /locus-tag="RB8001"
complement(15849..16541) /locus-tag="RB8001"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78414.1"
/db-xref="GI:32445683"
/translation="MLFCLWWACSTQPPAQIFAY

ECWMEVRVGGDHTIDFFGLTAGK 

DFVWIEAPCSGQQALATKHFVDSRDATSKTVGRI 

EESRIAIRDLDVAAEQFRRNVATA 

IQYGMNFIEQFNGRSSPHAPVPEQTAANSDPFLT 

WRDDVIGSQQVRDDWIVAGVKS 

DCIVAVGVANRTNHIQSLVAVEGSQFDCDDVLNL 

SELPPEFIRQHTTTDGRLQVEANH 

RHDLGNFSAMRN" 



CDS 



16579..18036
18356..19558
18356..19558
19574..19861
19574..19861

/locus-tag="RB8002"
/gene="GALNS"
/locus-tag="RB8002"
/function="sulfatase"
/note="PMID: 8020961 PMID: 1755850
PMID: 8001980 PMID: 1522213 best
DB hits: BLAST: gb:AAF49287.1;
(AE003522) CG7402 gene product
[Drosophila; E=2e-38
gb:AAF49302.1; (AE003522) CG5584
gene product [Drosophila; E=2e-37
ddbj:BAA04535.1; (D17629)
N-acetylgalactosamine 6-sulfate;
E=2e-35 COG: PA0183; COG3119
Arylsulfatase A and related
enzymes; E=2e-31 PFAM: PF00884;
Sulfatase; E=2.4e-44"
/codon-start=1
/transl-table=11
/product="N-acetylgalactosamine
6-sulfate sulfatase (GALNS)"
/protein-id="CAD78415.1"
/db-xref="GI:32445684"
/translation="MQPIRSWTNHLLMTLVMLLV
ATAASAADRPNILLIVSDDQGYND 
LGQLGNGI ITPALDRLAKEGTRLTNFYVAWPACT 
PSRASLLTGRYPQRNGIYDMIRNE 
APDYGHRYTPEEYAVTFERIGGMDEREVILPAVL 
RPAGYKSGI FGKWDLGALQRMLPT 
SRGFDDFYGFVNTGIDYFTHERYGVPCMVRNLEP 
TEADKGTYCTYLFQREALRFLDEH 
AGNEPFFLYVPFNAPHNSSSLVPTIRSSVQAPDQ 
FKAMYPPVEVETRVTDRYRYGSPA 
TVATPQARRRD YRAAVTCMDAAI GE I LDRLEAKQ 
MLDETIWFFSDNGGSGGADNSPL 
RGHKAQTWEGGIRVPCLVRWPAGQIPAGWNDEF 
LTSLELLPSFAAAAGVEPPPGWL 
DGFDWWPTLRGEAES PRKEMFWKRRDQIGARVGQ 
WKWVQMGEQGGLFNLEED I AEKND 
QSQAKPEILAMVKERYANWLREMEAADPRGPFRD 
F" 

/locus-tag="RB8008"
/locus-tag="RB8008"
/function="type II protein
secretion system"
/note="PMID: hofG PMID: hopG best
DB hits: PFAM: PF00114; Pilin
(bacterial filament); E=0.038"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-putative general (type II)
secretion pathway protein G"
/protein-id="CAD78416.1"
/db-xref="GI:32445685"
/translation="MKKMRRNGFTLVELLWIAI
IGVLVGLLLPAVQAAREAARRMSC 
SNNFKQIGLALHNYHSAYNKLPAQKGGTNNANDG 
SLGGNQLRLGWLPTILPFIEQQGL 
WEQISHPLAINTDGTLRTTPWPGMGPAPWRTDYT 
PWITELAALRCPSDPGTGLPALGR 
TNYAACQGDAVEYNEVGAWSGSNPLVMNSGYAEQ 
IRISGRGMFVARQYTGFRDVLDGL 
SNTI AAGE lATDLGDRD IRTAP IDGPGS AVLRDN 
PSWARDNDKIDPERPQFWRAGENL 
TTDNAAGDGARASWARGFHWADGEMQHSGMNTIL 
SPNSETVSRTTADSTWGMYPPSSR 
HQGGVHVLMGDGAVKFITDS lEAGNSRAHTVYKD 
HNPGNAGTQSPYGLWGALGTKANKETVSFDSL"
/locus-tag="RB8010"
/locus-tag="RB8010"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and

/protein-id="CAD78417.1"
/db-xref="GI:32445686"
/translation="MLRQMRIRSWAISQQRFLP
FRFLHASLHSICNQMTNTKHSNRL
ITFLSLMCLSLLCAGCDNNESTVIQQPADAAQEM
EDYESMMNSSAKEAYGK"
gene complement(20114..21298) /locus-tag="RB8011"
CDS complement(20114..21298) /locus-tag="RB8011"



/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78418.1"
/db-xref="GI:32445687"
/translation="MPTSSMTRRLRTSANVASTL
TLSCWLGSLTNSLPVCADETASV
AQSAVKMDESEIGRVSVQKIAVEDQTDQLAKQAI
AELGLDEANVIQQVSATSPVKSFS
TRDALAPLMDAKGTAGNAEDHRLHPPIDDHPLGW
ALGFAKAHADHIRQNVSDYSCKLI
KRERIDGELQTPQIMDLAVRVEQSNEQGDVSPLS
VFLQYQSPRTLRDRRVLYIEGENE
GKARVRKGGGALSYLVLSIDPHGRQAKEQSNYPI
TDIGFDKIMSRLIGLIEFDMENDP
TGDNTEVTYFRNARVMGRPATHIQIVHPSKEGGV
TFHRANAYIDDELHVPIRLEVYDW
PAAEGDSPELKEEYTYADLKLNVGLANATFASTR
LKGKLDGPVGLTQSDVKASPNAVAKD"
gene complement(21280..21669) /locus-tag="RB8012"
CDS complement(21280..21669) /locus-tag="RB8012"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78419.1"
/db-xref="GI:32445688"
/translation="MRFAIASQNAIRRCQIESSL
SLTGWDCPASDVGSWKQTSAPV
LWGETVGRLCGFFRWSRRPLFPSRADCSFSWSP
WGVANLKLGFLQAIGTFRAQTDFA
AWAERAPPETNGNSTAIWSHLLCRPHP"
gene complement(21620..21802) /locus-tag="RB8014"
CDS complement(21620..21802) /locus-tag="RB8014"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78420.1"
/db-xref="GI:32445689"
/translation="MPLTKERGSSFHGLITTPKD
DIAEVTITSGGGSFSCPSLEWEFH
VAFRDRVSECNPAMSN"
gene 21805..23103 /gene="ncx"
/locus-tag="RB8016"
CDS 21805..23103 /gene="ncx"
/locus-tag="RB8016"
/function="other cation
transporters (Na(+), K(+), Ca(2+),
NH4(+), etc.); ionic homeostasis;
ion transporters"
/note="PMID: 11294880 best DB
hits: BLAST: gb:AAG12988.1;
AF169257-1 (AF169257)
sodiumcalcium exchanger; E=4e-08
ref:XP-009649.1; sodium calcium
exchanger [Homo sapiens]; E=4e-08
gb:AAG60049.1; AF314821-1
(AF314821) K+-dependent NaCa
exchanger; E=4e-08 COG: MTH1155;
COG0530 Ca2+/Na+ antiporter;
E=0.002 PFAM: PF01699;
Sodium/calcium exchanger protei;

/codon-start=1
/transl-table=11
/product="probable sodium/calcium
exchanger antiporter"
/protein-id="CAD78421.1"
/db-xref="GI:32445690"
/translation="MGIVIPLVLIFLTCLVIWRA

CDGFEIASEYIGRNLSEGVRGGTI 

NAISSSIPELFTTLIALFVLSDRDGFSIGIGTTA 

GSALFNGMIIPAVCILSWGFWM 

GVRVTSVNVSTRVLLRDGISLILCEFILILLING 

EQLHWWQGLILMLMYATYLVYMLT 

TMKSSELSANDEQPDNDGEEEDESPRGPLASLFY 

WISGGPLLDLERWFVKDHHREQMK 

EETWNGWGLLLTSTGVIAMACWTLVLACEWLGSG 

PANVENPSYELFGQTFEGLGMPAM 

FVAVI FAS MATS VPDTVMS IRDARDGDYDDAVAN 

ALGSNI FD I CFALGFPLFLYTLIN 

GPIAMEPEIAAQSGELRLFLFILTIIGFLIYYIG 

KRGVAADGTKYVEMKRGKAFALLT 

MYGLFVIYILAHENDVEWHQISDHLQTFLKMLP 

AIGAIV" 

/locus-tag="RB8018"
/locus-tag="RB8018"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-transmembrane prediction"
/protein-id="CAD78422.1"
/db-xref="GI:32445691"
/translation="MLPWNRGDVTKWITYRKSFE
LTTWKPR P F F F ATAL I VAAQ II AT 
EPIFAAVPADVLASLTTHCIDCHGPESAEANLRI 
DQLTSDLADRENALKWIEIRNAIN 
LGEMPPDGEPPLPVD I I SQTSEWVSGELREVDRL 
RTRADKQVRLRRLNRHEYTHTVSD 
LLSMKFPTGESPLDTLPPDGTAEGFDKVSSALMI 
DPSLMTQYYRVARRIADRAIVDGP 
PEYPTETMRLEFEEIADSHAIGYLVTRRGMNPVP 
GGLQLIEGSTRSFGMLRYPGRKDN 
NVAPINGFYRFTIRAGGSPGADGEVPRIRLKHNH 
PDDSMQT I ME YDVTAPWD S PAETS 
VT I PRDTLGGELNI E I LNDTKLSMGQRPGENFMR 
RINEVGEAGDFTESLRLAGRKVAE 
GWGGDRSTPDPEKLDLTKFPRVFLDHLEVEGPLY 
DQWPPKSHTTVLFRGEAGTDVLAY 
AREILLRLLPTAWRRPVEESEIEPILGVIQSELA 
NGETFHEAIRVGLAATLTS PNFL Y 
LAEGTGNAADTSVLTNHEIASRMSYFLWSSMPDQ 
ELFRAARDGELTDPNHRKQQVERM 
LADPKMDRFVGSFARQWLQTDTFLTFTPDRSLYR 
EYDEKLSEAIVREPLEFFHEILRS 
DRSVLNFLDSDFWINDRLARHYGIDGVSGEEFR 
WDLPSGSVRGGLLAMAGVHQAGS 
DGTRTKPVSRAVYVREVLFNNPPDPPPPNAGEVE 
PN I QGQRLTVRERLLQHQQ I EACA 
ACHRSLDPYGLALENFNVIGQWREQQDGENFRGR 
NLPEIDASGRLPNGNEFTGFEDFR 
GLLLQQSDRFRRALAEKVFVYAFGRPVGPEDDAP 
LTRWTDMKAEGDTFRALIQSIVTSQAFVSR"
24233 /locus-tag="RB8019"
24233 /locus-tag="RB8019"
/function="biosynthesis of
polyketides"
/note="PMID: 98295987 best DB
hits: BLAST: pir:A70985; probable
polyketide synthase -
Mycobacterium; E=0.69"
/codon-start=1
/transl-table=11
/product="similar to polyketide
synthase"
/protein-id="CAD78423.1"
27220..28278
27220..28278
28305..29003
28305..29003

/translation="MWFQTNSRHFTVGTRGTSR
TNGES lETIDWCNVWIiATRVTKH 
PKRSRAAFDQLQSTGDRIHAPTCDEISDRMTVSD 
FLKLQPHCFRRVFRWTVHNRTVRD 
SASDSWLSHQRRVDHQCRADFVETFCGSIGWQC 
I EWTFAGWELHRQQVGNRVRVLMA 
VQSTQTNLFVRSSSQSIHFSQLA" 
/locus-tag="RB8022"
/locus-tag="RB8022"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide prediction"
/protein-id="CAD78424.1"
/db-xref="GI:32445693"
/translation="MSKPSYKPRLKTLNRRMLLR
GASTCLALPWLEAMLPRSAAAAES 
TKWPRMGMFYFGTGMNMREFYPEGYGKDAKMSR 
ILKPLEEQRDQFTVLSGTSLSHGG 
GHDGAYPFATSIARGEKQTISPDQLAAEVQGKHT 
RFPSLQLSVKRGTGFGSQALATIS 
WNRQGIPLAAENDPQTIFNHLFRPPNSQQLLQQS 
DEFRQRGSVLDAVLSDAKRLQTRL 
SQSDRQQLDQYFHSIREVETTLGREMDWSDRPKP 
KPNLDGFGDYEKAIAPEGNGKFVY 
DTYAKLMYDLIALAFQTDSTRWSYWRTELAGG 
VYPEFGVSKGYHELTHHGNDPKNL 
EELARVDTIYMQHWSHFLNRLASIREGEGTLLDN 
TLLGFSSGMGIGHSKDVLPTMISG 
GSGLGIRHQTHLQLKDNTPLSSLWQTMVERMGVN 
VDGEFQDSTGTIGELLA" 
/locus-tag="RB8026"
/locus-tag="RB8026"
/note="best DB hits: BLAST:
pir:E82768; conserved hypothetical
protein XF0752 [imported] -;
E=4e-30 pir:A82381; conserved
hypothetical protein VCA1077
[imported] -; E=1e-26
gb:AAF47337.1; (AE003467) CG7049
gene product [Drosophila; E=5e-19
COG: XF0752-2; COG1262
Uncharacterized BCR; E=9e-30"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAD78425.1"
/db-xref="GI:32445694"
/translation="MIRLCIKRLHAALVIiALTLT

VPIQAQEPARQSNPNSIGMQFVAI 

APGEYRRGFDGSGQRDRQFAIiAHTFSNPQAFRFD 

SPSHLWITKPFELQTTEVTVAQF 

REFVEATGYQTEAESNGGALGCFPKEKDYVDRFR 

KSPDVTWRTPGFEQSDDHPWAVS 

WNDAVKFCEWLSEQETTTYRLPTEAEWEYACRAG 

QSTWYSWGSDPDEAYVHANVADGA 

LEAAQPNTTRYQRAVKLEADDGDGWFTAAVASF 

RPNPWGLHDMHGNVWEWCQDRWSA 

DLYRRYLDDVPWPKRSQFTMTDPVMLEETPQHAY 

GDWRCIRGGAWTCAPASVRCSIRT 

FAEASDATIYTGFRWRE"
/locus-tag="RB8027"
/locus-tag="RB8027"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78426.1"
/db-xref="GI:32445695"
/translation="MISTWCCHVC]^AMQQLSSMS

EPQNVSHWIDQVKDGDS I AANQI W 

QHYYDRLVRSVRNKLFGQNRAVSDEEDIVLSVFD 

SFYSAAQKGRFPDLSDRDDLWRLL 

LKMSARKWDKKRRDHRQRRGGDVQLHSLDKRED 

NQTFIEAIGDEPSPEMILMMEESV 



SERTIERRLHLIREKCHQELFDEH 
SSEETTDRDLGTNR"
28867..30702 /gene="pknB"
/locus-tag="RB8028"
28867..30702 /gene="pknB"
/locus-tag="RB8028"
/EC-number="2.7.1.-"
/note="PMID: 99426839 best DB
hits: BLAST: gb:AAF80365.1;
AF159160-1 (AF159160)
serinethreonine kinase Pkn10;
E=6e-50 swissprot:P54744;
PKNB-MYCLE PUTATIVE
SERINETHREONINE-PROTEIN; E=3e-46
swissprot:P71584; PKNB-MYCTU
PUTATIVE SERINETHREONINE-PROTEIN;
E=7e-44 COG: Rv0014c-1; COG0515
Serine/threonine protein kinases;
E=7e-45 PFAM: PF00069; Protein
kinase domain; E=1.6e-64"
/codon-start=1
/transl-table=11
/product="serine/threonine kinase
Pkn10"
/protein-id="CAD78427.1"
/db-xref="GI:32445696"
/translation="MPHDLDAPNGRLNWCISSV
KNVTRNCSMSIRQKKLPIATLERI 
DDLCAEFERKWQSDQS PS I ESALTEDI S PDERDV 
LLAELIVLDVDYRRRRSESPNEQD 
YLDRFPDNVSVIRDAFRGNDQPKRAFLPPSVEQL 
SEQFPSLQITELLGAGGMGAVYKA 
RQEGLDRWALKILPEEFGHDVKFALRFTREART 
LAKLNHPNIVSVYEFGHVDDTYYF 
LMEYVDGSTLRDWAAGQLAPAHALAIVPHLCDA 
LQ YAHDNGVI HRD I KPENI LMAVD 
GSVKIADFGLSRILGDQDQPSALTGTHQIMGTPR 
YMAPEQLEGARGVDHRADIYSLGV 
VFYEMLTGELPIGRFAVPSKKVQIDVRLDDWLR 
TLEKEPQRRYQRASQIKSDVQSIA 
SGDRAALAPTQVMPSDPQRDHAHSSSGAHDQELA 
GRLLLTRRQLMERVESSLRPLFAG 
QVLQI FVGVALIALGAYCWAPNTHVPHRLFSGVI 
VHVYGLLLI IVAANVLVRIKRVDT 
SQPINEIRERLNAVRSFSLRVGPIIGIAWWLIWI 
PVWAAGFDSILHPNSLYPSLILG 
VIGLVGSCGLYAMAMKSEPWRQKLGGRSVTNAER 
EIiAEIEQTNIR" 
complement(30888..31625) /locus-tag="RB8029"
complement(30888..31625) /locus-tag="RB8029"
/EC-number="1.-.-.-"
/note="PMID: 8290650 best DB hits:
BLAST: embl:CAB82049.1; (AL161803)
putative short chain
dehydrogenase; E=3e-66 pir:B83462;
probable short-chain dehydrogenase
PA1470 [imported] -; E=4e-58
embl:CAB56135.1; (AL117669)
putative short chain
oxidoreductase; E=6e-57 COG:
PA1470; COG1028 Dehydrogenases
with different specificities
(related; E=4e-59 BS-fabG; COG1028
Dehydrogenases with different
specificities; E=9e-36 DR1943;
COG1028 Dehydrogenases with
different specificities (related;
E=1e-33 PFAM: PF00106; short chain
dehydrogenase; E=1.2e-65 PF00678;
Short chain dehydrogenase/reduct;
E=5.8e-10"
/codon-start=1
/transl-table=11



gene 
CDS 



31674..33116
31674..33116



gene 
CDS 



dehydrogenase"
/protein-id="CAD78428.1"
/db-xref="GI:32445697"
/translation="MNNNNRVAIVTGGSRGIGAA
lAERLASDGFSWINYANSSKAAD 
ELTQRITEAGGSAQSFQADVSDSDAVEKLFDFAT 
ES FGGVDVLVNNAGVLKMQPLAET 
SDEDFARLVDVNLKGCFHTMREAACRLRDGGRVI 
NLSSSVIGLRMPNYGVYSATKAAV 
EAMS S VLANELRGRQ I TVNS VAPGPTATKLFLDD 
KSDELIDRLTNMSPLERLGQPDDI 
ASWSFLAGPDGSWVNGQTLRANGGW"
/gene="GALNS"
/locus-tag="RB8030"
/gene="GALNS"
/locus-tag="RB8030"
/EC-number="3.1.6.4"
/function="sulfatase"
/note="PMID: 1755850 PMID: 8001980
PMID: 1522213 PMID: 7668283 PMID:
7795586 best DB hits: BLAST:
ddbj:BAA04535.1; (D17629)
N-acetylgalactosamine 6-sulfate;
E=4e-24 gb:AAF63858.1; (AF112242)
N-acetylgalactosamine-6-sulfate;
E=4e-23 gb:AAF58475.1; (AE003821)
CG8646 gene product [Drosophila;
E=1e-22 COG: ydeN; COG3119
Arylsulfatase A and related
enzymes; E=1e-14 PFAM: PF00884;
Sulfatase; E=9.4e-37"
/codon-start=1
/transl-table=11
/product="N-acetylgalactosamine
6-sulfate sulfatase (GALNS)"
/protein-id="CAD78429.1"
/db-xref="GI:32445698"
/translation="MTLHPKSTPALYTISTNPPP
FSWHAFLLVNLFMSRVLPLLSTLL 
LTYFLPFASSNATEPSGVTTHPNILMIYADDLGY 
EALQCYDGLDFATPKLDAMAAEGV 
RFDRAYASPVCTPSRVSLLTGLYPFRHSHLGVLP 
VHKGTRQKVDFGKMPTFPQLMREG 
GYTTSVTGKWQLATLEVWPNHIRNAGFDSWCVWQ 
IWRDGEKTSRHWNPTFNEDGHIRE 
DIQQRFGPDVLVDHVIDQMAEAQASQQPFLIIHN 
ELLPHDPIVETPEDRRLKRQPRLA 
HMIEYMDQLVGRLLDAVERLGIRDNTYWFMGDN 
GTHEVDFDNPKFGQPGEQKHTRHT 
RAGNVNGGKFDLNDAGTHVPLIVWGPPSVPAGQV 
CDDLVDWDLFPTYCELTGTTIPE 
SLRVDGRSIAPQIHGQTGKHRPWVHHAISGKGDS 
LFDGSWRLSSQKDQLWDARELPAE 
SPAAANDEHAESARVRLEKVFRTLHQNASQ" 
complement(33183..34373) /gene="ybhE"
/locus-tag="RB8031"
complement(33183..34373) /gene="ybhE"
/locus-tag="RB8031"
/function="ortho-cleavage"
/note="best DB hits: BLAST:
swissprot:O34499; YKGB-BACSU
HYPOTHETICAL 38.4 KDA PROTEIN IN;
E=1e-43 embl:CAB66198.1;
(AL136502) hypothetical protein
SCF43.09.; E=1e-31
embl:CAC13069.1; (AL445503)
putative secreted protein; E=6e-31
COG: BS-ykgB; COG2706
3-carboxymuconate cyclase;
E=1e-44"
/codon-start=1
/transl-table=11
/product="conserved hypothetical

cyclase"
/protein-id="CAD78430.1"
/db-xref="GI:32445699"
/translation="MSSELSGDQQTTTRAYFGMK

FGLMAMWLAVLQGS PVMAETLNV 

WFGTTTPRGGASEGIYHATFDTESGKLSPATLAA 

KAQQPGFLAMHPNKPVLYASSGNG 

VTAYRVQADSTGAKLVEMGSVESGDGGVAHLATD 

RTGSVLLSAQYGGGSTTLYELAED 

GSVKRRVQVYEHDALLS PAGSGWGNRQDAPHAH 

WVGTSPDNRFAFTPDLGMDKWIW 

KLDPNGPSLTHHGFGEGIPGGGPRHMKFSPDGKR 

lYLLNELALSVTVFDYEAKAGTMT 

AGQTVPALSEEIKAKERFNSASEIRVHPSGKFVY 

SANRGHDS ISVFRIDEAGQMELVE 

VEAIRGGWPRNFNLDPSGRWLIAAGRDSHTATVF 

EIDADTGELTFTRQTQQVPTPICVLFSK"
/locus-tag="RB8034"
/locus-tag="RB8034"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-transmembrane prediction"
/protein-id="CAD78431.1"
/db-xref="GI:32445700"
/translation="MLVAAQFAAHLRLSDHGCDS

WLSGEKNSLKGASYPSRRGGRERK 

VAQGHRATGARARSFSSRDIADRRWFDLGCSFFE 

PFLES AVLS I S PTYLLYYLPL 1 1 A 

ISLVFGATRHEDLSLILRHAFHTARWITGFMAW 

FALVLFLDWMV" 

/gene="trkA"
/locus-tag="RB8035"
/gene="trkA"
/locus-tag="RB8035"
/function="other cation
transporters (Na(+), K(+), Ca(2+),
NH4(+), etc.)"
/note="PMID: 8268235 PMID: 8412700
best DB hits: BLAST:
gb:AAB90401.1; (AE001046) TRK
potassium uptake system protein;
E=4e-41 pir:T45222; heat shock
protein trkA [imported] -
Methanosarcina; E=2e-38
pir:E83643; potassium uptake
protein TrkA PA0016 [imported] -;
E=3e-38 COG: AF0838; COG0569 K+
transport systems, NAD-binding
component; E=4e-42 PFAM: PF02254;
KTN NAD-binding domain; E=1.1e-15
PF02080; Potassium channel;
E=2.9e-12 PF02254; KTN NAD-binding
domain; E=3.1e-08"
/codon-start=1
/transl-table=11
/product="TRK potassium uptake
system protein (trkA-2)"
/protein-id="CAD78432.1"
/db-xref="GI:32445701"
/translation="MRILTLGGGTVGRWIADMLC

RRRHSVTLIDSDPEIVRSINSELD 

VRAIEGNASQSTVLFSADVLSADLCLAVTGDDEV 

NIVAASMAKALGARRAIARVYAPA 

FRDLSTFDYQRHFQIDSLLSLEQLSASELiARAIR 

NPDAIPLEHFARGQLQVYEMDVAA 

GSAAAGKKLMELQLPPSVRVGSLAREGRMWIASG 

ADETRAGDRVSLVGMPDAVSVARQ 

I FGGEKRRRKQSKVMIAGGGETGYHLAGLLGRED 

FRTVLLEQDAKRCEQLAKLLPDVT 

WHANANRRSVLEDEGGGTVDYFVGCTGNDENNI 

MAGVEARELGASRVMCWGRPDYA 

NWGKLGIDLAVSERDWARQILGFLNEGAIISQ 

SKLPNGSIGVYELEILEGSAVTQA 

SLANLPIAGRCLIAAIQRDGFVRVPTASDVLEVN 



gene 35997..37454 /gene="serB"
/locus-tag="RB8037"
CDS 35997..37454 /gene="serB"
/locus-tag="RB8037"
/EC-number="3.1.3.3"
/function="assimilatory reduction
of sulfur and biosynthesis of the
serine family"
/note="PMID: 2997734 best DB hits:
BLAST: pir:C83025; probable
phosphoserine phosphatase PA4960
[imported] -; E=1e-108 pir:E70860;
probable serB2 protein -
Mycobacterium tuberculosis;
E=1e-60 pir:T36772; probable
phosphoserine phosphatase -
Streptomyces; E=1e-57 COG:
PA4960-2; COG0560 Phosphoserine
phosphatase; E=5e-70 PA4960-1;
COG2716 ACT domain-containing
protein; E=3e-31 jhp0597; COG0560
Phosphoserine phosphatase; E=6e-26
PFAM: PF01842; ACT domain; E=0.052
PF00702; haloacid
dehalogenase-like hy; E=3.6e-26"
/codon-start=1
/transl-table=11
/product="probable phosphoserine
phosphatase"
/protein-id="CAD78433.1"
/db-xref="GI:32445702"
/translation="MPDRRDTTRRICSRPNRQRR

AGSQRHHCRLDRFIRCRRSPFTLS 

QRLIVPVTTNAPKPTPNAWNDSILPSTLGNSPI 

APSPPTIVLLRFTGDDCPGLTASI 

SERLHRFNCRVIDVNQAVIHRSLLLGMLVQIPGS 

EDPERLIRKMRRKSRKLGLKCKAK 

IVCDPDYDAWVDRQGKSRFILTLLSRSVTAEQFA 

AVSRLVSDQGLNIDVITRLSGRPK 

REPGDELTRACVEFSLRGDPTDVNALKASLLELS 

NRLNLDLAWQRDDAFRRNRRIVAL 

DMDSTLLQAEVIDELAKEAGAGEKVSAITEAAMR 

GEIDFDESLRQRVQSLEGLPESVL 

PKVAERLQLTEGAERLLSNLRRFGYTTAILSGGF 

TYFGEHLQKLLGIDHVHANQLEIV 

DGKLTGRVLGPIVNAERKALLLEQLAANEGVDRK 

QMIAIGDGANDLPMLSRAGLGIAF 

HAKPIVRESAEHQMSTLGLDAVLYLLGVRDRDLV 

E" 

gene 37294..41439 /gene="OPLAH"
/locus-tag="RB8038"
CDS 37294..41439 /gene="OPLAH"
/locus-tag="RB8038"
/EC-number="3.5.2.9"
/function="amino-acid metabolism"
/note="PMID: 8943290 best DB hits:
BLAST: pir:E70627; hypothetical
protein Rv0266c - Mycobacterium
tuberculosis; E=0.0 pir:S77037;
hypothetical protein slr0697 -
Synechocystis sp. (strain; E=0.0
swissprot:P97608; OPLA-RAT
5-OXOPROLINASE
(5-OXO-L-PROLINASE); E=1e-153 COG:
Rv0266c-1; COG0145
N-methylhydaintoinase A; E=2e-99
slr0697-2; COG0146
N-methylhydaintoinase B; E=9e-85
YKL215C-1; COG0145
N-methylhydaintoinase A; E=6e-73
PFAM: PF01968;
Hydantoinase/oxoprolinase;
E=0.00018 PF02538; Hydantoinase
B/oxoprolinase; E=1.7e-167"
/codon-start=1



gene 
CDS 



complement(41446..46233)



/product="5-oxoprolinase"
/protein-id="CAD78434.1"
/db-xref="GI:32445703"
/translation="MVRTICPCSPAPVSGLPFTP

NRSCWPLNIRCPHWGWTPFSICS 

ASETAIWLSDSMDAEQIEVWVDVGGTFTDCLVQQ 

TDSNGHRQREAIKVLSSGLVSVRV 

LDHSDDFTQVEVELPPAFAIDAFWETAKLREQAE 

PT^APQSFTVLNCQRKSSGKVQLQ 

LNEPATQLRRDQHWVLDAEIESPILATRILLSCP 

LPHSLPPLSVRMGTTRGTNALLTR 

NGAPTALLITEGFADLLSIGTQDRPNLFALDIVK 

PQPLTSQVLEVRGRLNASGEELIP 

VDEQHLRSCLLQLRANSSEQAVLAICLLHAHRND 

AHEQIVQRIAREVGFNDVICSTEV 

ASLPRIVPRAETTTLDAYLLPILESYVARVWEQF 

GGETTCHLRWMTSGGNLVASAGFR 

GKDSVLSGPAGGWALGEVAKQCGVSGAVGLDMG 

GTSTDVSRFEGAV6RRQESMIGGI 

RVLTPMMDIHTIASGGGSICAIRDGRLTVGPDSA

GAFPGPACYGHGGPLTITDVNVLL 

GRLPIDRFPFPLDLEAAQTVLRQTHQQLPTDSTL 

SPEQLADGFLQLAITQMAEAVRW 

TTAAGSDARQMALVGFGGAAGGHLCQIADSLEMT 

HVIDHPDSGLLSAVGMGAASIGRI 

ATESVQITVAPAIPETADTTASRTFLSGQIPALQ

SAAKRTLAQCEQQLRQEENIPRDT 

VLHWHQCDARPLGTQSTLPIDLFPLETLPNRFD 

AKHQSTFGYARTSMPIEIVTVRCE 

ATIERTSNANELARDKSASSTKTIRTPAKADVIS 

MWVRGAFRKVDLWDRDSILAGQVI

SGPSWAGPYSVLWEPGWMATMSSDGILEIRKE

PACSANTNSSALSTEQLDDTIAME

IAARRVQGIAEAMGEVIRRTSVSVNVKERRDYSC

AVFLGDGSLVANAPHVPVHLGAMS

HTVRSLIADYPKMSPGDSYISNDPYAGGSHLPDI

TWTPVFCDDDTPLSSRPNHWPCD 

FFVASRCHHAEIGGMVPGSMAPAATCLAEEGWL 

HNECLVRNGQSHHAHIRKLLSSAT 

YPSRNVDENMADIAAAEAAGQAGANRIRTLADSM 

TKTRLTELLKQLLHVAGEATAAWI 

ETLGAAPRNFEDALDDGTPIQVELQPDQTTGRLR 

IDFTGTGPVHDNGFNATPSIVTAA

VLYVLRCWPGELPLCDGVLKRIELIIPNGLLSP 

LAGSTPETSPAWAGNVETSNRW 

DVLLGALGVAAASQGTMNNLLLGDSTFGYYETIG

GGAGATAYASGADAVHTHMTNTRI 

TDPEILESRLPVRLWRFAIRRGSGGLGLHRGGDG 

MIREMEFLRPLTLSLITSRRTTRA 

YGMNGGEPGQPGRQTLVHQSQRTELPFATSQEVD 

AGDRLIMETPGGGGYGKELSTATSELP"

/locus-tag="RB8039"



complement(41446..46233) /locus-tag="RB8039"



/note="best DB hits: PFAM:
PF01011; PQQ enzyme repeat; E=0.48
PF00515; TPR Domain; E=0.0026
PF01011; PQQ enzyme repeat;
E=0.16"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-putative secreted
oxidoreductase"
/protein-id="CAD78435.1"
/db-xref="GI:32445704"
/translation="MTRLTIAAVTVAAMFFAMSL

QSKNAQAQGLFDSRMQMGDAEFLP 

PPRSMRQVIRDAEEAIDEKRYSDAWHLGNLLLR 

SGDNPSSEMMQDYYYPIARADDDD

QTDRPLSDQRGMLDRDGQLIEDETDQAEALAKLR

SQLPASLLEHVRDLIGELPNQGLE

IYQLRYGPLAEKLLEEAGPKRDWKTVERVRRETF

HTEAGYKASVLLAVREWTLGNPLA 



misc-feature 41689 416 

gene complement(46428..47630)

CDS complement(46428..47630)



RMIPSPLKYPQSFTSMFPNGEADE 

VAWKEYIRAQSESDLVTAEALAENYPLLGGSADR 

NSSGSGQMPLSNPRWMVESSTSPR 

QERMIQEKTQQLKATGQLPPPSWTPLQVGDQLLM 

RTTQRLLGVDFRTGKRVWQYPWFA 

TAETALSPELTDPMEEEEDPVDILVQRVWNDVPF 

GQISSDGRRVFLIDDLSKIEIERF 

SPFASRGMSSLANTLVALDVKTEGKILWRIGKSE 

STPSELSEAFFLGPPLPFRGDLYV 

IAEIAGEWLLCLDPADGSVQWQQVLVAVESGGI

GSDPVRRVAGAMPTYHDGLLICPT 

GAGATVAINLVDRTIRWGNQIPRSQEFVHSIYRP 

NTRVSANQLSRRWTHSLAIAKGTS 

WLTPIECDRLQVLDVLTGRNRFTPKNRIRLAYV 

AGVKDDLFFWGADEVSACSLNDG 

SEVWSTPPTLVRPGQRISGRGVFGTDRYYLPTTT 

DELIEISLQNGEVLQRRTVRFPLG 

NLIAIDGELISQSATMVAVAYGERSLQPRVDALL 

QQDPDNFFALVRKAELLIEQEQRP 

RALELLRKARDLDPDNDEVIMLSVEAMLGSLRQN 

PSDSSVDLEALMKLIDQPEQRAEL 

LVLQIKGSVDQSADSIDHPKALELLLDLSELVLR 

HPTLNRDMDAVFPNAARQFTVDDW 

ISARVAQLVSTASEDELVEMTGKVESLVSSRGNM 

DGAALMRTLQHFSPWAATNGLRR

DMAARALLMEDGLLAERLIWADKMTTDDALQSLS 

NERLLMLAKLYSKEAWNLDRLAVG 

ELLGQRLSDSEPVGADTFIEDALQANPIRGREYD 

MGEQWPRDVDLEWQSMRMPRSSGM 

MMlTORRYAKSTHLMGRQLRGWNAMSDlSnyiNSI^ 

NPYGIHRSIPLEGFSGMRSLGNEL

TYSGGLLLLLTQSELIAIDLFRTLGANGEDAIRW

RRTLGTDGQPVAKRRSAATPFGNQ

IYRNMMVSATADNGEAQLILGPVLGDRLFLLQGT

ELICLDAVSGEERWRTPTEAVGNA

WHAEGRVLWSKSQNQVESYDIHDGGLLGIQKR

SIGTVISSLGSHVLTVTQVNREAK 

VPSDAAEELRASLEAAKEMHDPQLVELVDMSSGE 

TVLHRVADSQNVNEASAYAEMTDG 

RYFTLLDHNGRTTIWDLLSGREIADVMVPARKGL 

YGVSVMSIQDQMWLPRHRKPSRD

LASLRSVAVSGGATVQPITAMHAIS

TVDGSIRWSRSFDDAWGCTVEQPWVTPMLLLVR

AHLTVPGNGVRRRQMDVMGISSVDGETLHEIDRK

ETSSNTNVIESRVKLIPHRNTMLA

EIQLELLTYTFKEEGSDDNGDSEADVKTDEGDES 

DEQAEDVSE" 

/note="cosmid pircos-d2g12/ cosmid
pircos-d1d02 joining point"
/gene="devC"

/locus-tag="RB8046"
/gene="devC"

/locus-tag="RB8046"
/function="ABC transporters"
/note="PMID: 98230330 best DB
hits: BLAST: embl:CAA67986.1;

(X99672) devC [Anabaena sp.]; 
E=2e-57 embl:CAA05976.1; 

(AJ003195) membrane spanning 
subunit [Anabaena; E=7e-57 
pir:S76812; hypothetical protein - 
Synechocystis sp. (strain PCC; 
E=4e-53 COG: Rv0072; COG0577 
Predicted permease; E=0.008 PFAM: 
PF02687; Predicted permease; 
E=5.7e-12" 
/codon-start=1
/transl-table=11
/product="exporter subunit
devC-putative ABC transporter
family protein"
/protein-id="CAD78436.1"
/db-xref="GI:32445705"



gene 
CDS 



VAAAGVGFAAVLMFTQNGFRNALL 
DSPVRLIEWDCELVAISPARYMLPVDERFPRSL
LRRVKADPAVTDVQALLIERTMAR
VRVEGSPSRPIRVISVPNASLYSTGAGDSPRPWL
NIAGLDENRDALRQRDTALVDRKT
RSEYGFQFDHSGNPADQRIELSNQSIELVGSFTL
GTDFANEGSLLIAESSFANYFPRR
GYGKPLETVDLGLIRLQDGASPEEAAERLTEMAA 
GQWQVLTRQQIRDREQTFWSEQTP 
VGMIFLIGTLMGFAVGIIICYQILFTNLQDAMSE 
YATLKAMGYPNRFFVGLWRQSVY 
LSVLGFIPAVLIALLLFRMLESLTGLPMALTVSR 
AGSVLVLTTLMCLISGLLALRKLVRADPASLF"
.48100 /locus-tag="RB8049" 



complement (47630 . 

complement(47630..48100) /locus-tag="RB8049"



gene 
CDS 



complement (48175 , 



/function="transcriptional
control"

/note="PMID: 1406588 best DB hits:
BLAST: pir:G82098; conserved
hypothetical protein VC2272
[imported] -; E=4e-30
swissprot:P25538; YBAD-ECOLI
HYPOTHETICAL 17.2 KB PROTEIN IN;
E=2e-28 pir:E72221; conserved 
hypothetical protein - Thermotoga 
maritima; E=4e-28 COG: VC2272; 
COG1327 Predicted transcriptional 
regulator, consists of a; E=4e-31 
PFAM: PF02644; Uncharacterized 
BCR, COG1327; E=5.2e-62" 
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAD78437.1"
/db-xref="GI:32445706"
/translation="MRCPFCHSDNDKVQDSRTAE
AGYWRRKRLCQTCQRRFTTLEQI
DALNVRWKSDETREPFDREKIKRGIERACSKRA
VTSDEIEKTVQKIEEAIYAEFDME
IPTATIGEWLRKLATLDEVAYIRFASVYRDFDD
AKDFLQIVSNLQRDAETEGR"
.48393 /locus-tag="RB8050" 



complement(48175..48393) /locus-tag="RB8050"



gene 
CDS 



48427. .49107 
48427. .49107 



/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78438.1"
/db-xref="GI:32445707"
/translation="MVHERFQSKNHANVGKSSQN
QTIVLSRNRCPTWPKSPRSGALHC
DGSVCDCRTRHLASEDVLDDHSTPSKDS"
/locus-tag="RB8052"
/locus-tag="RB8052"

/note="PMID: 98295987 best DB 

hits: BLAST: pir:G70979; 

hypothetical protein Rv3278c - 

Mycobacterium; E=0.59 PFAM: 

PF01944; Integral membrane protein 

DUF95; E=0.0077" 

/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAD78439.1"
/db-xref="GI:32445708"
/translation="MTDPANPTDENATSKTPAQT

DSVPDPLSSAASSTPVSDSGAPAQ 

PRADTPRERFMEQIAKKQSLDDHDPEESLWEGGY 

SPKAMIGSWIGLSLLSIVLLVAAG 

FIEQFTFGIALIVIWLWILVGLNYAAKRLGVQY 



gene 
CDS 



49110 
49110, 



.49499 
.49499 



gene 
CDS 



complement (49613 



DISDVSYEQGPIQRMFKVGSISIISTDKSDPKLE
LVGINNVGEVAGLIDDIRRAERRRRSIHIEQN"
/locus-tag="RB8054"
/locus-tag="RB8054"
/note="PMID: 2509712 best DB hits:
BLAST: swissprot:P18746;
Z026-XENLA OOCYTE ZINC FINGER
PROTEIN XLCOF26; E=0.84"
/codon-start=1
/transl-table=11
/product="similar to oocyte zinc
finger protein"
/protein-id="CAD78440.1"
/db-xref="GI:32445709"
/translation="MSVWFVQKGDEQLGPYKPAE
LLALVRDGSWAETRIRKDDSNWF 
TASEVGGLFEAARRPSIEHYCPQCNVRVSQPPTI 
CPECGKDLDRTKTRIIEHSIGQSK 
RSSDTPSAGPSDSAKRWLQKKIRRDKK" 
50851 /locus-tag="RB8055" 



complement(49613..50851) /locus-tag="RB8055"



gene 
CDS 



complement (50931. 



/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide prediction"
/protein-id="CAD78441.1"
/db-xref="GI:32445710"
/translation="MNNNWFPRFAARLFGTLAAV
GLAWSGLGNAATHAQQFGGPSSSN
TMMHDGQAFRWQASDPAAGGWDLGAMQSAMNAS
SSSFGNEVAQVGHHGLGQAQACGS
CGTQCGGACGSAGYGGGMACPTCDPYRYASVEAL 
FMRREGIDNFTVARDFALDDLDFE 
WAPRITVGWPDCVNGFEVGFTGVLDWESDVSVA 
AVDGLSTLLIEDPLQPGTLGTFGF 
DAADEADAQRQ I YESRFWS VEIVDSFRTMiyiAWDVAKL 
LIGGRYIDFEEDFNYSTVNGANTG 
LLRNNAANRMIGLQVGADIYNPMGRFSSSYVRAR 
AGGFLNIAESSVLVRNETDILANG 
SEESTELSGMFEFGTGVRYQVGELMAVRGGFEAW 
YLTGVATSEEQISSATVTPTFGRS 
LIADDDVFFIGLTFGAELKY"
52712 /locus-tag="RB8056"



complement(50931..52712) /locus-tag="RB8056"



gene 



52654. .53058 



/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78442.1"
/db-xref="GI:32445711"
/translation="MLIGKTMNPTGSNRMSSCRK

RERLARWLTQAFVWGGCIAASQVA 

GNTVLADDAVAAKLPPIRLVSGTIHSNPFVERPA 

APTGVASTVAGKTVSSNVNLIQTQ 

GDAVSGGTTVLYPIRQTSDQTMSEIKLKSIGTAV 

GLLPIGAPATQQTPLWEPARTPL 

PRINPHAQPDYDRAESNGIVELTPAQVSTNVEAK 

QSISMDFEVDQTEVQWQQPKIASE 

PAPWEQNWAEEDIVLAPAVQEPVSPQPAEVEV

EAVDVAPIETPWANAEVSEPIEF 

SLNDATDSLSDADDSISLSFSDSDEPGENLVADE 

LVSMPEPMKIAVPTKIAPPVEQPT 

DESTLPRLPKPVQIKTPSEMMVQQQRHLPVKIDS 

PTQLERADSERMQNAPVRHRAAVA 

VEAPPMVAVAKRTASVSSESRVSAARIVPAAIiAS 

HRGEFHRSVPATEAERSLRIEDTI 

QCDANDVTSLTVDGVIEAVRVEDDSIARVIGSTS

RHLRLIGVRPGKTRVLVQQKLAGD

AESIREIYELHITSPSNVRGGESQQEQSLMELIE

QKYRTASVQISRQGDRIIVQGECD

DSRDAKEIVRLIRKTYLVPVEDQLILR"

/locus-tag="RB8059"



/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78443.1"
/db-xref="GI:32445712"
/translation="MRQLLIRFEPVGFIVLPISM
ADPYAHDLLGSCTPPAHFPRRWPS
EIIHRTSEHPLYRRRWPALTILPDRTFGSAGSV
LTQIRLGPATPTPLSGNAQPPARR
YVFTTRLFQPRRCFTNSIARRSTARRQCHKAN"
complement(53080..53925) /gene="spoU"
/locus-tag="RB8060"
complement(53080..53925) /gene="spoU"
/locus-tag="RB8060"
/EC-number="2.1.1.-"
/EC-number="2.1.1.66"
/function="rRNA processing" 
/note="PMID: 7590303 PMID: 2987648 
best DB hits: BLAST: 
gb:AAF73591.1; (AE002337) spoU
rRNA methylase family protein;
E=2e-27 pir:C72066; rRNA methylase
- Chlamydophila pneumoniae
(strain; E=5e-27 pir:A75448; rRNA
methylase - Deinococcus
radiodurans (strain R1); E=2e-26
COG: CPn0530; COG0566 rRNA 
methylases; E=5e-28 PFAM: PF00588; 
SpoU rRNA Methylase family; 
E=4.7e-24" 
/codon-start=1
/transl-table=11
/product="spoU rRNA methylase
family protein"
/protein-id="CAD78444.1"
/db-xref="GI:32445713"
/translation="MQKCCPMETIRSLQNAAVRR
IVSLRRSRKRRAAGWLVD6PRES 
LRAIEAGLKLTAFYEIEPKLSESAGPPDMPEQAV 
ARQHAIDSGVHRWVTSEVFRKIAY 
TASTDRCVAEFVAPDDSLEQLLERSSWNDGLILV 
LDRVEKPGNLGAVFRSADAAGVSA 
VLLSDCPSDRFNPNAIRGSLGAVFTVPSASGSES 
QMEAFLKTHGYRVAAMRVEGSRPL 
FDSDLRGKVAWLGSEADGLADRWAGDSIDPVAL
PMAGHVDSLNVSVSAAIVAFEAVRQRSNG"

53897..55168 /locus-tag="RB8061"

53897..55168 /locus-tag="RB8061"

/note="PMID: 97426617 best DB 
hits: BLAST: pir:C65084; 
hypothetical protein b2981 - 
Escherichia coli (strain; E=9e-35" 
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAD78445.1"
/db-xref="GI:32445714"
/translation="MVSMGQHFCTFASMGEPTRV

CYLLTFAKQLLCYQPLNARMSENV 

LCQPVSGRQDQKAFLDLEKRLYSDDPNWVPPLWS 

ERVKLVGFKHHPFYDNAEGQTFLV 

RRGDRWGRVLAWNHAHNRYHDETRGFFGFFEC 

EDDEEAAIELLDTAGDWLKQRGMT 

CVRGPVHPSLNYEVGLLVDGFDTPPTFLIPYNHP

YYERLIQAAGFEKSQDLYSYEASI

DILETLDPKLLFVIEESTRRFNAVCRSIDPKNFN

ADVRVFLDIYNQSLQRTWGYVPMS

EAEVDDQSNGLKNLLLPKLTSIAEIDGKPVGAGF

GLLDYNPLIKKINGKLLPFGWLKL

LMGRKKLKRLRLVSANVLPEYQKWGLGLVTLYKI

LPEAIKFGIEMGEFSWVLESNQLS

RGTIERGGATRSKTHRIYDRSLTPESEQAST"



/locus-tag="RB8062"
55165..57267 /gene="yoaA"
/locus-tag="RB8062"
/function="recombination and DNA
repair; DNA synthesis and
replication"
/note="PMID: 92332449 best DB
hits: BLAST: pir:E82133; 
ATP-dependent helicase, DinG 
family VC1990 [imported] -; 
E=4e-87 swissprot:P76257;
YOAA-ECOLI PROBABLE ATP-DEPENDENT 
HELICASE YOAA; E=2e-84 
gb:AAG56797.1; AE005403-8 
(AE005403) putative enzyme 
[Escherichia; E=5e-84 COG: VC1990; 
COG1199 Rad3-related DNA
helicases; E=4e-88 PFAM: PF00270;
DEAD/DEAH box helicase; E=0.00011
PF00271; Helicase conserved
C-terminal; E=0.5"
/codon-start=1
/transl-table=11
/product="ATP-dependent helicase,
DinG family"
/protein-id="CAD78446.1"
/db-xref="GI:32445715"
/translation="MSEPLSIDSILGPGGSISRR
LPRYEPREQQLEMARSVSSALTDR
EHLWEAGTGTGKSFAYLAAAILHATSDQTESGS 
QKPKRSSDDVDDRIGPAGDRDKES 
DRPKRVLISTHTISLQEQLIGKDIPLLNSVIPRE 
FSAVLVKGRNNYLSLRRMGRAVEK 
SVSLMANDFQMQQLREIRKWSDNTPDGSLSTLPI 
KPDGQVWDEVRSDTGNCLRNKCPN 
FKDCFYFQARRRAQNAQLLIVNHAMLFTDIAMRR 
QGVSLLPDYDAIILDECHTIESVA 
SDHLGIRLTSGQFDYLFDRLYNDRQQKGLLVAHN 
LDALQKMVDRCRFAASEMFAGVLD 
WMQESRSRNGRVHHPEWPNPLSEPMEILARRLR 
AHADAQDNDSDRQDFQSAHDRLLA 
LAGGLREWLDQSLKQESVYWVETSGSRRGMDRVS 
LSASPIDIGQTLREEVFQNEQIGS 
VIMTSATLATGEQDKFKFFRSRVGLTTGRSLQVG 
SPFDYEKQAKLIIVRGLPDPSAKR
DEFEAALPQQIKRFVGHTDGHAFVLFTSYSLLRK 
CAEAITPWCIERDLHLYSQAGDQN 
RTQLLDSFRKDPRGVLLGTDSFWQGVDVPGDALT 
NWITKLPFSVPDHPLLEARLETI 
RARGGHPFPDYQLPEAVIKFRQGFGRLIRTRDDS 
GMWVLDPRIRSKPYGRLFLSALP 
PLPCHDVGTVPRQKTKKQ"
complement(57222..57440) /locus-tag="RB8064"
complement(57222..57440) /locus-tag="RB8064"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78447.1"
/db-xref="GI:32445716"
/translation="MPKGHESGGVHCVQTTGYRL
QSLRDKACENLHPWRINALTASHS
QRFLHTSNKRTALAHCFFVFCRGTVPTS"
complement(57424..57540) /locus-tag="RB8065"
complement(57424..57540) /locus-tag="RB8065"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78448.1"
/db-xref="GI:32445717"
/translation="MGGGCRVAAHPRKRKTRSDP

ERCEVSTTPSGSEIAQGA" 



CDS 



complement(57519..57635) /locus-tag="RB8066"



gene 
CDS 



complement (57645 . 
complement (57645 , 



gene 
CDS 



57894..58541
57894..58541



gene 
CDS 



58538..58717
58538..58717



gene 
CDS 



58838..61033
58838..61033



/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78449.1"
/db-xref="GI:32445718"
/translation="MWRSLVGHDFCMADFFLIPT
SAFLHLGRDHRLWAEGAA"
57842 /locus-tag="RB8067"
57842 /locus-tag="RB8067"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78450.1"
/db-xref="GI:32445719"
/translation="MIGHNDVSRLGAETVANAKR
PIFPTTPVSIDAGMMADDLGRMQR
LDSMKVQRSMDLFETNPHRET"
/locus-tag="RB8068"
/locus-tag="RB8068"

/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78451.1"
/db-xref="GI:32445720"
/translation="MVSNGVLHRSSCPVWFFLMR

LIALPLIVLGLAAPCLSAEQNRLT 

IQWFELDSTEQPELTKKLARSSQVKNSDTVLTEL 

RECESARLMVQHELSVTSGSRVNQ 

STKMDDTTFKLNLEVQPADQGSYTFHVTSSLETL 

APGKDIARRTAIRKVDSAFKLSPG 

TSFVLTGLERSGRERAITQVLWRLQGFAEEQTV
ETVQNGLEPHSSPEAALSTND"
/locus-tag="RB8070"
/locus-tag="RB8070"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78452.1"
/db-xref="GI:32445721"
/translation="MTWLVGSRALMETSFIGTAV

LLFRDGLISFSSAIPFLLVSSKTD 

LSLNRDDCDYPQAPK" 

/locus-tag="RB8071"
/locus-tag="RB8071"

/note="PMID: 2808524 PMID: 8276898
PMID: 2497103 best DB hits: BLAST:
swissprot:P13395; SPCA-DROME
SPECTRIN ALPHA CHAIN pir: ;
E=0.46 gb:AAF47569.1; (AE003472)
alpha-Spec gene product [alt 2];
E=0.46"
/codon-start=1
/transl-table=11
/product="similar to spectrin
alpha chain"
/protein-id="CAD78453.1"
/db-xref="GI:32445722"
/translation="MSVTSSASTSLACGACKHNN

APGAQFCGGCGHFLHEKCVQCGDS 

VSLTQKFCVGCGQDLNAWLEKRIAEQETKLSDAV 

AAVKCHEYERALGLLNLLAKSGDY 

RFQSVREQAAAAKDKVESLQKKVHAQASQRITAA 

KEAHAQNDLSTAVKLLAQVPENLL 

DDESRSIRQSSQVHLDQLKTLHGELQQALAEKSY 

SQVAGLLQQLLELQPDNQKYQQLS 

QQVGDKLLRRAEKLCARQEYQTARNALNSLPTIC

HNAQFADLSRRSELACWLSKQFDV 

EPYATNALGRLAMRYAKEFPSDGKASDCVKQLSK 



GGRVGILANPQSLNFDELAESPPSFASFAEAIGL 
ALHALGLSRISGNLLPKKGVMSKL 
GLSKSKAVWGIDVGASGINAIKMRVEKGSDQPIV 
EAAHRVELKNPTCRGGSKSASELI 
PEAITRLMEEIDVSDSKVYANLPACEGIARFCEL 
PPVKDKDAERLIETEVKTRIPIST 
EDLALITWVAPLQKGSTVGRPVVMAAATKLTVSR 
RVDLLGIGGLKLDGLVPSPIALAN
FAAHEFSELLAPPADKSAKKKSKTVEETSDESSE
DESFSLTSSSKQPTLALIDAGASK
TTMLLISPISIWFWSHESGGEDITAWARRTKTT
AEDAEQSKRNLASIKEPHEVDDDI
LEKQEITRARLRKLYEEADKTFRHFDIQATWCVG
SAHQQHGFLRRVMMK"
/gene="abf2"
/locus-tag="RB8073"
/gene="abf2"
/locus-tag="RB8073"
/EC-number="3.2.1.55"
/function="sugar, glucoside,
polyol and carboxylate catabolism"
/note="PMID: 20125575 best DB
hits: BLAST: ddbj:BAB05586.1;
(AP001513) BH1867~unknown
conserved protein; E=1e-23
pir:B59296;

alpha-L-arabinofuranosidase II 
[validated] -; E=6e-19 
embl:CAB89837.1; (AJ242516) 
hypothetical protein [Salmonella; 
E=1e-1
/codon-start=1
/transl-table=11
/product="alpha-L-arabinofuranosid
ase II"
/protein-id="CAD78454.1"
/db-xref="GI:32445723"
/translation="MSVDGGRRVQLMATLFLTCL

FGLPPAPKADAQESSPAEKEPFFH 

NPIAKGADPWVIRDPHHPRYLWCKSDNDHGILIF 

ESDDLTKLGKRHLVWKAPDEGPYS 

KEVWAPELHAIGDRYYVYFAASDGDNANHLTYVL 

KSKTSDPLGEYTLHGPMATGDGED 

GQSPNVWAIDMTILQHNDNLFAIWSGWDQPGSDN 

QFLYIAPMSSPTKISGPRVLVCDN

DDHLWERIEPDASQRGLNEGPEIFQAKGKTSLLY 

SCGASWLPTYKLGRLELVKDDPLA 

DGAWQKMPEPAFQSTESVYGVGHSCFVQSLDGKQ 

WWHIFHAKVGPEPGWNRALHLQPM 

NVADDGTPLLGSPLDRTTPIQRPSGQALGEER" 

/locus-tag="RB8076"
/locus-tag="RB8076"
/function="recombination and DNA
repair"

/note="PMID: 11466286 best DB
hits: BLAST: embl:CAB93380.1;
(AL357523) hypothetical protein
[Streptomyces; E=1e-12
swissprot:Q52673; YQGF-RHOCA
HYPOTHETICAL 17.1 KDA PROTEIN
; E=4e-12 swissprot:Q9ZDJ8;
Y330-RICPR HYPOTHETICAL PROTEIN
RP330; E=2e-11 COG: RP330;
COG0816 Predicted endonuclease
involved in recombination;
E=2e-12"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein-putative an endonuclease"
/protein-id="CAD78455.1"
/db-xref="GI:32445724"
/translation="MPTTTTTAEDGFPATGRLAS

VDYGTVRIGVAICDPDWILASPLE 

VHPVSTPEKDAQYFIDLAKSERIAAWWGLPIHC 



PTRLFDERFTTVAANAKIRQGKTTRKKTKQRVDA 

VAAQVLLESFLEACRYRGELAGHC 

LEDSTSDESLDDA" 

gene 63046. .64737 /locus-tag="RB8077" 

CDS 63046. .64737 /locus-tag="RB8077" 

/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78456.1"
/db-xref="GI:32445725"
/translation= "MSQPFVWQNQKPLMRWIAMT 
SRSILFHSCTFALLGLCSAFIHPS 
ATDAADWWASPEFDETLEPWIQMRSEEGLRVA 
IARPADNASDTHQQIWNVGGADTR
YIVLVGDTPDYDTPRNQPHQIPTWMIPAPVTSRF 
GSTSTLPTDLPYGDRDGDRVSDAA 
IGRLPVRSAEQLASVIGRIEAYENSDDFGLWRRS 
FQLTGGVGGFGAMVDTAIESVTRG 
VITTVLPADAKPQIAYASPNHPFCPPGESFTDAV 
LNRYRTGARFWVYAGHGQIDRLEL 
LQTTAADGKPAAREHWSVESLLNNQNATQLERAP 
EGATIAVLLACFAGAYDAPGDCLA 
ERMVLAEGGPIAVIASSRLSMPYGNACMGLGLLQ 
SVYSSGPQNTGCDRIGDAMLHAAR 
SLQSKQQGKQSTMRVMVDTLASMISPAGTDLKQE 
RLEHAALYQLLGDPTLRLHPPQPL 
DLSIEPSNQTDQQTDGWARSLSVAVTSPIAGTL 
IVAVERPLTAITKTPTDSTSEHDA 
HGTTITEHQIEVSAGKRILQTLTLPLGESGPLII 
RGFVHGKTGWASAAQRTFLPD"

gene 64750..66261 /gene="arsA"
/locus-tag="RB8078"
CDS 64750..66261 /gene="arsA"
/locus-tag="RB8078"
/EC-number="3.1.6.8"
/function="sulfatase"
/note="PMID: 7910580 PMID: 1572648
best DB hits: BLAST:
swissprot:P50428; ARSA-MOUSE
ARYLSULFATASE A PRECURSOR (ASA);
E=1e-56 embl:CAA36398.1; (X52150)
arylsulphatase a [Homo sapiens];
E=4e-55 gb:AAB03341.1; (U62317)
arylsulfatase A [Homo sapiens];
E=4e-55 COG: aslA; COG3119
Arylsulfatase A and related
enzymes; E=2e-31 PFAM: PF00884;
Sulfatase; E=1.6e-47"
/codon-start=1
/transl-table=11
/product="arylsulfatase A
[precursor]"
/protein-id="CAD78457.1"
/db-xref="GI:32445726"
/translation= "MRLPHLLALCLPAFFTALPA 
SATAPEDIAGSRPNIWIYMDDMA 
YADIGPFGAKGYSTPNLDRMANEGRKFTDFSVSS 
AVCSASRSALLTGCYHRRVGLSGA 
LGPQAKIGLAPAETTFAEVCKSAGYRTACHGKWH
LGHHPKFLPTNQGFDQFYGIPYSN 
DMWPLHPDTIRRQQKDPNDPGNWPPLPIIESIAG 
QPPRIVNDNVQPADQEQMTVELTR 
RSVEFIKNQSSDKPFLLYLPHPMVHVPLYVSERF 
RGKSGAGLFGDVMMEVDWSVGEIL 
SAIESIDQQKNTLVIFTSDNGPWLSYGNHAGSAA
PLREGKGTQWEGGVREPTLMWWPE 
TIPAGTTCETFCSTIDVLPTIVELTGGEAPERKI 
DGHSIVDLMLDVPGAKSPHESFVG 
YYGGGQLQTIRNERFKLVFPHAYRTLGDREPGKD 
GMPDGYAMTKSGLELYDLDADVSE 
TTNVIEAHPEWKQLQAAAEVYRQQLGDKLQKVK 
GSEIRGPGKLNENEPQLSW" 
gene complement(66230..66388) /locus-tag="RB8079"
complement(66230..66388) /locus-tag="RB8079"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78458.1"
/db-xref="GI:32445727"
/translation="MRSPCPAGRACRNFALLRYG
LRDVQPVARWSPTEQCINFSTNITNSVEARFR"
complement(66385..67146) /gene="hisA"
/locus-tag="RB8080"
complement(66385..67146) /gene="hisA"
/locus-tag="RB8080"

/EC-number="5.3.1.16"
/function="biosynthesis of
histidine"
/note="PMID: 2664449 best DB hits:
BLAST: swissprot:P74561;
HIS4-SYNY3; E=3e-56 pir:A83003;
phosphoribosylformimino-5-aminoimi
dazole carboxamide; E=5e-49
gb:AAF05093.1; AF150930-2
(AF150930) phosphoribosyl; E=2e-46
COG: slr0652; COG0106
Phosphoribosylformimino-5-aminoimi
dazole; E=3e-57 PA5141; COG0106
Phosphoribosylformimino-5-aminoimi
dazole carboxamide; E=5e-50
NMB0629; COG0106
Phosphoribosylformimino-5-aminoimi
dazole; E=6e-44 PFAM: PF00977;
Histidine biosynthesis protein;
E=4.7e-70"
/codon-start=1
/transl-table=11
/product="phosphoribosylformimino-
5-aminoimidazole carboxamide
ribotide isomerase"
/protein-id="CAD78459.1"
/db-xref="GI:32445728"
/translation="MSSQTPGSEPRMEIWPAIDL

RHGKPVRLRQGDYDQQTTFGDDPV 

EFAKRWQESGAKRLHLVDLDAARGDSPDANRDAV 

QRIIEATGLPCQMGGGVRDEITIE 

SLLNVGVTRLWGSRALKDPDWFVEMCDKYPGKL 

VAGIDARDGKVATQGWLETSDVSA 

FEFATKLRSRTENIAAIVYTDIAKDGMMQGPNFE 

GLAEMAAASDIPLVASGGVTTYDD 

IKQLVEMKMPAAIVGRSLYDGVMELGEWRLAGD 

V" 

67116..68816 /locus-tag="RB8082"
67116..68816 /locus-tag="RB8082"
/EC-number="4.1.99.3"
/function="DNA repair (direct
repair, base excision repair and
nucleotide excision repair)"
/note="best DB hits: BLAST:
pir:S75597; hypothetical protein
slr1343 - Synechocystis sp.
(strain; E=6e-94 gb:AAG19190.1;
(AE005016) Vng0705c [Halobacterium
sp. NRC-1]; E=2e-91 pir:G82413;
conserved hypothetical protein
VCA0809 [imported] -; E=2e-83 COG:
slr1343; COG3046
Deoxyribodipyrimidine
photolyase-related proteins;
E=6e-95"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein-putative a
deoxyribodipyrimidine



gene 
CDS 



complement (68890 . 



/protein-id="CAD78460.1"
/db-xref="GI:32445729"
/translation= "MVRFQVSARTKIAGLGQGRA 
GQGCVSATVNTREDNSRPPFTGPV 
MTKIRHLVLVLGDQLNHDSEAFTDFDPQSDRVWM 
AENDEEATHVWCHQTRLVGFFSPM
RHFREELIQRGFDVIYHELTGDRRKARESSFASV
LRKTLSAHSVEKILWAPGDYRVR
EQLKATAEQEDVPLEFLTDNHFYCSPDQFDDWAS
GRKSMVMEQFYRTMRKEHSILLDE
EGAPEGGQWNFDQDNRKTFGKGGPKEVPPTPSFK
PDSITQEVIAMVRDRFEDHPGKVD
QFDLPVCRDDALNSLDDFIEHRLPLFGTYQDAMW
EGETFLYHSRLSHSINLHLLSPKE
WDAAVKAYKDGNAPLNCVEGFVRQILGWREYVR
GVYWNRMPDYEERNALHCDSEQDV 
PPFFWDGNTDMACVSDAMRLLVDTAYAHHIQRLM 
VLGLFAQLFGTHPLRFHHWHMAMY 
ADAIDWVSLPNALGMSQYGDGGLMATKPYCATGK 
YINRMSNHCKNCRYDPAKSTGEDA 
CPFTTLYWDFLDRHKQQFQNNNRMTLQLKNIERK 
DSSEWTAIRSRARKIRSGTIKLQQ"
69006 /locus-tag="RB8084" 



complement(68890..69006) /locus-tag="RB8084"



gene 
CDS 



69030..70517
69030..70517



gene 
CDS 



complement (70518 , 



/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78461.1"
/db-xref="GI:32445730"
/translation="MSRRLAASFSSPHKWSERR
WSFRPRIEAASFYRVNPV"
/locus-tag="RB8085"
/locus-tag="RB8085"
/note="PMID: 11705915 best DB
hits: BLAST: gb:AAG31169.1;
(AF314961) unknown [Salmonella
typhimurium]; E=0.70 PFAM:
PF01011; PQQ enzyme repeat;
E=0.017"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein-putative PQQ-repeat
containing protein"
/protein-id="CAD78462.1"
/db-xref="GI:32445731"
/translation="MDFILCTAWVIARLCERTEA
NAKRRTSPMTPAWLLTINMKRIFL
QSSILLALGIiAGCTPAASERTFDVPLEEPTRADR
PSEMIVADYQQSKRRDDLQTQMDW
PVLFGPHRTGATQASIPAVWPEQGPALRWDVEVG
TGYGSPVTAENRWFSHRVDDLEW
IQCHDAQDGAVRWKHSLETDAECDFEYSDGPYST
PLIDQERRAVYHFSGSGTLVSLDF
DTGEEVWRRDLHGEFQVEPELFPSGATPLLHGEH
LILSLGGANNDAGIIAIHVRDGKT
IWSCTDHKAAYTTPWAQLFGQEFAFVMTAEGLV 
CLDPANGEVDWEIPHRSRAPMSYN 
SVSPLVWNDHVLMVTGPGPGAVCLQILPDRSYEE 
RWKNRRLLDSQYTNLLLSDGHVFG 
FTSAGQGGAELRCIEFVTGELKWKYHSVLRRSQG 
LIAGDAMFFLGERGHLASLQRSIE 
EPRVLSFTQQPLMKADCYCSPAILKNGIVLKDED 
RIAVFDLPPSP" 
.71285 /locus-tag="RB8089" 



complement(70518..71285) /locus-tag="RB8089"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"



/db-xref="GI:32445732"
/translation="MHQVMTMKTVEKSKSQVWRP

FVLFVCCFVCLANVLVSETVHGAV 

VTRTMQFNNSDGFTWGSSDDKPGASQYVIEGMY 

VGISTGKFLYPQQSSDEWWDGQSD 

QSTSLGSQAAGTGPIQFTLALNAPAPGQTFDMES 

ITLLGYSELSTTIEQTVRVNFTGT

KIVNGTMSSVTLLSGEFEADELTPLWTFGDEFH 

GLTSLTWSQGGTARAHQFDNITFS 

TVTAVPEPGTFLAMIALATMGFVTRGRRWIGKRL 

QW" 

complement(71254..73476) /locus-tag="RB8091"
complement(71254..73476) /locus-tag="RB8091"
/note="PMID: 20039618 best DB
hits: BLAST: ddbj:BAA86452.1;
(AB032964) KIAA1138 protein [Homo
sapiens]; E=0.74"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAD78464.1"
/db-xref="GI:32445733"
/translation="MRWLQTMSDNTSGSGIVLRP
VGHRSWGTMVSLVLGCQLLCCGG 
CGPSKEELMMRAAQRARPKSSDEKEESAKAQTQP 
VSTAIASEWDAKAGVKETANAAT 
TSAVQKAAKDRDANSPEETIGDWKPVSDRKPANP
LGEPERHAKAVENLNRLTEALMAY
KEDKGHYPPSAQAKAGIKTLSWRVLILPYLGHQD
LFEKFDLNMPWNRSPNKELIEFIP
DELVSPERFDSKTNWMLPAHRNYMFGDNRYPRDR
NIEDGIENTLMLVEVNDDLAVEWT 
RPVDFEPADLGALKGSLGELRSGGTLVAWANGWP 
SYVANSVSTKQWTNAFTYESGDGQ 
RAGAIHKEPDALLAAHKPSRSETKVEAKVSGRVE 
ASKIVENTEVEAESWGENRLPVPP 
SPDLATSNQRIEQLFGEQLVRAKQNQTQAELSND 
FLKKSLVMSDDAAGAYALQNAAID 
LAIDSGDFMLFQAVLDQHASTFEVDLYQVNRDGL 
LEFSRRNDVDEDTASQMAFVRRAL 
VAIQEGLERNDFEGVGRIASALPRVEEERRGFRR 
ANVRGGKRDASAEKLVRLLQTQLS 
SANKQYEQAAEKVAEYRKNPDDAELASALGRFYC 
FLKGDWAMGLPLVINGTSEKLSRV 
AKRDLEGANDAEDFLAIGDMWWELSEGLPAGIYR 
QGTRDRAGYWYEQSLEVMPESLDR 
LHVQARVKEWQSQDPGSPLATIRTINRQLGLAEN 
ADLEQWARKRTQVNAPGDDYEDG"
complement(73451..74641) /locus-tag="RB8093"
complement(73451..74641) /locus-tag="RB8093"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78465.1"
/db-xref="GI:32445734"
/translation="MMSITGKFLLVGILLASVLG

GLSTGLATIVTYAPYGVSKTERAE 

YEKKVSLIKREEALRKEAQSMGLPQAVLPSTRYD 

FGLVDPHTTASHAFEIWNHGAGPL 

TIDVAETTCKCTVGSAQKGTLAPGEKTSVTLTWN 

TGQKSEQYEQAARVITNDPTREVI 

DLTVSGWRTELFVPAKGVFHSGDAGEWESTLL 

IYSQQHDDIAWGAESDLVGFDWE

SNWPSDSQPSLSEQQPTVINQVKLRCRAQKPGR

FQGEVKLHLLVNGESDVIEKSVEL

TGRVHAPISFHSPMLHSRDGLDLGTLGNDQEHEF

HLIVRKHFDEERALSVLDVSPKSL 

DVSIEPTSRFGDYRLTIRIPKGIPSTIFNLDQKR 



complement (74638 < 
complement (74638 . 



complement (74851. 
complement (74851, 



complement (76119 . 



74838 /locus-tag="RB8095" 

74838 /locus-tag="RB8095" 

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78466.1"
/db-xref="GI:32445735"
/translation="MNNPFRKVESLRVPSIQATA
SFNGCRGRFTQSMPPNEREFRNVC 
GQRWRAFGFHFRWFSFSLKNRV" 
76122 /locus-tag="RB8096" 

76122 /locus-tag="RB8096" 

/note="best DB hits: PFAM:
PF00114; Pilin (bacterial
filament); E=0.86"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78467.1"
/db-xref="GI:32445736"
/translation="MKSQFRKSGFTLIELLWIA
IIALLAALLLPAITKAREAARAAQ
CQANLKNIGIGLFKYSNRAPSGTFCSGASDFRRD 
GCMDEYGWVADLVNVGDGNMNESL 
DPSNPLKGSEKLNDLYGKDTTDAKDAAPLARLSA 
GQCGKADWQGVSGDGSSTTFGGTA 
PETAVRAELIARYFLTNGYNTNYAASWHLVRGMV
KTEADPSTGELTTWSGGGFKGLGG
STGPMSVADLERSRISSSNVGFIGCAAPGDVDEA
ILGATLGFDGTGVWGTALNQTEAV
EYIASGSLLTEAFNDGPAYWNATAGNLDLIGSNE
TLLAQIGCERGEPTTAGCAAPTGP 
GGNGIYMQDTRDWYAVHAGSCNILMGDGHVEVFA 
DSNADGFLNPGFPVTGLTEAEISG 
VGYADDKLEMPRDKFFAGIFINDGYFKGNFE" 
.76334 /locus-tag="RB8098" 



complement(76119..76334) /locus-tag="RB8098"



76358 763 

76464..76715
76464..76715



76684..76875
76684..76875



/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78468.1"
/db-xref="GI:32445737"
/translation="MQSRNGIEAWERVSPPGFVP

KKKESVTLDKLKLAGTSLSSCESN 

ATKLTTSHFNTRSKPFGFRQLKSQERL"
/note="cosmid pircos-d1d02/ cosmid
pircos-d3d05 joining point"
/locus-tag="RB8100"
/locus-tag="RB8100"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78469.1"
/db-xref="GI:32445738"
/translation="MGGVLQEADSTNRLHNLHSQ

TFHNPYSFASHSHSCEGSFLDEFS 

RDKTHKTHRIKTEGPRVFTKTSHTFLRLIHAKLL 

CRFSR" 

/locus-tag="RB8101"
/locus-tag="RB8101"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78470.1"
/db-xref="GI:32445739"
/translation="MQNCSVDSVDSKIAKRAFRW

SSPERASALVIETKQLPDAWQRRL 



gene 
CDS 



complement (76885 
complement (76885 



gene 
CDS 



77942..78313
77942..78313



gene 
CDS 



78329..79276
78329..79276



gene 
CDS 



complement (79324 , 



.78048 /locus-tag="RB8102" 

.78048 /locus-tag="RB8102" 

/function= "protein secretion 
systems" 

/note="PMID: 1588814 PMID: 1309616
best DB hits: BLAST: pir:E83258; 
general secretion pathway protein 
G PA3101 [imported]; E=0.063 
gb:AAC83358.1; (AF092918) outer 
membrane secretion protein T; 
E=0.069 gb:AAC79845.1; (AF089753) 
cellulose-binding protein CbpC; 
E=0.24 COG: PA3101; COG2165 
General secretory pathway proteins 
G and H and; E=0.006 PFAM: 
PF02361; Cobalt transport protein; 
E=0.11 PF00114; Pilin (bacterial 
filament); E=0.011"
/codon-start=1
/transl-table=11
/product="similar to general
secretion pathway protein G"
/protein-id="CAD78471.1"
/db-xref="GI:32445740"
/translation="MSINRRHFESSLKHPLDPRN
NAAFTLIEMLWI S I IGVIjAALLL 
PAVSKAREAARAAQCQNNLKQFGVGLTGRTVNAP 
NGEFCSGAFDFRRDGVPTEVGWVA 

DLVRRGVLVSEMRCTSTSAQASSAIEHLLSDNEA

DWATNDCVDQLGAEEYTNEMGTVI

KNVARLIEASPADRVEYIERKMLEDGYNTNYAAS

WFLVRSEVILDESGQPQPKDSACT

NDLKSLNVTRGPLTTRLLDTAKAPSSTVPLLCDA

SPIGQLSGNVGELTSGTPYVTPLL

GEPILYKETDPNYLKEPVIAASTPREGVSGWLRI

WNYDTRQDYRGMSSHHGGVCHVLM

ADGSVRGLYDANGDGFINNGFPMDANFWTDAEVE

AGDLELASYYSLNSKGEEL"

/locus-tag="RB8103"
/locus-tag="RB8103"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78472.1"
/db-xref="GI:32445741"
/translation="MIEITTNISINVNAALFRGS

NGCLRDDSKWRRLMDITMKTVEED 

ETRAQRGILVQLHFSHGLARHQRQKGTHARTFTK 

RAGSSSSQIHRHQNTTPSSAKYST 

GSEPQHLVSANTVGPFPPIKL" 

/locus-tag="RB8107"
/locus-tag="RB8107"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78473.1"
/db-xref="GI:32445742"
/translation="MLLRIPPTSRRLALIGLACL
GTQSVLVPQSIGNAKEQSVLEQAD

DASGETYSLKHSLQPGQTLRYEVTHVAKTKTRIN 

GSEEIANVHTKSNRAWTVAEADNS 

EMTFDHSIESVAMTQQSGDADEVQWDSTTGEEPP 

KIFSWASQIGTPLATVTINKQGQ

EVRREDHAGSKSSLGMGTLALALPDKPVKIGESW 

AVPSEIQARTEDGFVKQIKIRQLY 

TLKKVKAGVATLSVKSETLTPIEEESLRAQVIQQ 

LSNGTLRFDVDNGYLLSKELIWDE 

SWGFQGPGSMMEYRAKMTEELLPEDSSTTVASK 

KKSQR" 

,82143 /locus-tag="RB8108" 



complement(79324..82143) /locus-tag="RB8108"



gene complement(82127..83044)
CDS complement(82127..83044)
/transl-table=11
/product="hypothetical
protein-putative transmembrane
protein"
/protein-id="CAD78474.1"
/db-xref="GI:32445743"
/translation="MATSLDAFAYPAVSRPRSAF

QHLVWKDFRMLQPLLIiAGVIGTAA 

LFLLAVLRAAIDPASDPSLTTLSVLWVLLPNLVA 

LGIPPISLGTENEERTLDWLRTLP 

VTWWQVLLSKLLIGFAAWVLAVLVASVCFALSWL 

VLRGGWDWSAIGRISGSESIFELM 

VYGVGLLICGWIAALIVRNPVGSVMLVIPLIVGL 

TLLWNYSVAGLVSTHFHYGVTFLE 

ATAAQRVSMLCLATVYLiAALGGWAGLARHRLAG 

ASDHRWQAPAVASTSAWQPSWQV 

AWHAKPRPFRALLWQQVRQTRLVIGAAFSFALFA

LLSIWTDGNHGFHDLLLVLGWPAA

HLMLGVAAFHGDRRGMRQGFFADRGWPPNLIWT 

RLIPTASASLFLILVFPFVSGLDL 

YLGMHYGTGTLWQAFGFSWVQTLLFLIGVLTGQ 

WGRRPSLAFFGAPWALLWMLPAF 

GLLAFYGGYTGLLWLAAGVLLFASRRLTPFVMEG 

VSSAAVWGRGLAYCLLALGVMCLC 

LFIHRWWTTPSEMPQWRKQMLAIVQEPITPETLE 

STVGVLPGTRNIHLNADVYSLEDY 

EMDARKSLLLSEFPTVNLVEASAFQYTVRLHPSR 

SWAGQRMNYAFENEAARYRDVELR 

LDAELKSSEVMGEHVSQSELYGLCVDTPIAAKGY 

ASVPNWNGLPYGVSPYQHFQWHPF 

RRKALEVAAKWSKLSREQSFVFGALGELLNNAES 

NEWIVINGLRQIAYEQNLFSDEAL 

PQSEEDQLDREAFIRWDQLVNKDLRRESRQRSL 

lAEWQRFQTGEWSGWFDDELGNEK 

SFLDAPLSEPFYRLEFEQTRFERILDVAVRNTLD 

QLEDGLPEWDDPELTKQRLLWTKL 

LKDRRSGVGHQEVASPPRAPVFYREQDLLIDELR 

QMADRLRETQQSNVGQVS PGTRAD 

ALSQSEVSHNLILEFERVPSNHPAR " 

/locus-tag="RB8113"

/locus-tag="RB8113"

/function="ABC transporters"
/note="PMID: 1924314 PMID: 8088782
best DB hits: BLAST: pir:T36523;
probable ABC-type transport system
ATP-binding protein -; E=4e-37
pir:C72303; ABC transporter,
ATP-binding protein - Thermotoga;
E=6e-34 gb:AAF81232.1; (AF263012)
ABC transporter ATP binding
protein; E=9e-33 COG: TM1028;
COG1131 ABC-type multidrug
transport system, ATPase; E=5e-35
PFAM: PF00005; ABC transporter;
E=5e-43"
/codon-start=1
/transl-table=11
/product="probable ABC-type
transport system ATP-binding
protein"

/protein-id="CAD78475.1"

/db-xref="GI:32445744"

/translation="MSKRFRNTVALDNVSLHVPA

GTVFALLGENGAGKTTLIRILTGF 

QKPDAGMASILGHDCGQNSEEIRRKIGYVSDSPA 

LYEWMTPPEIGWFVSAFYDDGFIA 

RYRDL I EE FQVPPATKI KSMS KGQRAKVALALAT 

AHDPELLILDEPTSGLDPMVRRQF 

LESMVDRAAAGRTVLLS SHHINEVERVADMVAMI 

HGGQVKLVQSMNDLKQHTRIVTAT 

MDDAHVESPDLPGRVLSETSNGRQRRWVITDWND 

AGDLHWGPNDGVRQFHVSVPTLEE 

VFIAVCGQPIRDEPSSFDRRQEVNDGHFA" 



/locus-tag="RB8116"

CDS complement(83172..83669) /gene="gntR"

/locus-tag="RB8116"

/function="transcriptional
control"

/note="best DB hits: BLAST:
pir:G69999; transcription
regulator GntR related protein
ytrA -; E=4e-13 pir:D72336;
transcription regulator, GntR
family - Thermotoga; E=9e-13
gb:AAC62421.1; (AF084104)
hypothetical protein [Bacillus
firmus]; E=1e-08 COG: BS-ytrA;
COG1725 Predicted transcriptional
regulators; E=4e-14 BH1940;
COG1167 Transcriptional regulators
containing a; E=3e-06 BH1164;
COG1725 Predicted transcriptional
regulators; E=5e-06 PFAM: PF00392;
Bacterial regulatory proteins,
gntR; E=0.0001"
/codon-start=1
/transl-table=11

/product="probable transcription
regulator GntR related protein
ytrA"

/protein-id="CAD78476.1"
/db-xref="GI:32445745"
/translation="MHRLVSRTPSQETPMFFS ID
AASD VP I YEQ I VRQVKLAVADGTL 
VGGQMLPSVRQLSRDIALNPNTIARAYRELQAQE 
I LKPLRGRGMWRRDAI EACTSAR 
DDLVGDGVRRALADAIAGGMSPDQLRSLFESELA 
RLSAAATKSNSDESNLESASSSNN EPSHE" 
gene 83626..85503 /locus-tag="RB8117"

CDS 83626..85503 /locus-tag="RB8117"

/function="biogenesis of outer
membrane"

/note="best DB hits: BLAST:
ddbj:BAB06996.1; (AP001518) spore
cortex protein [Bacillus; E=0.022
COG: BH3277; COG2244 Membrane
protein involved in the export of
O-antigen; E=0.002 PFAM: PF01943;
Polysaccharide biosynthesis pro;
E=0.05"

/codon-start=1
/transl-table=11

/product="similar to spore cortex
protein-putative membrane protein
involved in the export of
O-antigen and teichoic acid"

/protein-id="CAD78477.1"

/db-xref="GI:32445746"

/translation="MGVSWEGVRDTRRCTEPVCY

HDSTPQCPVNVLQRKIRTTVPNGP 

SSHMISRSFPANDKVNKVTPISEPTFANAATQHV 

ASRREFVRVARNDPTDSRSQLPMS 

TADSATQSTTPRSAFRVDSLAIGMIVMLAMTVLG 

RGIGFIRGMAFCRLMDDTDVGRWS 

MAFGFITLITPVMLLGIPGVLPRFTEHFRLKRSL 

TPFVRRIAIGTLSCTVIFVTTMLW 

LPDWFGWIVFLQPQDNRLIYGVAGAWGMIVYNF 

ISDLNGSLRQVRMVSCMQFMQGVG 

FTLLSVAWLLTGGTFTGWWMFAASCLVASLPGL 

WSLVRSWDSAQILQDSEEESNSPT 

ADKTLPSDDVPFGLWNMIRRLAPYATALWLMNLI 

GNLFELSDRYMILHFIPATETLSA 

EIAGQAAVGQYHSGRIIPMLLLSLGTMIGGVMLP 

YLSADWEAKRFAAVQTRLRDALLA 

VS I VFTCGSAMAILLGPWI FNVLLQGRYTDGMTL 

MPMALCFCTWAALVTVGQNYLWTV 



gene 85427..85855
CDS 85427..85855

gene 85827..86360
CDS 85827..86360

gene 86357..87190
CDS 86357..87190



lATLCSHGWMSGVWLAMIGCGYK 

VDMTLWLSLLPVSLLVSPYVALLLAIAIGLMVW 

NDDATRQRWLESLPENVRAKLPLL G" 

/locus-tag="RB8119"

/locus-tag="RB8119"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78478.1"

/db-xref="GI:32445747"

/translation="MMMRLVNDGWKACQKTSERS

CPCWADAQVPYDGIPIPSKWKPKA 

SARLSNVHDGPIRGDLLFFAKDSGLPPRAMEDGP 

IRGDADESRHSTRTFCVTIRTCIP 

ISTIHNWLGIADHRTRGGRSSANVNKPSFAGLDD 

RPKNGV" 

/gene="fabZ"

/locus-tag="RB8123"

/gene="fabZ"

/locus-tag="RB8123"

/EC-number="4.2.1.-"

/function="fatty acid
biosynthesis"

/note="PMID: 7806516 best DB hits:
BLAST: gb:AAF98278.1; AF197933-8
(AF197933) beta-hydroxyacyl-ACP;
E=2e-14 gb:AAK04875.1; AE006311-6
(AE006311); E=3e-14 pir:C82731;
(3r)-hydroxymyristoyl ACP
dehydrase XF1044 [imported] -;
E=6e-14 COG: XF1044; COG0764
3-hydroxymyristoyl/3-hydroxydecanoyl-(acyl
carrier; E=6e-15 PFAM:
PF01377; Thioester dehydrase;
E=2.1e-18 PF01575; MaoC like
domain; E=0.0056"
/codon-start=1
/transl-table=11
/product="probable
beta-hydroxyacyl-ACP dehydratase"
/protein-id="CAD78479.1"
/db-xref="GI:32445748"
/translation="MT I APKMAFDTGMNTPNDST
ETRQLGPKD I EAK I PHRAPMLLLD 
EWEITENTLHARKTFSADEFFVQGHFPDYPLVP 
GVIQCECCLQAGAILLSEHTPAVG 
EFVPVATRMDSVKFKNMVRPGDTVDIHVTLKEQL 
SNAFFLSGKMLLNGKVTTRLEFAC 
SITQTQPTGSSDSDATS " 
/gene="fabI"
/locus-tag="RB8125"
/gene="fabI"
/locus-tag="RB8125"
/EC-number="1.3.1.9"
/function="fatty acid
biosynthesis"

/note="PMID: 20138198 best DB
hits: BLAST: gb:AAF05840.1;
AF197058-1 (AF197058)
trans-2-enoyl-ACP reductase;
E=6e-31 ddbj:BAB06562.1;
(AP001516) enoyl-[acyl-carrier
protein]; E=5e-29 gb:AAD04184.1;
(L10036) unknown [Nostoc sp. PCC
7120]; E=2e-28 COG: BH2843;
COG0623
Enoyl-[acyl-carrier-protein]
reductase (NADH); E=5e-30 TM0441;
COG1028 Dehydrogenases with
different specificities (related;
E=1e-09 BS-yfhR; COG1028
Dehydrogenases with different
specificities; E=1e-06 PFAM:
PF00678; Short chain
dehydrogenase/redu; E=7.1e-05"

/transl-table=11

/product="trans-2-enoyl-ACP
reductase"

/protein-id="CAD78480.1"
/db-xref="GI:32445749"
/translation="MSTEPTPAMDFLGMSGKTFL
VMGVANKKS VAFAI AKQ I EQAGGE 
VIYAVRSEARRDSLSKLLADRRLIVCDVEQQEDI 
EAMAAELQRDNVTLAGWHS I AFA 
DYSDGIRPFHETTRRQFLQAID I S AFSLVAVCNA 
IKDQLANDASWTIGISTTRMASE 
SYGFMAPIKAALESSLAFLTKSFSRFSQVRFNAV 
AAGLLKTSASAGIPGYVDSYLYAE 
KVIPRGEAVSTNEVASTAAFLLSSRSSGITAQSI 
WDAGMS INYFDADWGAVTNASV D " 
/gene="ilvE"
/locus-tag="RB8126"
/gene="ilvE"
/locus-tag="RB8126"
/EC-number="2.6.1.42"
/note="PMID: 85289113 best DB
hits: BLAST: swissprot:Q58414;
ILVE-METJA PUTATIVE BRANCHED-CHAIN
AMINO ACID; E=4e-69 gb:AAB90305.1;
(AE001039) branched-chain amino
acid; E=4e-65 pir:F69057;
branched-chain amino-acid
aminotransferase -; E=2e-46 COG:
MJ1008; COG0115 Branched-chain
amino acid; E=4e-70 PFAM: PF01063;
Aminotransferase class IV;
E=1.9e-91"
/codon-start=1
/transl-table=11

/product="putative branched-chain
amino acid aminotransferase"

/protein-id="CAD78481.1"

/db-xref="GI:32445750"

/translation="MTQAI YINGEYFSREDAKI S

VYDHGLLYGDGVFEGMRIYSGKVF 

ALEDHMTRLYESARAIMLD I PIAIDALTTAVNET 

VAKNGLTEGYIRLWTRGGNQLGL 

NPFSCEDPQVIIIADTISLYPEKFYTEGLELITA 

STIRNHPAALSPRVKSLNYLNNIM 

AKI EAIRAGC I EAVMLNTKGEVAECTGDNI FI VR 

GGRLITPPIDAGILEGITRNTVID 

LARENGIEVAEEAMTRHDI FVADECFLTGSAAEV 

I PAVKLDGRVIGDGKPGPMTQKLN 

AAFRAFVAR" 

/locus-tag="RB8127"

/locus-tag="RB8127"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78482.1"

/db-xref="GI:32445751"

/translation="MASSATFS VRTLGHCDGNTR

FVLRTREFAKEPPNYLAQTIGGWD 

NQGSGIRYRYNMQIHRPAQALSVROCERCTPQHL 

RLPNTRILIRAAHDRKAAAETGNR 

FASSKSSNPFEPPSSRDVGQYFHLIRTTTDGRGL 

ASSRKWGTPTONETLHGRGLHGTC 

TRNAGPTNLFDDLQSPSSSOPKQARDRTPDQAVS 

lAVSVSRPSKNPITPSRRERRSPA 

GKVAEVAKTFORPPTPKLLASSATTASSATAQSL 

TLLKTRRGDA" 

/locus-tag="RB8132"

/locus-tag="RB8132"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78483.1"

/db-xref="GI:32445752"

/translation="MLEVAGVLTEI PFADPRLTF

RIDASENVTGGRNVLRQTHVDVID 



gene 89477..90796
CDS 89477..90796



gene 
CDS 



GFGLHPSLQIGPYVSDTLMQSHRS 
HLRVRGGI I PLGMI GRP I S S P I VDESRHTCDNRQ 
RYHPSDCRSCHCSSRLFSRAVIPL 
LTLNCHVDHGSICRSRS " 
/locus-tag="RB8134"
/locus-tag="RB8134"
/function="DNA synthesis and
replication"
/note="PMID: 93188700 best DB
hits: BLAST: embl:CAA19102.1;
(AL023591) hypothetical protein
MLCB1259.27; E=1e-95
swissprot:Q50739; YP59-MYCTU
HYPOTHETICAL 47.5 KDA PROTEIN
RV2559C; E=2e-91 embl:CAB93386.1;
(AL357523) conserved ATP/GTP
binding protein; E=2e-90 COG:
Rv2559c; COG2256 Uncharacterized
ATPase related to the helicase;
E=2e-92 ycaJ; COG2256
Uncharacterized ATPase related to
the helicase subunit; E=4e-86
HI1590; COG2256 Uncharacterized
ATPase related to the helicase;
E=1e-85 PFAM: PF01057; Parvovirus
non-structural pro; E=0.099
PF00910; RNA helicase; E=0.0011
PF00004; ATPase family associated
with; E=7.6e-20"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein-putative a helicase"
/protein-id="CAD78484.1"
/db-xref="GI:32445753"
/translation="MDLFADQEADHLFAAQPLAA
RMRPKKLSEFVGQQHILGEGKLLR 
RLIASGRVGSILLHGPPGTGKTTLAHLIASEQNS 
ELITLNAISSGVKDVREVLAKARD 
RVSAGDPRPLLFIDEIHRFNKSQQDALLADVESG 
I I S L I GATTSNP YFAVNAAL I SRS 
QLFGLEPVSVEDMRSLLKRAITDRECGLGNQNVT 
IDEDAIDYLSSASDGDARKALTAL 
EVAVHSHENPKASITRDDVAESMTSRIAGYDATG 
DDHYDLASALIKSIRGSDVDASLY 
WLARMLEGGEDIRFLCRRLVILASEDIGNADPQA 
LI lAVSAMQACEMIGLPEAQLTLS 
QTVAYLSLAPKSNATTSAI S AARRDVRDRQVI PV 
PKMLRCGHYTGAEELGHGDGYKSA 
HNTEEGVAKLDYLGVDRRYYKPVERGFESELASR 
LQKIRDQLGRETE " 
complement(90882..91109) /locus-tag="RB8137"

complement(90882..91109) /locus-tag="RB8137"

gene 90960..91187
CDS 90960..91187

gene complement(91189..92931)



/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78485.1"

/db-xref="GI:32445754"

/translation="MQFIELTGKSLLDVINEGEI

DMGQLHEAGVNGDS ILRINKFGEI 

ELRDRHEWVLVGGLLGNFEDRLRRITQLDWL " 

/locus-tag="RB8138"

/locus-tag="RB8138"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78486.1"

/db-xref="GI:32445755"

/translation="MTIPQLDFAELVDSQNRVTI

DAS FMELAHVDFAFVDNVQQRFAG

EFDELHGRNLADGFKDESFLIAPRRSSVQPA"

/locus-tag="RB8140"



gene 
CDS 



complement (92961, 

complement (92961, 
) 



gene 
CDS 



complement (93191 . 
) 

complement (93191 . 



gene 
CDS 



complement (93667 , 
complement (93667 , 



/note="best DB hits: BLAST:
pir:T47011; hypothetical protein
[imported] - Yersinia pestis;
E=0.020 embl:CAB58305.1;
(AL121854) hypothetical protein
SCJ33.06C; E=0.11 pir:E69891;
cytochrome c biogenesis protein
CycX homolog homolog; E=0.28"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78487.1"
/db-xref="GI:32445756"
/translation="MIQAQI AWMGTLPKTIGAC
VFAILLTQIVGGCTRQPSAESTDP 
ASGAPSLETNDGQSEETDQTSDDDLLLVGDSSAA 
STPESPATSNNTNGEVAAQPWDD 
PAVIDVPQSVASQLDGRRKQLANDLKPEQLRTFL 
SEADIEMRMITSGQSGIEDSATAF 
SELKRIATWKREASRRLIEHADATEKEKAIGRRG 
ELQSLSHLASLGDLKAAESLQLLA 
ESLQEDSDPEVRSDSQLVLIGFAIEDLRNGKSDA 
PSRWSQIDRLLQASSSPDAATLM 
VMGQAKDALLQFEHVEEASRVRAMILEEFVESGE 
AIAGESDAMGGLVDMARQIAGPSF 
QISEATARVQNLMETFITEANENTDPSDSAVS IN 
DWKAAIESLADEQPDLLTTQFLAG 
ASLEAETVGRLDLSEATYQVLDDKFASMQDDRGS 
EARTALQARDNREKI IGKTFDPDL 
PSTKGDELSMDDYRGKWLMPFWSAAFADSLLW 
DNLQKIARQYPEKVAIVGMNLDVQ 
S TDVPAFESRNKI S FPS FRSVSD PEAS VANS VAY 
RFGWSLLFVAVIDQEGKVAAIEF 
SGRDLTP WENLLR " 
93179 /locus-tag="RB8143" 

93179 /locus-tag="RB8143" 

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78488.1"
/db-xref="GI:32445757"
/translation="MQVRQVGVCTEPVLRITRQS
VPDS IGGVYFERLPSRINGRDRKC 
LCQISECRSSNPTRIERTVFELARWPVE" 
,93622 /locus-tag="RB8144"

.93622 /locus-tag="RB8144"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78489.1"
/db-xref="GI:32445758"
/translation="MALFPSDFAAKRMVCSGAIE
TVGTFFLVFGVLAPWLHYHQSRRQ 
CLGWRHPNRYRRQRGTEPYESPVSGRIAMLTETM 
PSLENVSMISTASQNETAPSDAEI 
TRRVMKIRSSWSVAERLRRRREADDRFADLLDAL 
SAEHTAA" 
.94122 /locus-tag="RB8146"

.94122 /locus-tag="RB8146"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78490.1"

/db-xref="GI:32445759"

/translation="MLANTKCRVLDPFPTQLAIF

YPMTAAFTFQTYVYGPDRGAFDVS 

METVAGRFEALEGLFFEWDGSFTWANQSQGWQID 



gene 
CDS 



94339. .9455: 
94339. .9455: 



(5--, 



tRNA 

gene 
CDS 



94716. .94788 



complement (95002, 



) 



RQLLMERLRTLFSAFGEPASVSLMRIPDRTWQDL 
QGFEKDVCSTNASDR " 
/gene="csrA"
/locus-tag="RB8149"
/gene="csrA"
/locus-tag="RB8149"
/function="mRNA processing
3'-end processing, mRNA
degradation)"

/note="PMID: 8393005 PMID: 7751274
PMID: 9211896 best DB hits: BLAST:
ddbj:BAB07336.1; (AP001519) carbon
storage regulator [Bacillus;
E=2e-07 swissprot:O83663;
CSRA-TREPA CARBON STORAGE
REGULATOR HOMOLOG; E=5e-07
swissprot:P44879; CSRA-HAEIN
CARBON STORAGE REGULATOR HOMOLOG;
E=6e-06 COG: BH3617; COG1551
Carbon storage regulator (could
also regulate; E=2e-08 PFAM:
PF02599; Carbon storage regulator;
E=8.9e-27"
/codon-start=1
/transl-table=11

/product="probable carbon storage
regulator"

/protein-id="CAD78491.1"

/db-xref="GI:32445760"

/translation="MLVLSRKKNES I VINNDIKI

VWEIRGDKVRLGVEAPREVPVHR 

REVYDAIQRNNEAFDANAPADSNDVS " 

/product="tRNA-Glu"

/note="Glu, tRNAscan-SE-score

56.70%"

/evidence=not-experimental
96339 /locus-tag="RB8150"

complement(95002..96339) /locus-tag="RB8150"



gene 
CDS 



96496. .9687 
96496. .9687 



/codon-start=1

/transl-table=11

/product="hypothetical
protein-signal peptide and
transmembrane prediction"

/protein-id="CAD78492.1"

/db-xref="GI:32445761"

/translation="MVSLHSNGAAPSFHERCVFA

VKSRVISGGLFCLWWMASSASAQ 

PDFERPPIDYLNAEVNDPVAKLAKRIEAGEVALN 

SDPKFGYLPSVLEALDVPLSSQTL 

VFS KTSLQLHR I S PRRPRAL YFNDDVYVGYCQHG 

DVLEFASTDAKQGAIFYTLSQSEE 

KEPEFVRDRGGCLSCHASSRTQNVPGFLIRSVFA 

DAAGRPKLGSGTFTTDHTSPFDER 

WGGWYVTGSHGSMRHMGNVI CTDEAQELDRESGA 

NQDDLGEYFRTDSHLTPHSD I VAL 

MVLEHQTQMHNAITAANFETRQALHQSYQMNELL 

EREPDFISESATRRIESSADRVLK 

YLLMCDEFTLTDSVAGTSTFAKEFASMGPRDPEQ 

RSLRDFDLETRLFRYPCSYLIYSD 

SFMELPAEVKSRVLEKLKHILSGEDQSEAYQHLT 

GKMRSEILEILRATHPDFQ " 

/locus-tag="RB8152"

/locus-tag="RB8152"

/function="transcriptional
control"

/note="best DB hits: BLAST:
pir:C75577; conserved hypothetical
protein - Deinococcus radiodurans;
E=1e-13 pir:E70324; conserved
hypothetical protein aq-268 -
Aquifex aeolicus; E=3e-07
swissprot:P44675; YFHP-HAEIN
HYPOTHETICAL PROTEIN HI0379;



Predicted transcriptional
regulator; E=1e-14 PFAM: PF02082;
Uncharacterized protein family
UPF00; E=6.3e-10"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein-putative a transcriptional
regulator"

/protein-id="CAD78493.1"
/db-xref="GI:32445762"
/translation="MGSQVGKPVPADHLAEQTKV
PRRYLTRVLQDLGAAGLVRSRPGP 
GGGYELCVPTDTLTILDWNTVAPVERIRKCPLG 
LKSHTDLCPLHTELDRVYAATEEA 
FRGVTIDDLVNSTNPILPLCES " 
complement(96932..98392) /locus-tag="RB8153"

complement(96932..98392) /locus-tag="RB8153"

/note="PMID: 8168477 PMID:
10360571 best DB hits: BLAST:
swissprot:P56727; Y929-THEMA
HYPOTHETICAL PROTEIN TM0929;
E=1e-09 embl:CAA57668.1; (X82178)
orfl [Thermotoga maritima]; E=0.34
gb:AAA65436.1; (U24145) orfl;
upstream of ferrodoxin; Method: ;
E=0.34"

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78494.1"

/db-xref="GI:32445763"

/translation="MSDSAASGKTDSKRDAKARE

RITRWESWYLVDPLLFAAWTLHE 

WAKPE I ATLRVAHGRVEFNPAFVSSLKRDELGE 

VLRFEAMRILLGHPYARRQPKTEL 

SYSASNLTVQECLRSKLPMPRPREVLGSETHDHQ 

YFEYYYRQLAEREGDQPDPSEDST 

PEPDQSDDSKPGEGTSHPEETLGDDSSDAGDSGN 

DAVAEQRDETGDESSVEEDDGSSA 

ESNEPTPDSPSLEQAADSSVGNDLESYVDPSQTG 

AENADQWDADDLLHEE I KS AVADA 

AENGGWGTLPGHAQERLLATLRPPLDYRS ILRQF 

RQSVLSVDRRLTRMRPSRRYGFAQ 

MGSRYDFTTRLLFAVDVSGSMSHRDLQNGFS I IN 

RFFQYGIRSIDWWFDTQIRCEPL 

TFRSARRDVSITGRGGTDLGCVTEFIDEHRGYDG 

VWFTDGDSPHPKRPANRQTRILW 

LFNTQAAFRKSASDLKSLGHVAFLKPSRVGSPVH 

AS " 

complement(98389..99483) /locus-tag="RB8157"

complement(98389..99483) /locus-tag="RB8157"

/note="PMID: 99287316 best DB
hits: BLAST: pir:B72315; conserved
hypothetical protein - Thermotoga
maritima; E=1e-25 pir:T35221;
probable ATP/GTP binding protein -
Streptomyces; E=5e-15 pir:A75429;
conserved hypothetical protein -
Deinococcus radiodurans; E=1e-07
COG: TM0930; COG0714 MoxR-like
ATPases; E=1e-26"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78495.1"
/db-xref="GI:32445764"
/translation="MAVRVDVHEL IQLLEITPPD
QNIMLVGRHGIGKSQI lAEHFSSR 



gene 
CDS 



complement (99522 . 
3) 

complement (99522 . 
3) 



gene 



CDS 



complement (100220 , 
90) 



complement (100220 , 
90) 



gene 
CDS 



complement (100987 , 
85) 

complement (100987, 
85) 



PPFWWPSDEEPIVLFLDELNRARP 
EILQSVMELALNKTLAGKRLPQGSVIVSAVNEGD 
EYQLTDLDPALVSRFNLYQFAPTV 
DDWLLWADQHKVDDRVTGFVQQQPQYLDGEGISP 
SAEDANSMSGLVKTPDRRGWARVS 
SMLQGVETIQPVHVKLIAGWGSPAAMAFRRSLA 
SRLQVSPDQVLLQFTKHRKAIEKM 
ALGELLMLNEQLLVHLVGDPYDDADRTKALKGML 
SYLKLLRKRKLDEAVAHLVSMTDQ 
PKYEPAMEFFAESMELTTLLTEYVDGIRIQ " 
10022 /locus-tag="RB8158" 

10022 /locus-tag="RB8158" 

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78496.1"
/db-xref="GI:32445765"
/translation="MKLPLPSVRKLFREHFSGES
PIEIDPGQRAALIKQWRSKLEKIE 
GVRLTNHPGDRDSRSPTWVRFTIPDPNASQTYYR 
SDELRLERNEQGELECVFGKFDRE 
I AGP I SDFGEVESLVAEVLRRYVKRHAGEAKRAK 
VRSMKSKAIVARVKALAKEEQFDF 
ATSMDTQKLRLFVKLSSQHMIEIHIPFTRFEKVL 
PQLQETIRTLRSLYEDGLRFKMIG 
FMQADWRTEWIRYEE " 
.1009 /gene="exoA" 

/locus-tag="RB8161"
/note="synonym: apeX"
.1009 /gene="exoA"

/locus-tag="RB8161"
/EC-number="3.1.11.2"
/function="DNA repair (direct
repair, base excision repair and
nucleotide excision repair)"
/note="PMID: 1708495 best DB hits:
BLAST: swissprot:P37454;
EXOA-BACSU EXODEOXYRIBONUCLEASE

pir:; E=7e-78 pir:F64710;
exodeoxyribonuclease -
Helicobacter pylori (strain;
E=3e-72 pir:C71809;
exodeoxyribonuclease -
Helicobacter pylori (strain J99);
E=2e-71 COG: BS-exoA; COG0708
Exonuclease III; E=7e-79 PFAM:
PF01181; Deoxyribonuclease I
(DNase I); E=0.25 PF01260; AP
endonuclease family 1; E=2e-108"

/codon-start=1

/transl-table=11

/product="exodeoxyribonuclease"

/protein-id="CAD78497.1"

/db-xref="GI:32445766"

/translation="MIETLKLISWNVNGIRASMD

KGFREFVESEQPDVLCLQETKAEP 

EQVDLSWADELGYHQVWNTATKRGYSGVSTWSRV 

EPLKVTKGLSIEEHDNEGRVLTTT 

FDDFHLVNVYTPNSQRGLARLDYRMQWDEAFLDY 

VKKLNRRKPVLFCGDVNCAHKEID 

LANPKANRKNAGFSDQERAGLDAVTEAGFIDSFR 

QFHDGPGHYSWWTYRSDARARNIG 

WRLDYFWVAKNFWDRVADARIRCEIHGSDHCPVE 

LTLT" 

.1016 /locus-tag="RB8163" 

..1016 /locus-tag="RB8163" 

/note="best DB hits: BLAST:
pir:F75496; conserved hypothetical
protein - Deinococcus radiodurans;



Vng0533h [Halobacterium sp.
NRC-1]; E=5e-34 embl:CAB93731.1;
(AL357613) hypothetical protein
SC5F8.03C.; E=9e-30 COG: DR0625;
COG2013 Uncharacterized ACR;
E=1e-64"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78498.1"

/db-xref="GI:32445767"

/translation="MSERRYSLDRFLEKTRDKDL

NQGLFELESDRMLDINLNGEVWTK 

MGSMIAYTGNVKFEREGILSQGLGNLLKKAVSGE 

GTALTKVSGKGSVFCADSGKKITI 

LELQNEAICVNGNDLLAFEMSLNYNIKMMKKIVITA 

MLAGGLFNIRLEGTGMVAITSHYD 

PITLPVTPHQPWTDPNATVLWSGQLEPQLKTDV 

QFKTI IGRGSGESLQFLFQGHGFV 

WQPYEEWFQNQS " 

complement(101702..101863) /locus-tag="RB8165"

complement(101702..101863) /locus-tag="RB8165"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78499.1"

/db-xref="GI:32445768"

/translation="MILRSQQPARTDETRRGTTG

SDGIATLRLVWHPPDAFNGWGVQ 

SDWLFLSP" 

101882..104650 /locus-tag="RB8167"

101882..104650 /locus-tag="RB8167"

/note="best DB hits: BLAST:
pir:G71299; conserved hypothetical
protein TP0648 - syphilis; E=0.027
pir:S49376; hypothetical protein 3
- Pseudomonas aeruginosa;
E=0.11 pir:F69210; conserved
hypothetical protein MTH83 -
Methanobacterium; E=0.20 COG:
TP0648; COG0457
TPR-repeat-containing proteins;
E=0.003 PFAM: PF00515; TPR Domain;
E=0.0036"

/codon-start=1

/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78500.1"

/db-xref="GI:32445769"

/translation="MRSLPPAFPTTHQSMRPVLV

NASRFLVDNQYDDQLLEVIDLGHR 

SRTLHGQFGQTMTTTANCLLWCVLLVSLLTPAP 

VFAASYDDAVQQFCNGQYDAAATT 

AASEVERGVWSERWPRLLIQCQLTQGDYADALQT 

YREALQRYPTS I ALRYMGLDVLRF 

NGLHDEVGTAEADLFAQMQRAFAGYITRDNLIAA 

GRFLTGRGEDARKVLEMFYDRVRD 

RDPDYLDAYLATAELAIRKGDFQVAANTLQQALK 

LEEETPDLHHLLAKAMESSDGQAA 

AEQIAIALRINPQHLPSLQWLAERAIDRERYDEA 

KDVANQMLQINPHEPSAWALMAVI 

SHLRGEYDVEKLMRAAALSTWEQNPNVDHLIGRK 

LSEKYRFAEGAAYQNLALDADPLH 

LAASFQLAQDMLRLGEEEIGWEIAGEVAKSDPYN 

WAYNLMTLRDRTAKFTVLKQDGI 

QVRMDATEAKLYGDAVMELLLDAKQVLCEKYDEM 

PDKPILVEIFPHQNDFAIRTFGLP 

GGAGYLGVCFGRVITANSPASQGERPSNWKSVLW 

HEFCHWTLEKTNNRMPRWLSEGI 

SVYEERQRNPSWGEKMSPQYRSMLLSDDLTPVSD 

LSAAFLSPPSAIALQFAYYESSLV 



gene 104647..105831
CDS 104647..105831

gene 105881..106900
CDS 105881..106900

gene 106851..107960
CDS 106851..107960



GSLRRLDAQFDEYAKKRANQFGAL 

VDWSRDTLPESGDLATWKGWTRANPTNYWGLREL 

AKSAIEGEQWEQ7VLIPLSRMQQLG 

VLTGERGGPLEWLARAHRELGQDRQEIRAIQDNL 

SQSSDALPALRRWINIGQSEEQWE 

NVLDASQQALAIQPLLPEFHLASAVAAEKLDRHE 

LAVEALSALLALDPVDPAALHFRL 

ANAYDEQNESFPAKHHALMALELAPRYRDAHRLL 

WKLHHAVSEENPADAEQNDAENVS AVEEEEAL" 

/locus-tag="RB8171"

/locus-tag="RB8171"

/note="PMID: 11259647"

/codon-start=1

/transl-table=11

/product="hypothetical
protein-transmembrane prediction"

/protein-id="CAD78501.1"

/db-xref="GI:32445770"

/translation="MTPKNLNLMGILRALARGSV

HSHSPLPRPSSGERSDAAFSRRPG 

EGADHMAVEIAMPQNAPCHADAPSPEQRLKGVLD 

LSPNEIGGEVTTAFTVGDKTQTRQ 

QSTPLTFSHKPFGHRRWILAALIVLMGGAAFAQR 

GYWGRRNRSLPTDRNGVPTWEVDP 

RFPQDSFTFARIEYDSYGGRGRGGGGCWTDYPDS 

DLNFSLRLQQLTSLKVNPDPWIR 

LTDDELFDYPFIYIIEPGGLVFSEEEVKALRKYC 

LNGGFLMVDDFWGDSQYENMRSEL 

KRVFPDRDPFEVPLEHEIFHIVYDLKEKPQVPAI 

NSARRRADGSVGSWEWSRDGSDTS 

VPHYRAITDDEDRIMVFICHNTDLGDGWEREGED 

QWYFDEFSVKKAYPMGINIVTYAM TH" 

/gene="moxR"

/locus-tag="RB8173"

/gene="moxR"

/locus-tag="RB8173"

/function="C-compound,
carbohydrate catabolism;
transcriptional control"

/note="best DB hits: BLAST:
pir:C83106; conserved hypothetical
protein PA4322 [imported] -;
E=2e-61 pir:G71146; hypothetical
protein PH0385 - Pyrococcus
horikoshii; E=1e-60 pir:H82493;
MoxR-related protein VCA0175
[imported] - Vibrio; E=2e-60 COG:
PA4322; COG0714 MoxR-like ATPases;
E=2e-62 PFAM: PF00004; ATPase
family associated with; E=0.0029
PF01078; Magnesium chelatase,
subunit; E=1.7e-06"
/codon-start=1
/transl-table=11

/product="probable MoxR-related
protein"

/protein-id="CAD78502.1"

/db-xref="GI:32445771"

/translation="MHFMTTDTMTHEDEAQWEQ

IRDGRERIVQELSKVIIGQEEVIE 

QLLICLFAGGHCLITGAPGLAKTLLVSSVAKIFH 

LNFQRIQFTPDLMPADITGTEILE 

QSADGHRQLQFVKGPIFANVILADEINRTPPKTQ 

AALLEAMQEHQVTAGGQRFELEEP 

FFVLATQNP I EMEGTYPLPEAQLDRFLFNVLIDY 

LPPKDELAWLQTTSTKPEPIQPI 

FTGEDVARFHAAVRRVPIAESIAAYAVRLVAATR 

PGRDGTPDFVNQYASWGAGLRAAQ 

TLVLGAKARALLNGHAHVRTED IQALAHPTLRHR 

VLLSYRAEAEGFNVENLVTRLLQE IPTEV" 

/locus-tag="RB8174" 

/locus-tag="RB8174"

/note="best DB hits: BLAST:

swissprot:P71761; YE80-MYCTU

HYPOTHETICAL 34.3 KDA PROTEIN



conserved hypothetical protein
VCA0174 [imported] E=1e-12
pir:H83261; hypothetical protein
PA3071 [imported] - Pseudomonas;
E=2e-12 COG: Rv1480; COG1721
Uncharacterized ACR; E=1e-18 PFAM:
PF01882; Protein of unknown
function DUF58; E=8.4e-16"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78503.1"

/db-xref="GI:32445772"

/translation="MRTWSHDFFKKS PRKSELSW

FQRQSRSSVKATNGSTGTLQGNSS 

KSSRAAASPAGGNGKTAASMDPHALMRIKNLQLR 

S KL WEGFFGGLHRS PLHGAS VEF 

S E YRAYS PGDD PRGLDWKLFARTDRYY I KKFEDE 

TNRRCYLLVDQSQSMGYGSLEYTK 

MEYARTLAATLAYFLTLQRDHVGLMTFDDTIADV 

VPARSRVGHLRQILACLARPTSGS 

GTDVNGPISQIAAITRRRGLWLISDMLSPTEDL 

QRSLALLRSRQHEVIVLRILDPNE 

VNLSVDESKVLVDMETGRKIQVDPDATRKAYQAA 

FQEHAETIESICNAVGAAIYTSTT 

DRPLQEALSDFVSAHQRRAVGASRSGMLSSGGTF 

G" 

gene 107957..110071 /locus-tag="RB8176"

CDS 107957..110071 /locus-tag="RB8176"

/note="PMID: 2450046 PMID: 2446322
PMID: 2462430 PMID: 3663330 PMID:
2476436 PMID: 1384548 best DB
hits: BLAST: swissprot:P19823;
ITH2-HUMAN INTER-ALPHA-TRYPSIN
INHIBITOR HEAVY; E=0.69
ref:XP-011910.1; similar to
inter-alpha (globulin) inhibitor,
H2; E=0.69"
/codon-start=1
/transl-table=11
/product="similar to
inter-alpha-trypsin inhibitor
heavy chain H2 [Precursor]"
/protein-id="CAD78504.1"
/db-xref="GI:32445773"
/translation="MSMLAPFFLAGALAVAAPIL
FHLIRQHPKGAVPFSSVMFLREVP 
PRLTRRSRLDQWPLLLLRALALLLLAAAFARPFL 
RGADDQFNNTPVRRIAVLVDRSAS 
MQREDLWEQAIQKAETVLTDLSAQDQFAVFVFDT 
KIERIWETSLAATSDERQNALALQ 
ALTEVQPTWRATDLGTALREAADWAQWPTES PTA 
GEDGLADEEAITRGQLPGPAS I VL 
ISDLQDGSKLDALQQNTWPEQVTLDIRRWPKEE 
GNATAIMMNDQTLENLQESSASNA 
NRKLPVRWNSKLASQFDLTLQWNDASDKQTIQV 
EADATQI IRLDQPADSVTAMVLSG 
DNSPFDNTVHWVAPEKRSYRLLHVGDVQDDPRES 
LFYYLERVPLDNATRTVS IQTTTV 
AKLPTDLDPRECPLMWSDVANTDAFIQSLVTYT 
EAGGRVLFVLDESDDAKDLITCIA 
SVTDNDSLSAEEATVDDYAMWSRVDFSSPLFASM 
ADPQFNDFSKIQFWNHRNVSGLEE 
AGWNILANFDSGAPALAQTGNVWLLTTGWQPSSS 
QLALSTKFIPLIAQMFTGNEGTTR 
YVAGHWGTPLPWPATEGAELVRSDGESSPYEQA 
SDADFIDRPGVHQLKLPDEAPITF 
ASNLSESESQTDAMDEDTLERLGIGLGEAATAEK 
LTQATLQLRDRELESNQRLWQWLL 
LAAIGMLVIETLWGGRYAKRIA" 
gene 110052..114008 /locus-tag="RB8177"

CDS 110052..114008 /locus-tag="RB8177"

/codon-start=1
/transl-table=11



protein-putative transmembrane 
protein" 

/protein-id="CAD78505.1"

/db-xref="GI:32445774"

/translation="MRNELLDCIDPVIRRMRRLR

FWQSLGWLAWTITLIVLLWLVWTQ 

RPLAFLSSMTLSTSSLSTSVAIVSTLVIAMVACL 

lAARLRIRNIQSVARKIEHRYPDL 

DQRL I TAVDD S VAKQPDNYLNQRVMREAREHARR 

NDWPDWSSRWLLVNRLFGLSGTL 

ATLALAGFFLLAKPDSSAKASSWLPKPDWIQP 

GSMEIERGTNLLVTAEFQADWIGR 

QSAVEPELVLTAIDNQPVANVANVANTETVPEDQ 

AEPRVARLPMRRSMADP I WGGLVP 

EVDESFTYRWLPHWSSEEYRVEVFEYPAVTRSD 

AEIDYPEFTGLQNRRIEDTVRVSV 

AKGSEVTWWVRINKPVTSAQLFPKPYKKNEANYE 

DSAPLEMTQDPERPLWWSVAIKVE 

ESQRYDVELKDDKDRTNQLSTQFIVRMLPNQLAK 

IRPTVGGDVSVSALEEFLIAAEVR 

DDHGLARAGVSLTFAGKPPQDIVLAENIAANQKQ 

TIQHMVDMESLEAKPDDLLSYSFW 

AEDKVAGESSETEVRRTQSDLYFAEVRRFEDIFR 

EGDPQSSQPSEPSEPSEQEQQAEE 

LIETQKQI IQATWKLLQSTVRWNEQTTADVRVIA 

DSQQDVIAQLDELAQELNEAEAKQ 

HAANVRQSMQTAVAELNRAIQTSDREPLSAAWA 

EQPAYAGLLRMRASEFEVTRSQQQ 

QQPSQGSAGQQRKQQQLDELELDQEQDRYETQQQ 

AQMTPEQQAQREIRQVLSRLSELA 

RRQQDLNEELAQLQSAMQLADDEEEKEELRRQLE 

RLREQQQDMLRQTDELMERMNS PE 

NASNMQEASEQLEQTRENIREAGEAMEQQDAETA 

LSSGKRAERELESVREEIREQAAG 

QFNEAMQSMRDRARELNTQQKELEEQLSEMDAPS 

ESPGLRSSGNREELKDALTEQRES 

LGELMEDIEEWQEAEESEPLLAQNLYDAFRETQ 

QRRTEERLDAASQLLQRGFDEQSQ 

QMAGQAGEAIQELSEQIETAAESVLGNEAEGLQR 

AASLAEQLESAVSDEVQQARGDDS 

GGAGQGETSEPTERQPGEDKPQQGQPGENQPGQG 

QEQGEGEGQPGQQQPGEESQPGQG 

GPSQSQDGQEPGEQSAGESETGQPSNPPSQSTSP 

GQGGAGQGGQAPDEEPSTEPTPGE 

PSENTEPQESTERPQGPPGLRPSSQSQQGQPQPN 

QPQEGQSGGPNPSAAQAPSSPSNF 

TSPLTGEGFREWSDGLRDIEEMVSDPELRSRATR 

IRERARDMREEIRRESKPPQWDLV 

EEMIVEPMRELKRDLAEEYMRRSAKKQSLVPIDR 

DPVPDQYSEAVRRYYERIGSGL " 

/locus-tag="RB8181"

/locus-tag="RB8181"

/note="best DB hits: PFAM:
PF00672; HAMP domain; E=0.22"

/codon-start=1

/transl-table=11

/product="hypothetical
protein-transmembrane prediction"

/protein-id="CAD78506.1"

/db-xref="GI:32445775"

/translation="MNDAFNSSDNWSQHLVWGAP

EWIFPAGILAWLTAAIGWSYARS 

RATT P I RVLAATLKL I AVALMLLCLLQPMWRGER 

PRPQANLFPILVDTSQSMQLQDVD 

EELSRGQNFANALAADSDWRVRLEQTFDVRLFGA 

DSRLSRMADIDTIEFDGMQSSIQT 

S IDALSQRLKGRPVAGALLFSDGNLTDPSAEQIT 

GTDSPFPIFPVLPDAPTTPTDLRL 

ERVS VTQSDFETS PMTVRAD VMATGLKDQD VWQ 

LLNASGTSVEEKTVTIEKEGELQS 

VTFRFRPEQSGTQFYRVLAFTPSQRQSIRSQIDE 

GNAWLEGSSGDEATLRNNQRWIDV 

QPRRGPFRILYLAGRPNWEFKFLRRALQSDAETQ 

LVGLLRIADKEPKFSFRDKGVNSS 

NPLFAGLGEDEEETREQYDEAVMIRLGVKESEEL 



misc-feature 115650 1156

gene 116386..117576
CDS 116386..117576

gene 117573..118352
CDS 117573..118352



gene 
CDS 



complement ( 1 1 8 3 3 9 . 
34) 

complement (118339 , 
34) 



EFFTQDQMLLLRQFVSLRGGGLLMMGGQESFNDD 
SFGKTPLGELSPVYPPRNRSDQPA 
PAGKWSITREGMLQPWQRLRETEAAEKLRLLKMP 
PFRVTHEVGDIKPGAIELAQLETS 
SGQPAPAFITQRFGKGRTAAMPIGDFWRWSMRKN 
QYDNTTEVSDDPAQAWRQMARWLV 
GEVPRRVELTIQPDGSTGSTNLRVLVRDESYLPM 
ENATVTLNVTGEDGVS I PLTARPD 
ANELGTYAATFVSRDSGRFTVSADVKAADQSFIG 
NDEGGWVSDPGMEETQRLSWNENW 
LKNLAESTGGEVLRVSDLGDFVSDLPNREIPVTE 
TWVYPLWHQTWVMLLAMACLCGEW 
GLRRWKGLA" 
/locus-tag="RB8181"
/note="cosmid pircos-d3d05/ cosmid
pircos-c3h11 joining point"
/locus-tag="RB8186"
/locus-tag="RB8186"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide prediction"
/protein-id="CAD78507.1"
/db-xref="GI:32445776"
/translation="MRRHGTSTMEGTRMSNRLKM
LRRKSWPKLCVATALCLPVSTVAS 
GQEPDTLQESGSGQTSDSGQDTSSNEPATKPDVA 
SESARTDSDETQVMRGDLIVWGA 
AGDPTYEPLFREWSDTWMKLAKTAKLNAIRIGSE 
SLESEQTDQQLLEQALQGLETDSP 
RPLWISLIGHGTFARNVAKFNLRGPDVSADQMKD 
WLAGFQRPWWNNASSSGPFINA 
LSAPNRVIVTATKSGNEQNFARFGQYFAEALAAI 
DSDLDHDDAVS IREAFVRASKDVE 
LFYEAEDRIATEHALMDDNGDGLGTKADDLRKKV 
LTPEQLAKLDGKQAGRMT I APVAN 
SLNWTDDQLKRRDELEDRLDALRMQRLSDPAAFP 
DNQWADHLEKILVPLAELYHEVES NSPP" 
/locus-tag="RB8188"
/locus-tag="RB8188"
/note="best DB hits: BLAST:
gb:AAG54864.1; AE005231-11
(AE005231) glyoxylate-induced
protein; E=9e-21 gb:AAG59852.1;
AF284750-1 (AF284750) HT036-ISO
[Homo sapiens]; E=1e-20
swissprot:Q44015; YGB4-ALCEU
HYPOTHETICAL 28.3 KDA PROTEIN IN
GBD; E=1e-20 COG: gip; COG1082
Predicted endonucleases; E=3e-21"
/codon-start=1
/transl-table=11
/product="probable
glyoxylate-induced protein"
/protein-id="CAD78508.1"
/db-xref="GI:32445777"
/translation="MNSPARLKASVCVDAVLENL
STVDAMKLVADCGYPAFEFWKWWD 
KDLDS INQAREELQLQVAAVCTKFVSLVDPDVRA 
EYLAGLKESIAAAGSLNCPVLISQ 
VGDARPGVDRAEQRDCL I AGLKEAAPMLEDSGW 
LAIEPLNELIDHAGYYLVRSDEAF 
SIIQEVDSPNVKVTFDIYHQQISEGHVIANLTQN 
VDAIAHFHAAGNPGRHELDRGELH 
YPS I FAAI ADTGYSGHVGLEYWPVNEAAAGLREA 
ASWIKKV" 
1211 /gene="deaD" 

/locus-tag="RB8189"
1211 /gene="deaD"

/locus-tag="RB8189"
/function="DNA repair (direct
repair, base excision repair and
nucleotide excision repair)"



BLAST: pir:G82830; ATP-dependent
RNA helicase XF0252 [imported] -
Xylella; E=1e-151
swissprot:Q11039; DEAD-MYCTU
COLD-SHOCK DEAD-BOX PROTEIN A
HOMOLOG; E=1e-128 pir:JX0314; DEAD
box protein - Klebsiella
pneumoniae gb: ; E=1e-128
COG: XF0252; COG0513 Superfamily
II DNA and RNA helicases; E=1e-152
PFAM: PF00270; DEAD/DEAH box
helicase; E=9.2e-70 PF00271;
Helicase conserved C-terminal
doma; E=7.8e-35"
/codon-start=1
/transl-table=11
/product="ATP-dependent RNA
helicase"

/protein-id="CAD78509.1"

/db-xref="GI:32445778"

/translation="MNEKRRRGCVGMQGFEGPGG

LNRTADNQIQGSERVLLVWHSILT 

NS AATMTACETQRGCGYLFKRTS FLMSDHPASNE 

TLVD PTS ELNS QERAQDTFEMS TT 

ETWDPSAQWAAHYANPVIDPSEMPSPAVDESLL 

FYQADCTDEASDVIVEVQAEPAIE 

LNTEPVQVEIAEPEQHVQEQPKSQEQPAESDAVA 

SPDAVIAEPEVAREPEPANEAWE 

SQPVATMAPPWEEPVAAAETAASPEPWEPEW 

AEQKPAVQSEPVAKQEQSVREELA 

PPAQSQSAQTEEPAKAEKAPESNEPAEPLFRDLD 

LRAEVQKAVAASGYDKPTPIQAEI 

I PYMLDGRDVLAQSQTGTGKTAAFALPILSRVDV 

RSRRPQVLVLAPTRELAIQVARSF 

TKYGADLSGFQVAAIYGGQDYEPQLKQLRRGVQV 

WGTPGRVIDHVPCRGTLDLGSLDC 

LVLDEADEMLNMGFLEDVQFVLEKSPEGRQVALF 

SATLPKPIRKIADEYLNDPARITI 

KSKTITAASVRQRALFVSPRDKIDALTRILEVEE 

TDGVIVFTKTKDSTLSVAEKLSQQ 

GFSAIALNGDMPQKVRERTIDQLKRGQLDVLVAT 

DVAARGLDVPRISHVFNFDLPHDS 

ES YVHRIGRTGRAGRSGQAI IFLTNAQRRQLRFI 

ENTTKQP I E I ID I PTVAD INEARV 

RRFKQRITDVSADQDLTVYKDMIAQYAEESGKPM 

EMIAAALAEMSQNGRPFLLKERPK 

KQRERSDRDNSRGRDSFDSNDRSGNRGSFGDDRP 

RRPKGRPLTPVKEGMTRYRLDVGW 

QDGVKPGNIVGAVANEAGIDGEYIGPINIRDSHT 

LIDLPEGMPSDIYQTLRKTWVAGK 

RLNLEEAGDWDDFEGDGGRSNFRKEGFRKGKFS 

GNGGSGGKNFSGGKRSFGNKSRNN 

GSTAPTAKRGKKKFKPS " 

121096..121314 /locus-tag="RB8191"

121096..121314 /locus-tag="RB8191"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78510.1"
/db-xref="GI:32445779"
/translation="MHSDTTTTPLLIHQKESVSS
DQSRRSTDWVATRVETERITLKTC 
RRKVDSRRQFARRRLLWPRRVIESSILQ" 

complement(121297..121404) /locus-tag="RB8192"

complement(121297..121404) /locus-tag="RB8192"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78511.1"
/db-xref="GI:32445780"
/translation="MRWLLQRVLLAGEGRGNRIR
GNPNEILVFLLLQNR"
121501..121656 /locus-tag="RB8193"



gene 
CDS 



gene 
CDS 



gene 
CDS 



/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78512.1"
/db-xref="GI:32445781"
/translation="MQHLRGFPTHYSGKKALDSS
QSEMKDFEPHINMLKLKPVQISFRVWLLRAN"

121684..122916 /locus-tag="RB8194"

121684..122916 /locus-tag="RB8194"

/function="type II protein 
secretion system" 
/note="best DB hits: PFAM: 
PF00114; Pilin (bacterial 
filament); E=0.038"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-putative conserved
hypothetical protein"
/protein-id="CAD78513.1"
/db-xref="GI:32445782"
/translation="MHKRFSRPRGFTLVELLWI
AIIGVLVGLLLPAVQAAREAARRM
SCSNNFKQLGLAAHNYHSAYNNLPVQGSGTYAVG
GRDPWDGNPNGDISSNYRLSYLVG
MLPFLEQQGLWEQISNPLSINSDGTVRSPSWQAM
GPHPDRVQYPPWATELPTLRCPSD
PGSGLPSLGRTNYAACHGDSAVYSRDSYLDVDEV
AEGGKLPYISDTAHAAASSASHRG
MFVTGRHIVIKFRDTLDGLSNTIMCGEIATDLGDND
KRTTVATNVGAHAAPNEKNQCRLD
PSYAEPYLDPERPQFWTSTVTKQWWGRGYRWHD
YMPPYSQMTTILAPNAQLCSEGTD
HRDWSPPSSRHQGGVHVLMGDGAVKFITDSIEA
GNKNAPQVSDRAGSPAAGSKSPYG
LWGALGSRSAKEVIDGEF"
1233 /locus-tag="RB8195" 

1233 /locus-tag="RB8195"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78514.1"
/db-xref="GI:32445783"
/translation="MWFFSACCFGNRVGLAGYKK
SGPLLLAAHFCLAHESKGAGERVF 
ARRMQSGATSCLRILLTLRLLLRVQLRIRPTFPR 
PSCRRCCYCFQSRIRWRRASRAF 
RRSASRFSSIHPIVRTKSIQNEQLHSLR"
123410..124729 /locus-tag="RB8199"
123410..124729 /locus-tag="RB8199"

/note="best DB hits: PFAM:
PF00114; Pilin (bacterial
filament); E=0.0077"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-putative conserved
hypothetical protein"
/protein-id="CAD78515.1"
/db-xref="GI:32445784"
/translation="MRVLQRRQPFLGCSVAQDFW
KLDSLRHQMQRNRKRRRGFTLVEL 
LVWAIIGVLVGLLLPAVQAAREAARRMSCSNNF
KQIGLAIHNYHSAYNNLPVHASGT 
YAVGGRDPWDTNPGREISSNKRLSFLVGTLPFLE 
QQGLWEQISNPLSVNSDGSTKSPP 
WQPMGPHPDRVQYPPWATEIPSFRCPSDPGRGLP 
SLGRTNYAACMGDSAVYSRDSYLD 
VDEVASGATLPYRTDTAHAAASNSAQRGMFVNGR 
EVQFRDTLDGLSNTIMCGEITTDL 
GNNDKRTTVPTNTGGHAAPNEKNQCRLDPSYAEP 
YIDPERPQFWSPSGLTKQAVWGRG 
FRWHDYMPPYSQMTTVLSPNRELCSDGRDHRDW 



complement (122 964 . 
56) 

complement (122964 . 
56) 



gene 
CDS 



complement (124 820 . 
06) 

complement (124820 . 
06) 



gene 
CDS 



complement (125548 , 
49) 

complement (125548 
49) 



gene 
CDS 



126234. .127472 
126234. .127472 



DSIEAGNKNAPQVSNRAGSPPPGSPSPYGLWGSL 
GSRAAKEVIDGEF" 
1255 /locus-tag="RB8202"

1255 /locus-tag="RB8202" 

/note="PMID: 8407798 best DB hits: 
BLAST: swissprot:P50619;
YMAB-BACSU HYPOTHETICAL 23.4 KD
PROTEIN IN; E=1e-17 pir:G72261;
conserved hypothetical protein -
Thermotoga maritima; E=9e-13"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAD78516.1"
/db-xref="GI:32445785"
/translation="MTAPVGLNFQLVHDLDCRMS
SNATTEEHVLVIPATLIESIGHLE
GFDRNVDRFLQPILASDELSYRPRGAMELDPSFK
QLIPYWMQWTDPADGAVKWFQYT
RGGGSGEKRLHAKRSIGVGGHISQEDAGGEEDPY 
TIGMRRELEEEVTIGAPYVDQREG 
LLYDPSNEVGRVHLGWHRFILEQPLVTSNEAEL 
AEGQFVSVKQLRQEREHLETWSQL 
TLDALFPESV" 
.1262 /gene="mpg" 

/locus-tag="RB8205"
.1262 /gene="mpg" 

/locus-tag="RB8205" 
/function="DNA repair (direct 
repair, base excision repair and 
nucleotide excision repair)"
/note="PMID: 95134354 best DB
hits: BLAST: embl:CAA52615.1;
(X74509) N-methylpurine-DNA
glycosirase (MPG); E=9e-26
pdb:1F4R; A Chain A, Crystal
Structure Of The Human Aag Dna
Repair; E=3e-22 gb:A7^58369.1;
(M71215) alkyl-N-purine-DNA
glycosylasee [Homo; E=5e-22 COG:
APE2247; COG2094 3-methyladenine
DNA glycosylase; E=8e-18 PFAM:
PF02245; Methylpurine-DNA
glycosylase (M; E=2.2e-44"
/codon-start=1
/transl-table=11
/product="N-methylpurine-DNA
glycosirase (MPG)"
/protein-id="CAD78517.1"
/db-xref="GI:32445786"
/translation="MMNDAKRNQFREDDEEDRLI
DWGSTQSLQSQFFDRRPAWARQL
LGCGFARRIEGVWVGGWIVETEAYLSSRDAASHS 
ARGEKPGNASMFGRPSTLYVYPIH 
AKHCVNLVTESVGCGSAVLIRALQPVWGIDRMFQ 
HRGLHRSETTDGRALTTGPGRLCQ 
SLAIDRTCDGVDPIRDPNWCVFSGPKLPSSRVTT 
TPRIGISQAAELPLRFFVDGNRYV 
SGLVRHHRRPRRDSL " 
/gene="trzA" 
/locus-tag="RB8208"
/gene="trzA"
/locus-tag="RB8208"
/EC-number="3.8.1.-"
/function="dehalogenases"
/note="PMID: 7592318 best DB hits:
BLAST: ddbj:BAB05411.1; (AP001512)
BH1692~unknown conserved protein
in; E=1e-18 swissprot:Q9RW45;
Y824-DEIRA HYPOTHETICAL PROTEIN 



gene 
CDS 



gene 
CDS 



swissprot:O27549; YF05-METTH
HYPOTHETICAL PROTEIN MTH1505
; E=5e-16 COG: BH1692;
COG0402 Cytosine deaminase and
related metal-dependent; E=1e-19
PFAM: PF00744;
Dihydroorotase-like; E=0.025
PF01979; Adenine deaminase;
E=0.099 PF00962; Adenosine/AMP
deaminase; E=0.0
/codon-start=1
/transl-table=11
/product="probable
chlorohydrolase"
/protein-id="CAD78518.1"
/db-xref="GI:32445787"
/translation="MHHSSKQTYRARWVMPIDGD

PIPNGMMIVQSGRIASISSFQDSK 

NEGELIDLGDVAILPGLVNTHTHLEFSDLPKPVG 

HTGMELADWIREVI TTRGMVDHQT 

RQQNVLKGHAEATASGTQLIADIETTPMESIAPN 

TIAFAEVLGLSQERGDERMSQAER 

HLANNTSPDSRCDVAAISPHAPYSTPLPLIERCV 

QVAKQRKTTLAMHVAES PAERELL 

TNGSGPFAESLRTLGLPIEAFFPWTKANPVSHLI 

QLLAKSPRALIVHGNDLSETEIEQ 

VKSHPNCS I VYCPRTHHFFQHSNHPVAELLAAGI 

NVAIGTDSRASNPDLDLWGEVQHL 

LKHRPDLNPTDVLRMATANGADALGRSATHGRLA 

AGMPANFITVATSADRVDQLWSDF 

SSSSVSSSSMSSAARPAPQS " 

complement(127391..128188) /locus-tag="RB8209"

complement(127391..128188) /locus-tag="RB8209"

/note="best DB hits: PFAM:
PF02517; CAAX amino terminal
protease family; E=0.1"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78519.1"
/db-xref="GI:32445788"
/translation="MTQTNADDEQTPDDVFRTAV

AVEAGLGALAL I LGYLLGPS AREL 

VPPLNEVASSAVIGGIGLGIVATIPLVLFVAVLR 

RIKHPAIEELDKLSDHPMIGLMLR 

LNAWELFAI SLCAGVGEELLFRGWLLP WLAGDAA 

SLAPDLEAPSRWWAYGGWLGSLPN 

SVTEFAWPEEGLLAWWSRVGGWELTAAWLVSSFA 

FGMFHPITKLYIWTALMGLYFGA 

LLIVTGNLLIPITAHALYDAVQLWGAGRAAEDMD 

EDDTDEDEKSDQS" 

complement(128198..130078) /locus-tag="RB8210"

complement(128198..130078) /locus-tag="RB8210"

/function="detoxification by
export; ABC transporters"
/note="PMID: 1349418 best DB hits:
BLAST: ddbj:BAB04170.1; (AP001508)
lantibiotic mersacidin
transporter; E=5e-21
swissprot:P45861; YWJA-BACSU
HYPOTHETICAL ABC TRANSPORTER;
E=7e-21 ddbj:BAB04845.1;
(AP001511) ABC transporter
(ATP-binding protein); E=5e-20
COG: BH0451; COG2274 ABC-type
bacteriocin/lantibiotic exporters,
contain; E=5e-22 BS-ywjA; COG1132
ABC-type multidrug/protein/lipid
transport system,; E=6e-22
VCA1084; COG2274 ABC-type
bacteriocin/lantibiotic exporters,



ABC transporter; E=0.024" 

/codon-start=1
/transl-table=11
/product="lantibiotic mersacidin
transporter system"
/protein-id="CAD78520.1"
/db-xref="GI:32445789"
/translation="MARYPTRLATVNRFSSLKQP

RREPEAAGAPLHLVWAAIAGLI IP 

VLVLVFGI I AALVDEGDGLTQSPVRLGTHLS I PV 

PEFFLAQSKVSQLFGLVGLSLAIA 

AVFCLAVWLHRRSADARSRRVI KMLHSRTLKQSL 

RRAEVEGAAAQRERARVI IGRHLP 

EVGRGLSLWYRS I PRSVLMLIGCVTIjALLVNVWL 

AI LAVI SGVALWRLYAKLRNPDWL 

EL S R F E I P Q I RDRL I GL I GD APMMARLQAGGLAD 

QAYEAELDALDRRIAGDDARRGRL 

WPVMMMAGGVAIAVMLLGLGLNTLQGEQGLSLPS 

ALVLGLSLTGAALAAARLSELNRQ 

LRRSSKACESVYLYLQPNSEVSPSEQRVGLGGLR 

DCVTMEDVSLQDSSGKAILRDLSL 

SLQPKSLVAFLGTEDVSTRALLELLMGFGRPHRG 

KVQIDGI SLLDVHPQALAKNVMWI 

GPDGPLWEGTLRENLMAGVDRSVDNRDMVETLEQ 

LGI YERITRLPDGLETI lEPGSNL 

GATEDSDSLGTDVRYAVGIARAMLHRPPIVLAKE 

PPAPTEHVNEDPCLNALRELADAG 

SLWMLPHRLKTLRSCDRVMLLNGPNLVGEGRHT 

ELLNSSDLYRHLNYLLFNPYRHRS PS" 

complement(130179..131948) /locus-tag="RB8214"

complement(130179..131948) /locus-tag="RB8214"

/note="best DB hits: BLAST:
pir:S76926; hypothetical protein -
Synechocystis sp. (strain PCC;
E=1e-06 gb:AAC36156.1; (AF003632)
Pill [Myxococcus xanthus]; E=0.016
COG: sll1107; COG1277 Permease
component of an ABC-transporter;
E=9e-08"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein-putative membrane bound
transporter"
/protein-id="CAD78521.1"
/db-xref="GI:32445790"
/translation="MIAGSAISLFSDFAGAESGL

LPLAQVGKWLPVFVTPIWVLSMGL 

LLGALVTAVIHGILSALSFIPGLGNLADDPKRGV 

TLSLIAGAIFSALLCYFYVPQGGE 

NGQLLFLPLVTLGMILGFGLIYGMWHRTRSEWMS 

I LGEGV I P Y I LGTLGLFAVIGLGS 

TPFVENPMAILESVPAVNLVGDGTWEVATIEPS 

ADPDIPEFLPAGITYNFRNVAELR 

I ESDKKVFLADSDKAEAFSRAP I ELNPGSTEAVT 

YKYQDREQPPIPGDPAKLHIYNQE 

IDPAEVTFTFKNLPQIPQASSIVFSAVALFLTLT 

GFMAMRQAAPRVWALALSTAKNEM 

AQPLYLLLLAIGIFAVLLFGIFPFNTLGDDIRLL 

KDSGVTMIMVLGMLLAVWSAGTSV 

SDEIDGRTALTVLSKPVSRRSFILGKYTGIMLAV 

LVLFVI LAAVLLWMS YKP I YDAR 

ETSQQQPPWQVGHEE I I TTLP I LGLYFMETMAIG 

GIAVALATRLPLLANFITCFAIYV 

VGNLLSPLVASARENTELVGFVGKLIAVWPNLN 

SFNVQAAVDAGNAI PLI YLAAAFN 

YLWF VI AI WMVAMLLFEDRDLA " 

complement(132113..132709) /locus-tag="RB8217"

complement(132113..132709) /locus-tag="RB8217"

/note="PMID: 11997336 best DB 
hits: BLAST: pir:H72201; conserved 



maritima; E=1e-14 pir:A70337;
hypothetical protein aq-408 -
Aquifex aeolicus; E=1e-08"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAD78522.1"
/db-xref="GI:32445791"

/translation="MSEIRLIEPWRFCAVISRH 

QEARQWAKQRLAERWGELGEKGEP 

SPFEAGGFYQPEMGDGLTKELIGIAEFADPAGLA 

DWKNVTNEWESEYASLSDHPEPRP 

LNLDPGYVSQAKLVLATIKDRDHRIYLRDGIFAE 

VTLNYVGGRWIHHRWSYPDYRTET 

VASFAANCRSRLRQHLKATGGFRVGVKKTPGVKP 

SPKV" 

gene 132980..134161 /gene="aspC"
/locus-tag="RB8219"

CDS 132980..134161 /gene="aspC"
/locus-tag="RB8219"
/EC-number="2.6.1.1"
/function="degradation of
amino-acids of the
aspartate-group; amino-acid
degradation (catabolism)"
/note="PMID: 8907187 PMID:
10029535 best DB hits: BLAST:
ddbj:BAB07069.1; (AP001518)
aspartate aminotransferase
[Bacillus; E=2e-59
embl:CAC11669.1; (AL445064)
probable aspartate
aminotransferase; E=9e-55
pir:G69119; probable aspartate
transaminase (EC 2.6.1.1) MTH1894;
E=2e-54 COG: BH3350; COG0436
PLP-dependent aminotransferases;
E=2e-60 PFAM: PF01041;
DegT/DnrJ/EryC1/StrS family;
E=1.5e-05 PF01053; Cys/Met
metabolism PLP-depend; E=5.4e-05
PF00222; Aminotransferase
class-II; E=0.28"
/codon-start=1
/transl-table=11
/product="aspartate
aminotransferase"
/protein-id="CAD78523.1"
/db-xref="GI:32445792"
/translation="MPEVSSASNHRFQPSSNRQR
VNMHPWIADRTASFDSSGIRKVFD 
LAAKLKDPINLSIGQPDFDVPEEIQDATVDAIRS 
GKNAYSPTQGIAPLREKLLAEINA 
KYPGQNRDVFVSSGTSGGLVLSLLSMINPGDEVI 
FLDPYFVMYPALVSLCGGIPVTVD 
SYPDFRLDPAKIEAAITPKTKMILVNSPANPTGV 
TASEQDLRDVGDLAAKHNIALLSD 
EIYSRFFYDGDFASPAATNPDTIVIDGFSKSHAM 
TGWRVGYVHGPPEIIATMLKIQQY
SFVCSPQPAQWGALRAMEISLDGHIDDYRRKRDF
MVEQLSPHFELTSPGGAFYLFPKA
PGTEGGTAFVERAIADGLLIIPGKIFSSRDSHFR
VSFAASDDTLRRGAERLIKLAGDNR"

gene 134234..135553 /locus-tag="RB8221"

CDS 134234..135553 /locus-tag="RB8221"
/EC-number="4.2.99.10"
/function="assimilatory reduction
of sulfur and biosynthesis of the
serine family"
/note="PMID: 7704274 best DB hits:
BLAST: ddbj:BAB06322.1; (AP001516)
O-acetylhomoserine sulfhydrylase;
E=1e-123 pir:D72324;
O-acetylhomoserine sulfhydrylase -



pir:T40463; O-acetylhomoserine
(thiol)-lyase (EC 4.2.99.10) -;
E=1e-110 COG: BH2603; COG2873
O-acetylhomoserine sulfhydrylase;
E=1e-124 DR0921; COG0626
Cystathionine
beta-lyases/cystathionine; E=7e-63
PFAM: PF01053; Cys/Met metabolism
PLP-depend; E=6.3e-204"
/codon-start=1
/transl-table=11
/product="O-acetylhomoserine
sulfhydrylase"
/protein-id="CAD78524.1"
/db-xref="GI:32445793"
/translation="MSADPTQNHRLATRALHAGQ

VADPTTKSRAVPIYATTSYQFDST 

EHAAALFGLAEFGNIYSRLMNPTVDVLEKRIAAL 

DGGVTGLCFASGQAAITAAVLAIA 

HSGQNIVSSTSLYGGTWTLFTQTLKNLGIEVRFF 

DPDHPEQIHGLVDENTRLVYMES I 

GNPRNDVPDFKAI ADAAHS APHGAI PVLCDNTVM 

TPYLLRPIDHGIDIVIYSTTKFLG 

GHGTHIGGCIVDSGNFKWADQPEKWPEFCGPSPS 

YHGAVFEEHLRGMGNIAYNVHIRT 

HWLRDTGAAMSPFAAFLFLQGIETLHLRMPRHCE 

NAMKVAEFLEAHDAVEWVNYPGLK 

THSHNKLAEQYLTNGKGAILGFGIKGGMEAGKKF 

INACQLCSHIiANIGDAKTLVIHPA 

STTHQQLAEDEQRRAGVSPEYIRVSVGIEDIDDI 

LDDLKQALAVATA " 

/gene="metX"
/locus-tag="RB8222"
/gene="metX"
/locus-tag="RB8222"
/EC-number="2.3.1.31"
/function="assimilatory reduction
of sulfur and biosynthesis of the
serine family"
/note="PMID: 9209059 best DB hits:
BLAST: swissprot:P57714;
METX-PSEAE HOMOSERINE
O-ACETYLTRANSFERASE; E=1e-83
swissprot:P94891; METX-LEPME
HOMOSERINE O-ACETYLTRANSFERASE;
E=7e-78 swissprot:Q9JZQ5;
METX-NEIMB HOMOSERINE
O-ACETYLTRANSFERASE; E=1e-77 COG:
PA0390; COG2021 Homoserine
acetyltransferase; E=1e-84 PA0389;
COG0500 SAM-dependent
methyltransferases; E=2e-26
MTH1820; COG2021 Homoserine
acetyltransferase; E=8e-14 PFAM:
PF00561; alpha/beta hydrolase
fold; E=6.7e-07"
/codon-start=1
/transl-table=11
/product="homoserine
O-acetyltransferase"
/protein-id="CAD78525.1"
/db-xref="GI:32445794"
/translation="MTSGRSTRVTMFESQASMGE
PSNEDLSSTDDVRTDAPLAYAKYV 
TFDQSLPLERGGELPEIRCCYETWGTLNDDGSNA 
VLVCHAVSGDSHAARHDEDDQPGW 
WDGLIGPGLPIDTDRLFWCPNVLGGCRGSTGPG 
DADPTSPDGKPYGANFPRITIGDI 
VEAQKLLADHLGIRQWRAWGGSLGGHQVLQWIN 
RYPDAAKTCVAIATSPRLNSQALG 
FDVIARNAIQTDPHYAGGQYYDKDQRPDTGLAIA 
RMLGHITYLSVEAMEAKFDPDRHD
PRQIASQFEQRFSIGSYLAHQGQKFTTRFDANSY
VTLSMAMDLFDLGGTRLKLMETFD 
EATCDFLLISFSSDWLFPPAQSREIVNALTALDK 



YGPLIRERLRDTETHPAVPSDITLNVDEESILEI 
IPAGSSVLDLGCGNGQLLAAIRDR 
HRTPGPPTTEHRLMGVEVAQENLLATAMRGIDVI 
DYDLNHGLPAFIDDQFDYVILNAT 
LQAVENWELLNEMLRVGRHAIISFPNFAYRQLR
DHYVTHGRSPKAPGEFDFDWHNTP 
NRRFPTIADVRDLLGQLNWIDEEVFWDVDQGQR 
lEPDNDPNLNADTAVIAFHRENR " 

complement(137527..138759) /locus-tag="RB8223"

complement(137527..138759) /locus-tag="RB8223"

/note="best DB hits: PFAM: 
PF00114; Pilin (bacterial 
filament); E=0.038"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-putative conserved
hypothetical protein"
/protein-id="CAD78526.1"
/db-xref="GI:32445795"
/translation="MTCHAPPFHRTSFSFKVFSM
SFQRKHRGFTLVELLWIAIIGVL
VGLLLPAVQAAREAARRMSCSNNFKQLGLAIHNY
HSAYNAMPKHGSGTYTQYPPDTGI
NPPGGNRSDLSALVGMLPFIEQQGLWEVISNPTD
TGFQAMGPWPNRQLADHAAIPYDP
WLTNLPSFRCPSDPGVGLPAHGRTNYAVCLGDSG
TLCDVGDTADNGTSRGLRIYQQSA
CRGAFIPRTQTKFRDILDGLSNTLAMGEIITDLG
DYDTRSHAVDSPVEVWREGNALSC 
RQFIDVERPQFWRNDGSTPFTGDAEQRRGFKWAL 
ARPLYTGFVTVLAPNDELCFQGNQ 
EREGDVTPSSRHQGGVHVLMCDGAVKFITDSIEA 
GNSNSAQVREGGGFLTPGSKSPFG 
VWGALGTRAAREIIDTEF " 
complement(138783..139097) /locus-tag="RB8226"

complement(138783..139097) /locus-tag="RB8226"

/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78527.1"
/db-xref="GI:32445796"

/translation="MLFTFWPILSSHFAEWFVAK 

EKSEVFLIWRCHDQSQVYGVLPVN 

FCRFKACGLAALLGVAAISGCGGSSEPEATSTDE 

LTQFLNENPELKEPKPEVSPSDPT KP" 

complement(139186..139365) /locus-tag="RB8227"

complement(139186..139365) /locus-tag="RB8227"

/codon-start=1
/transl-table=11
/product="hypothetical
protein-transmembrane prediction"
/protein-id="CAD78528.1"
/db-xref="GI:32445797"
/translation="MLSLSIGLLCGAFLRLGGGV

FSPPMRFWHGFSTDDSFLLRMAGP 

ISEQANFLKS IQFRE " 

complement(139412..140449) /locus-tag="RB8228"

complement(139412..140449) /locus-tag="RB8228"

/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide prediction"
/protein-id="CAD78529.1"
/db-xref="GI:32445798"



SFPLACTPVAMGQTGDFDPEDRGS 

KVEDDAWYD I S EWFDGND YNPTDEA IGRWDDE I F 

SYEDARTSSDSDNDQLIIPADTFY 

GEDYNESYVSYEDRDEDGTYERSTRYIDSDGDYL 

NDAYVTYDDEDGDGVYESYDYSEL 

GSTSKNAVPPSRIAQSVKDGLSGKRHEVKGTVRD 

I KTVQRGNEYSLMMQVKSQDGKDV 

WIDLGGHATGVQLFKDDQATFYGPMVKRGEKEVL 

VATAVDLPKHGMYKVERSGSRYTG 

QVQSTKTAKVRGKEHLWKLKTKGDKMMTVDMGL 

SSEAKKPSEGDQVTVTGVPVKIGD 

RVILIADKKQS" 

gene 140422..140721 /locus-tag="RB8229"

CDS 140422..140721 /locus-tag="RB8229"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78530.1"
/db-xref="GI:32445799"
/translation="MWGDEGISWGEPFDWCCRV
PATAADMGENTTKEGKRRATLPVY
GRMTDLTRLHHLISHHCGRRAIISHRRAKSTRTR
KMLAYLRHPASNRSTRRFAVC"

gene 140779..141291 /gene="ilvH"
/locus-tag="RB8231"

CDS 140779..141291 /gene="ilvH"
/locus-tag="RB8231"
/EC-number="4.1.3.18"
/function="biosynthesis of
pyruvate family and D-alanine"
/note="PMID: 98037514 best DB
hits: BLAST: pir:A69115;
hypothetical protein MTH1854 -
Methanobacterium; E=4e-13
pir:D69083; hypothetical protein
MTH162 - Methanobacterium; E=4e-09
gb:AAB89585.1; (AE000988)
acetolactate synthase, small
subunit,; E=5e-07 PFAM: PF01842;
ACT domain; E=0.0014"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein-putative acetolactate
synthase small regulatory subunit"
/protein-id="CAD78531.1"
/db-xref="GI:32445800"
/translation="MSFGAGADTQFETMRGREFP
AIRQFTIFLENRVGQLLEWRRFE 
GTGIRIVALSINDAAECAFVRFLISDADRGREIL 
ERAGLAIIESDLVGVELPEGPQPL 
LRVCTALLQAELNIIQAYPLFMRPHGKPAVAIMV
ENIDLAMKTLTEKGFRIITEDDLD
ADNLPPQDIY" 

gene 141467..142522 /gene="hemH"
/locus-tag="RB8233"

CDS 141467..142522 /gene="hemH"
/locus-tag="RB8233"
/EC-number="4.99.1.1"
/function="biosynthesis of
porphyrins"
/note="PMID: 9163953 PMID: 1459957
PMID: 9384565 PMID: 10704318 PMID:
9141132 best DB hits: BLAST:
swissprot:O50533; HEMZ-STRCO
PROBABLE FERROCHELATASE
(PROTOHEME; E=5e-82
swissprot:P71765; HEMZ-MYCTU
FERROCHELATASE (PROTOHEME; E=2e-62
swissprot:P72183; HEMZ-PROFR
FERROCHELATASE (PROTOHEME; E=3e-42
COG: RV1485; COG0276 Protoheme
ferro-lyase (ferrochelatase);
E=2e-63 PFAM: PF00762;
Ferrochelatase; E=7.2e-89"



/transl-table=11
/product="probable ferrochelatase"
/protein-id="CAD78532.1"
/db-xref="GI:32445801"

/translation= "MSELPPYDSFLLVSFGGPEG 

QDDVMP FLENVLRGKNVPRERMLE 

VAEHYKHFGGVSPINEQNRQLIAALQKRFDANGI 

DLPIYWGNRNWDPYFADTLRQMKA 

DGKKRSLAFFTSMFSCYSGCRQYRENIIQAREEV 

GEGAPLVEKVRMGFNHPGFIAAMA 

DNVSKAAQT I GAS PARTKVLFTAHS I PMGMADNC 

DYEKQLRESCRLVADACGAVDWDL 

VYQSRSGPPSQPWLEPDVLDAIAEMDDAKKLESL 

VILPIGFVSDHMEVLFDLDEEAAQ 

LCRERGI KMARASAAGTHPDFVEMICGLVQERLG 

KLNEKPALGELGPWHDVCPQDCCL 

YTPRRPPVAGGRPVQAN " 

complement(142418..143629) /locus-tag="RB8234"

complement(142418..143629) /locus-tag="RB8234"

/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78533.1"
/db-xref="GI:32445802"
/translation="MKLPHGISSIACLAFVFGAE

ELFAAFHVRFDCTRAFLPVGWADF 

AIGFEVLEGFDHSQCFVDAAAQRQWDDLVTHNA 

FLVDQEQTTQCDAAVQQHWITRD 

LLVQVGDQWVSGLANATFVAWRVGPCQVSEMAVD 

RNADNVHAEVFEVLGPVAERDDFG 

WADESEVEWPEEQQDVLALVIRQLEWFERAIGH 

DGGGGE I RGFLGDQNTHGD I S FLK 

RVKRGCEGWMASVAKIRIEHANFGPIGFLTLDEF 

QVPRCCLISVMGGTVFEDCMQHHI 

EVHVDRTIVFGLAASHGEAHQTGMIVQIGIxAIVQ 

QRVDGFVAAVFGREEDLVGESRHG 

AHEGSKRGQKHRFVDRRRSRCGRNQDGSSVILAA 

ASIGLNRSPARHRRPPWRVQAAIL 

RADIMPGTQFT" 

142595..142942 /locus-tag="RB8236"

142595..142942 /locus-tag="RB8236"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78534.1"
/db-xref="GI:32445803"
/translation="MFLPPFRSFVSTMPRLAHQV
FFTTKDRSDETINALLDDCQTYLN 
DHPGLVSFAVGRCEPEYDRPVNMDFDWLHTVFE 
DRAAHDAYQTAPRHLEFIERQKPN 
WAEVRVFDSNLRD" 

142997..143593 /gene="prdX2"
/locus-tag="RB8238"

142997..143593 /gene="prdX2"
/locus-tag="RB8238"
/function="detoxification; stress
response"
/note="PMID: 9115640 best DB hits:
BLAST: ref:XP-009063.1;
TR00071480-P [Homo sapiens];
E=2e-62 swissprot:Q61171;
PDX2-MOUSE PEROXIREDOXIN 2
(THIOREDOXIN; E=3e-62
swissprot:P32119; PDX2-HUMAN
PEROXIREDOXIN 2 (THIOREDOXIN;
E=4e-62 COG: PA0848; COG0450 Thiol
- alkyl hydroperoxide reductases;
E=4e-60 PFAM: PF00578; AhpC/TSA
family; E=1e-71"
/codon-start=1
/transl-table=11

/protein-id="CAD78535.1"
/db-xref="GI:32445804"
/translation="MSVLVTQKAPDFTATAVMPD

GTFKDDFKLSDYKGKYVLLFFWPL 

DFTFVCPTEI lAFSDRAKDFEDLGVNIVGVS IDS 

HFTHLAWTNTPRNEGGIGKTAYPL 

VADLNKQ I SRD YDVLLDGGVALRGLFL IDQEGW 

RHQWNDLPLGRSVDEALRMVKAL 

QYFETNGEVCPANWQEGSRTIKADVEGSKEFFGA 

EYKS " 

/locus-tag="RB8239"

/locus-tag="RB8239"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78536.1"
/db-xref="GI:32445805"
/translation="MWQFQDFAAPTPGSACDQNA

GQCFTAHWNADGHDWVCHTAGGSK 

GIS FGDPDGSLHVNLLPVGDDALPVATEQSVRGD 

VWNVTLPQADSQYSLRLALRWEA 

TETRLVI EPTLS IQTLLLDTHPTLDLTALGDCGV 

I FDEGDTLGDGS PP I STVQLHGET 

GSIAVLLAPTDAPFTTDLRREGHLQLRLFGEFLE 

KGVIRKARPWIVLDRSEQGVSEAD 

LQAYLKRLCETPLPLTA " 

/locus-tag="RB8240"

/locus-tag="RB8240"
/function="protein secretion
systems"
/note="PMID: 9804807 best DB hits:
BLAST: swissprot:P44973;
60IM-HAEIN 60 KDA INNER-MEMBRANE
PROTEIN HOMOLOG; E=7e-31
gb:AAG58902.1; AE005601-8
(AE005601) 60 KD inner-membrane
protein; E=4e-30 swissprot:P25714;
60IM-ECOLI 60 KDA INNER-MEMBRANE
PROTEIN; E=4e-30 COG:
HI1001; COG0706 Preprotein
translocase subunit YidC; E=6e-32
PFAM: PF00595; PDZ domain (Also
known as DHR or GLG; E=5.2e-06
PF02096; 60Kd inner membrane
protein; E=2.8e-49"
/codon-start=1
/transl-table=11
/product="60 kDa inner-membrane
protein homolog-putative part of a
protein secretion system"
/protein-id="CAD78537.1"
/db-xref="GI:32445806"

/translation="MADSSVERRLFSFILSSMAF 

FLIYMSLRTMFAPPLPPEEDAIAE 

VDGETVEPAEDLVADTDASEPGEGEGEEANEDTD 

AVERPSSPTWSTLGSMDPTSGYVM 

LVTLNSRGGGIERIELTERKENGRLKYRRVDVRS 

GYLGYLAADPTATDLGIRVNWGP 

GTPADLATASGVQGGLKPGDIITGFNQNNVNNLS 

MLREAMLETKPGESATVTVLRNEK 

SIDFTTTLTEHPLDLIRLAEHGGDDEVEGNLSRL 

S CLLTVGRVGRRE I QSGEKT I EGM 

VDTGDLIWDASQDGDNVSYQLQLSDSEMKPASGK 

SVGLQRTYSLKPDSYSLDMDVQID 

NRAEEAQELAYR I EGANG I TLEGWW YSNKI S PNW 

GGSAARDIVYKTTAEGHELVSGYA 

LLKRAKNESEADDQTLFAPDSAPPARNLSYIGVD 

AQYFTVAMLPPEGQESLKTFRRAA 

ANI lADPSAVPDNKERAVNASFYLDSAIADVPPG 

SSLKQSLRLFAGPKQPDVMEAYGL 

GDCIYYGWFSFVAKPLGGLLHLFSNVGNYALAIV 

LLTLCVRGLMFPLSRKAAINAQRM 

QELAPELKKIAEKHKDDMEARVRAQRELQQRVGF 

NPMAGCAPMFLQLPIFIGLYRTLS 

VDIELRQAAFASWTTWASNIJ^PDMMYYWGDWMW 



LFIx?^QQKKFMPPATDEQTAMTQKMiynSIYMTLVMGL 

FFFRVPAGLCIYFITSSLWGIGER 

I LVKKTLPS KPHFD PATLQGAAAGGGTVDGKVNN 

SAGANGKKSSDGKPKTMADRLRER 

LGTPEEEAAPLPKDRKRPPSKKPGNKKRR" 

complement(146852..147184) /locus-tag="RB8243"

complement(146852..147184) /locus-tag="RB8243"

/codon-start=1
/transl-table=11
/product="hypothetical
protein-transmembrane prediction"
/protein-id="CAD78538.1"
/db-xref="GI:32445807"
/translation="MPGGTVETTVLLWPRSTRSC

GLPFSENAFRRIFTLRTSPETKFW 

GRSSDAVQAYAREGAERGTWRGLPSPRNLAERSL 

SEPSRCAVGGLKTLAAFLVAGLLR WWAFAIFG" 

complement(147205..147444) /locus-tag="RB8245"

complement(147205..147444) /locus-tag="RB8245"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78539.1"
/db-xref="GI:32445808"

/translation= "MTLRPALPRSQRQEIGVFAC 

TALRSTFTGIVTDEVWGVLSYLL 

PTEALRRPELFEWFIVPGPDDSDNGSVAPSDLDG 

W" 

complement(147441..148241) /locus-tag="RB8246"

complement(147441..148241) /locus-tag="RB8246"

/note="best DB hits: BLAST:
embl:CAB59607.1; (AL132662)
putative transmembrane protein;
E=6e-08 pir:G83504; hypothetical
protein PA1139 [imported] -
Pseudomonas; E=6e-06 pir:T35659;
probable transmembrane protein -
Streptomyces coelicolor; E=2e-05
COG: PA1139; COG2340
Uncharacterized protein with
SCP/PR1 domains; E=6e-07"
/codon-start=1
/transl-table=11
/product="probable transmembrane
protein"
/protein-id="CAD78540.1"
/db-xref="GI:32445809"
/translation="MHHAAHWVIRFQFKFSTRSD

AVRGWLMLGCLLFVGPLMAQEEST 

AVAADSESDGELVTQVEEAIVKQINDYRKKKGLG 

QLQVNDELQATAKKFAEFMAESGK 

YGHHADGKTPAERAEAAGYEYCWRENIAYRTNT 

GEVTAKSLIDIFVPGWIDSPPHHE 

NIVAKHITQTGAAVATKDQTTYYAVHLFGRPKSA 

Q I E I S VLNRSGKAQTVS I ETNE S V 

DEFEMPPRTTVSMKRCFPTTFRVDGLEARTVDKS 

QELVLTKKGWEVAE " 

complement(148183..148386) /locus-tag="RB8247"

complement(148183..148386) /locus-tag="RB8247"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78541.1"
/db-xref="GI:32445810"
/translation="MGTAKGALLIRGATRFTSSV

KAEGIAGRDRLIWVGAVTKQTRQR 

IGMTIASCCSLGDSVSIQILYQE" 



/locus-tag="RB8248"
/EC-number="3.1.1.45"
/function="ortho-cleavage"
/note="PMID: 11571162 PMID:
89125597 best DB hits: BLAST:
embl:CAA98652.1; (Z74134) ORF
YDL086W [Saccharomyces
cerevisiae]; E=2e-36
gb:AAC69477.1; (AF087482)
dienelactone hydrolase
[Pseudomonas; E=3e-09
ddbj:BAA74533.1; (AB019032)
dienelactone hydrolase [Ralstonia;
E=1e-08 COG: YDL086W; COG0412
Dienelactone hydrolase and related
enzymes; E=2e-37 PFAM: PF01738;
Dienelactone hydrolase family;
E=4.2e-05"
/codon-start=1
/transl-table=11
/product="carboxymethylenebutenolidase"
/protein-id="CAD78542.1"
/db-xref="GI:32445811"
/translation="MTPSQLSMRIQAPSFVDLET

PTGPMRTHLFRPDGPGRYPGVILY 

S E I YQMTAP I ARTAAVLAGHGLLVAVPDVYHE YT 

ELGESFAYDKEGTDRGNRLKIEKE 

lAAYDADARSVIDFFGSDEGCTGKVGTVGICLGG 

HLAFRAAMNEEVKAGVCFYATDIH 

KRSLGKGMHDNSLDRIPEIQAEMLMIWGRQDPHI 

PAEGRRMIYDAMTAAGVSFSWHEF 

NGEHAFMRDEGHRYDPELALLLNSMACQHLNRHL 

R" 

/gene="aspS" 
/locus-tag="RB8253"
/gene="aspS"
/locus-tag="RB8253"
/EC-number="6.1.1.12"
/function="tRNA-synthetases"
/note="PMID: 2129559 PMID: 1944398
PMID: 10562565 PMID: 10873442 best
DB hits: BLAST: swissprot:O32038;
SYD-BACSU ASPARTYL-TRNA
SYNTHETASE; E=1e-165
ddbj:BAB04971.1; (AP001511)
aspartyl-tRNA synthetase
[Bacillus; E=1e-161
swissprot:O67589; SYD-AQUAE
ASPARTYL-TRNA SYNTHETASE; E=1e-157
COG: BS-aspS; COG0173
Aspartyl-tRNA synthetase; E=1e-166
PFAM: PF01336; OB-fold nucleic
acid binding dom; E=2.2e-11
PF00152; tRNA synthetases class II
(D, K; E=8.2e-21 PF01409; tRNA
synthetases class II (F); E=0.69"
/codon-start=1
/transl-table=11
/product="Aspartyl-tRNA
synthetase"
/protein-id="CAD78543.1"
/db-xref="GI:32445812"
/translation="MLRTHTCGALRKSDVGSPVT

LCGWVDS KRDHGGAVF IDLRDRYG 

LTQWIGPPEANEALIKQAGHVPNESVILIRGW 

ADRLEGKTNAKLETGEIEVRSEHF 

EILSASETPPFTPGQSDLPGEDLRLKYRFLDLRR 

KEMQQALIRRSEI IKCMRDYFAEH 

DFIDVETPILGRSTPEGARDYLVPSRVHPSNFYA 

LPQSPQLYKQILMVAGFDRYVQVA 

KCFRDEDLRADRQPEFTQLDLEMSFVDSEDIIGL 

IDGLVAKTAKQVLGKD ITLPLPRM 

TYEEAMRRFGSDAPDLRFGLEIVDVTSVAAKTDF 

RVFRGTADAGNFVRGINVKDSALK 



misc-feature



gene 
CDS 



150511 1505 



151322. 
151322. 



.153511 
.153511 



gene 
CDS 



complement (153523 
90) 

complement (153523 
90) 



S P I AKNFDEEHLAE I KALMGGEPG 

DLLMFLADTWEVTCKGLSGLRKRLAVELKLYEDG 

ELNCSWTEFPMFEKDEEAGRYVA 

MHHPFTAPLEEDLPLLKESPEKCRAQAYDLVING 

SEAGGGTIRIHDSKVQSQVFELLG 

MDEETARDRFGFLLDALRFGAPPHGGIALGVDRW 

VMLFAGLENIREVIAFPKTQKAAD 

MMTGAPGE VDADQLNELHLRTVS AKT " 

/gene="aspS" 

/locus-tag="RB8253" 

/note="cosmid pircos-c3hll/ cosmid 

pircos-clglO joining point" 

/locus-tag="RB8254"

/locus-tag="RB8254"
/function="amino-acid
transporters"
/note="PMID: 9308174 best DB hits:
BLAST: gb:AAB89635.1; (AE000991)
cationic amino acid transporter
(cat-1); E=3e-38 gb:AAG20879.1;
(AE005155) cationic amino acid
transporter; Cat2; E=1e-29
gb:AAG18789.1; (AE004983) cationic
amino acid transporter; Cat1;
E=6e-21 COG: AF1612-1; COG0531
Amino acid transporters; E=2e-32
PA5097; COG1113
Gamma-aminobutyrate and related
permeases; E=4e-12 BS-ykbA;
COG0531 Amino acid transporters;
E=8e-12 PFAM: PF00324; Amino acid
permease; E=2e-08"
/codon-start=1
/transl-table=11
/product="cationic amino acid
transporter (cat-1)"
/protein-id="CAD78544.1"
/db-xref="GI:32445813"
/translation="MFGATSVGVGAIVGGGILAL
AGVAFATTGPSAILAFGLNGVIAI 
LTALSFAEMASKFPESGGTYTFSRKVLSVESAFT 
VGWWWFAS I VAS VL YA I GFGS FA 
TLLLSELYSTQGSVPHWLGEPWSVPWSLATTVG 
IGGLMTFRTSGGGAWINVAKVAVF 
SVLIIGGFWMLTGQPVSKTTAELRPFLASGWGGL 
VQAMGYS F I ALQGFDL I AAVGGEV 
REPTKNIPRAMLLSLVIALLIYLPLLFVLTTVGT 
DGSQS IRELAASDPEAWALAARH 
YLGTSGYWLVLIAAVLSMFSALQANLFAASRIAL 
AMSRDNTLPNALSRLAAGSGSPWI 
SVLVTTGLVCLLIQLLPDIAAAGAASSLIFLVTF 
AIAHWLAILVRQRSVLTPPPFRVP 
GYPAVPWGGLACLALAI FQGI AVPEAGI I AVMW 
lAIGGVLFLSLFARRARLTDVSNI 
ASNPELSRLRGNSPLVLVPIANPNNARAMIALAD 
TLVPAALGRVLVQTVWAPHDWDP 
LVNHRPSAQLHSVMNEILHASASLGVRCETLTTV 
SAEPMMEIARVAKLHQCQSVLLGL 
SEITPEARDTPLEGLLGQLSSDVWLRAPKDWQL 
DQSQQILVPVGGRGGHDYLLTRLL 
SSLSREQQRQVKFLRVI PTDTLRADQKRIRKELD 
RTTRVNAGRVCEREWRSNDPVKT 
I ADRAGTAGLIILGAQRLGPRQKLFGDFTRKVAL 
ESDCPVIIISRRG" 
1536 /locus-tag="RB8257"

,1536 /locus-tag="RB8257" 

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78545.1"
/db-xref="GI:32445814"
/translation="MPRTGPSWEMSSYLSPTEAL

RRPERDQSQQAIGLSSRCEAMRLP 



153716..154126 /locus-tag="RB8258"

153716..154126 /locus-tag="RB8258"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78546.1"
/db-xref="GI:32445815"
/translation="MSQLFRRAAFKGQASSATKS

TSNRSRHATRRQRRPTQTFGDKCS 

TLFPTTAPSGGRPMQRTNEPHCQSPSFHPTRSEG 

STQQFPRAEGRWQLTDIRLRRAHC 

LTPAREPNRLLRLTVITTTLFSALRRQCSSHPAT 
"

complement(154068..154394) /locus-tag="RB8260"

complement(154068..154394) /locus-tag="RB8260"

/note="PMID: 20150255 best DB 
hits: BLAST: pir:H72072; conserved 
hypothetical protein CP0257 
[imported] -; E=2e-10 pir:F81677; 
conserved hypothetical protein 
TC0667 [imported] E=4e-10 
pir:B71520; hypothetical protein 
CT388 - Chlamydia trachomatis;
E=6e-10 COG: CPn0497; COG1872
Uncharacterized ACR; E=2e-11 PFAM:
PF02594; Uncharacterized ACR, YggU
family COG; E=4.7e-22"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAD78547.1"
/db-xref="GI:32445816"
/translation="MMSEPGAGLDRQCDGLTWRF

RVRVTPKAKKASVGGLHDGALKVS 

VHTVPEDGKANKAVIASLAKWLRVSKGRVAIVAG 

ETSRLKTIWEFKSQDEMNTADAK LKKELS" 

complement (154391. .1581 /gene="putA" 

04) 

/ locus- tag="RB8262" 
/note= " synonym : poaA" 

complement (154391. .1581 /gene="putA" 

04) 

/ locus - 1 ag= " RB8 2 6 2 " 
/EC-number= "1.5.99.8" 
/EC-number= " 1 . 5 . 1 . 12 " 
/function=" degradation of 
amino- acids of the 
lut amat e - group " 
note="PiyiID: 96062224 best DB 
hits: BLAST: pir:H64526; proline 
dehydrogenase (EC 1.5.99.8); E=0.0 
pir:B71980; proline dehydrogenase 
(EC 1.5.99.8); E=0.0 pir:C81297;
proline dehydrogenase (EC
1.5.99.8); E=1e-180 COG:
jhp0048-2; COG1012 NAD-dependent
aldehyde dehydrogenases; E=1e-123
Cj1503c-1; COG0506 Proline
dehydrogenase; E=2e-56 sll1561-2;
COG1012 NAD-dependent aldehyde
dehydrogenases; E=1e-51 PFAM:
PF01619; Proline dehydrogenase;
E=2e-36 PF00171; Aldehyde
dehydrogenase family; E=0.0014"
/codon-start=1
/transl-table=11

/product="proline dehydrogenase /
1-pyrroline-5-carboxylate
dehydrogenase"
/protein-id="CAD78548.1"
/db-xref="GI:32445817"
/translation="MSSIFVSAESLVKDLVLNQP



ADLLRESRALQTPQERRQQSELDRMIGHDADKAT 

LVEMTDQAFRTHTPARVADQMTHL 

LDLQGIPRFFSPVEQAMLKGFQSFGEYLPGVAVP

LVKEKMRRETANVILPAEPDLLTP 

HLASRQRHGVAMNVNLLGEALLGDDETRKRMQSF 

YELLRMPEVRCISVKLTTLYSQVS 

PLAREHTIRMVADRLERLYRNANHMDDGSKAKDK 

FVYLDMEEYSDLYLTADVLRETLE 

RGDLDSVQAGIALQAYVPDSCLVMRDLIEWSTKR 

VAGGGAPLTIRLVKGANLEMERVH 

ASVGGWPLAPYTNKHDTDANFKRMLRELISAAAD 

GNIRVGVASHNLFDVALAMIWTQR 

SGASDSVQFEMLEGMANHQRRALTERDAAMLLYA 

PACKKSDFLHAITYLIRRLDENTG 

AENFLRHAYRLQPDSPDFELLAGRFKDALMDCET 

VSVQPRRTQNRYQTPVQPPAAEHW 

TQFVNEPDTDWSLPHNSKWAQETLDRWKPLCDEQ 

ATRVTPVLVGLPDVDSLDGDDRVS 

AEWFTTDDVSRPGWTCRYKNATSPAVESAVRAS

SESTWAADLSVDERYSVLRSVAQL 

ARERRGDWIGSMVAGAGKTVAEADPEVSEAIDFL 

EFYPLTMKAWDECAGIQCEPRGW 

AVITPWNFPLAIACGGIAAAIAAGNPWVKPSMQ 

TLLPAWLMCQAFWDAGVPRDALQF 

VICDDEVAEEQLVKNPIVDTVILTGGTSTAKRML 

DVRPDLHLLAETGGKNVTIVTAMA 

DRDLAIKHVLHSAFGHSGQKCSATSLLLLENEVF 

DDPEFREIIADAVRSLPVGSAWDL 

HTRVTPLVDPPTDKLRRGLQELEGGESWLVMSEH 

VDGNPTLYRPGVKWDVQPGSFTHT 

TELFGPVMGVMRFGRLEEAIEIVRSTGYGLTSGL 

ESLDDREIKLWRESIPAGNLYINR

PTTGAIVLRQPFGGVGLSAYGPGIKAGGPNYVLA 

LSKVTHASDWTPPTDEVEASHDQL 

DRMRDWIERGRYSDEMLGDGQKFRIERAITSQEI 

ANATDFALEHDTYRLTGQDNIRRY 

RSVDGMTIRIEKDDDIVDAIVAISAAVCVGTQFT

LSLSPELDRSKSAMLETTADWLPG

LIDPIEESVEQLIERIEGGGVSRMRTIiTHAKESI

RRACGRRFITVIDEPWDHGRIEC

LRYLNEQSISHDYHRYGNLGRRSAKLSQPTDAE"

/locus-tag="RB8265"

/locus-tag="RB8265"

/function="degradation of

exogenous ester compounds"

/note="best DB hits: BLAST:

swissprot:O67466; YE94-AQUAE

HYPOTHETICAL PROTEIN AQ-1494;

E=3e-12 pir:B75463;

conserved hypothetical protein -
Deinococcus radiodurans; E=4e-10
pir:I39564; hypothetical protein 6
gbd-region [imported] -; E=1e-08
COG: aq-1494; COG0824 Predicted
thioesterase; E=3e-13"
/codon-start=1
/transl-table=11

/product="conserved hypothetical

protein-putative a thioesterase"

/protein-id="CAD78549.1"

/db-xref="GI:32445818"

/translation="MAFRRDPKTPAMTTDLPQHV

FRLRVRYDECDPMGLVHHSNYLRY 

FEIGRTEFLRSSGGRYREVEEAGLYWWHIDCR 

YRASARYDDEIDIVTRIAKITAAK 

IIHEYEIRRGDEVLVQATVTLAVIDKTGRLQRVP

EALLT" 

/locus-tag="RB8266"
/locus-tag="RB8266"
/note="best DB hits: BLAST:
gb:AAK03329.1; (AE006164) unknown
[Pasteurella multocida]; E=1e-04
swissprot:Q58707; YD11-METJA

HYPOTHETICAL PROTEIN MJ1311;

E=2e-04 pir:S47803; hypothetical



gene 
CDS 



159685 
159685 



.160662 
.160662 



gene 
CDS 



complement (160746 
67) 

complement (160746 
67) 



intergenic region); E=0.002 COG: 
MJ1311; COG1082 Predicted 
endonucleases; E=2e-05" 
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78550.1"

/db-xref="GI:32445819"

/translation="MRPDQPTRSLSNRRQFLRRI

GVTSAVGYAAYLAHQVSPSPLSVN

GLPGQGRALWAEELSESKTETSTGYLKQSLKAGM 

IRAGKSDAANPWIERFRIAKDAGF 

EGVEPNTSPGMDVEAMVAASAETGLTIDGTVGGY

HWGTTHTSSDAATRKKAQQLLEES 

LQQTAJ3LGANTFLIVPGHGKDGTAEEVRQRAFEA 

LDRAVPLAEKLGVKILIENVWNHF 

LYDHGGDEKQSAQPLADFVDSFNTSSIGVQFDLG 

NHWKYGDVAEWVKTLGHRIGKLDI

KGFSREQGRFTDVTEGDIDWASVRKALAEIKFEG

WVAAEVGGGDAERLKKIHGQIETA 

LHCSKSLSQIKAETA" 

/locus-tag="RB8268"

/locus-tag="RB8268"

/note="best DB hits: BLAST:

swissprot:P73460; YB44-SYNY3

HYPOTHETICAL 34.4 KDA PROTEIN

SLL1144; E=2e-40 swissprot:Q9WZX6;

Y872-THEMA HYPOTHETICAL PROTEIN

TM0872; E=7e-40

gb:AAK02218.1; (AE006048) unknown
[Pasteurella multocida]; E=4e-38
COG: sll1144; COG0275 Predicted
S-adenosylmethionine-dependent;
E=2e-41 PFAM: PF01795; MraW
methylase family; E=2e-100"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein-putative
methyltransferase"
/protein-id="CAD78551.1"
/db-xref="GI:32445820"
/translation="MTASQPLDQADQDSESSSAG
SSAAETEHVSVMPNEIVQWVREIN
PTTIIDGTYGGGGHTRLLAEVLADAGSDSDTPRV
IAIDRDPAWRRDEQKSWIKDDPR
IELFLGSYESSPKALDALDLTHADALVLDLGLSS
DQLADRNRGFTFTIDDAELDLRFD
PENGVPAHRWLQQHSEKEIADAIYRFGEERFSRR
IAKQIFLRARERNPVTKVGELVEI
CRRCVPRSRNHDIHPATRTFQALRIAVNDELGGL
TRTLQSAPDWIAPGGRVAVISFHS
LEDRIVKNAFREDHRWEILTKKPLRPTDEEVQAN
PRSRSAKLRVAKRVE"
.1618 /locus-tag="RB8271" 

.1618 /locus-tag="RB8271" 

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78552.1"

/db-xref="GI:32445821"

/translation="MGRERVSRTVSKSSSHKITC

RETLCLATVRFGSISAGIRFVFER 

KLIFCMLGSAIPIERRWPSPWVSRGAWIGVCACW 

ILLTGCKTTAPVHVWQPPQLAVEP

AGVAFMGISGPDEWAEPIEDALLTDQPSRWRLIA 

AEQLESPANIRLVSGFEEEPNDMA 

VSAAARREGLRYLLHGEILQATGHEDRDDKVSLS 

WRLTGLQADAESAGMPVSVDEALI 

SQRYPHLLNIPDVAERTRRAWLETKRLLAASVD

RQQVTLASPRLSPGSRAIRRGNEL 

ARSGNWPAAEHRWQQVLESHPRNAAALINASIAA



gene 
CDS 



161882. .163753 
161882. .163753 



gene 
CDS 



complement (163760 . 
97) 

complement (163760 , 
97) 



gene 
CDS 



complement (163907 , 
57) 

complement (163907 , 
57) 



ANKSLAEETLFWIEMRQRDYHNAFDLPPPPEGWL 
ISRGE" 

/locus-tag="RB8274"
/locus-tag="RB8274"
/function="two-component signal
transduction"

/note="PMID: 8346225 PMID: 9097040
best DB hits: BLAST:
swissprot:Q06067; ATOS-ECOLI

SENSOR PROTEIN ATOS pir: ;

E=2e-17 gb:AAD47419.1; AF100457-2
(AF100457) histidine kinase
[Myxococcus; E=2e-16
ddbj:BAB04255.1; (AP001508)
two-component sensor histidine
kinase; E=3e-14 COG: atoS-3;
COG0642 Sensory transduction
histidine kinases; E=2e-18 Rv1827;
COG1716 FHA-domain-containing
proteins; E=1e-08 PA2882; COG0642
Sensory transduction histidine
kinases; E=2e-08 PFAM: PF00498;
FHA domain; E=3.2e-19 PF01590; GAF
domain; E=4.8e-18 PF00512; His
Kinase A (phosphoacceptor) doma;
E=6.6e-07"
/codon-start=1
/transl-table=11
/product="sensor protein atoS"
/protein-id="CAD78553.1"
/db-xref="GI:32445822"
/translation="MASLFVIRGRDQGKHFQLAP
TVTRLGRESSNNVQLLDNEASRSH 
AEIRGDGTGRRYELVDLGSSNGTLVNNRKITRHV
LTSGDRVEIGSTLLIFTGTGNVSA
LDAAHGVDIVRQVRGGDASNIVSSLSREGWSPY 
TPGGSSSNGPSTDKPATSPTTLPP 
PIPPPVENPPTPSEDERVDSSASGSDPPGQFHGP 
PVIGRLDADRSLEVMYLTAIAVGR 
TDDLDEVLDRVLKLVFDWVEADRGCIMLRDPESK 
RLTPAARCDRESATGGTKQNPSSK 
PQDRIQISHTILDYVLQHKQGVRTNDALDDERFD
SAASIVQGGVREALCVPLQGRYEI
VGMLYVDTYTTPGELVRKGGATRFHDEHLRLITA
VGHQAALAIEDTFYYSALLQGERL
AAMGQTIATLSHHIKNILQGVRGGSYLIESGLQK 
DDTDAIRRGWSIVDRNQERISNLV
LDMLTFSKEREPDLQEGNLNEWSDWELMQSRA 
GQSEIELVSDLDATLPIATFDSEA 
IHRAVLNLVTNAIDAATKTVLVKTFYDAAQGWIV 
DVEDDGEGVPEEDREQIFSLFESK 
KGARGTGLGLPVSAKVLREHGGTLTVDDAASGGA 
RFRMILPATGTDINELSRSETQA" 
1638 /locus-tag="RB8275" 

1638 /locus-tag="RB8275" 

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78554.1"
/db-xref="GI:32445823"
/translation="MHHPRVAFGSKERKSFSQFP
DEKARPGVDNRIGGIISLRMSFTPF"
.1642 /locus-tag="RB8278" 

.1642 /locus-tag="RB8278" 

/codon-start=1

/transl-table=11

/product="hypothetical

protein-transmembrane prediction"

/protein-id="CAD78555.1"

/db-xref="GI:32445824"

/translation="MDRNDAGATVFQQHELHPPL



164128. .164343 
164128. .164343 



164318 
164318 



164707 
164707 



complement (164696 . 
38) 

complement ( 164696 , 
38) 



complement ( 165169 , 
14) 

complement (165169 
14) 



complement (166175 . 
67) 

complement (166175 , 
67) 



AFHRSPSHDYRSLRVIIWFSSFGSGVLSPIGPQ 
QCNFPVFNCQIEHWCWNRNGGLMC
RQPAMAPNEVLNEA"
/locus-tag="RB8279"
/locus-tag="RB8279"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78556.1"
/db-xref="GI:32445825"
/translation="MKKLLEQGSRDQFVAIDVET
GDYEVADEAHAAGTRLRQRRSDPQ
VFIAKVGHSAAFHALRLTSGDGIERND"
/locus-tag="RB8280"
/locus-tag="RB8280"
/note="PMID: 11759840"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78557.1"
/db-xref="GI:32445826"
/translation="MTGSSGMIEGRIDNNVEAFV
DLEIQLDDTTHTIAFLVDTGFNGF
LSIPFRLVHQLGLPLSDVQQGVTADGRSSFFDTV
ELTVIWHGNPLTVQAQVLDEPLLG
TRLLRGSKIEAAWIQGGMFQIELILDG"
.1649 /locus-tag="RB8281" 

.1649 /locus-tag="RB8281" 

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78558.1"

/db-xref="GI:32445827"

/translation="MLSHFPWMDHYPLTLALIW

RGRAIDSIQLAGPNGLPSRIKPCA

GIEKGLEGECRIWPLLFFDFHFVKLCNNDHKRS

AI" 

.1660 /locus-tag="RB8282" 

.1660 /locus-tag="RB8282" 

/codon-start=1
/transl-table=11
/product="hypothetical
protein-transmembrane prediction"
/protein-id="CAD78559.1"
/db-xref="GI:32445828"
/translation="MAGRRIVCPQCKTTLQIPAT
MGAGKVKCPKCELMLAVRAPVTVH
VPEENLFDNLPSLGSSAAAPPSSVFRPSGPVTVY
QPPKPAKKRGGGSKAAVKIISTIA
GLGLLCVLLCAGGIFAVGYLGSRHSGWTSVTYKG
YTVSMPAGEDRRDKSQQFPGTTVH
ELTGRRKETGSQYSLWADLPAVIDPNIPIAELL
RDMRIRLSNPRPVTRSGVEGMAGT
MQSGVGAIEGSDCEIYVHNRNLWAVYSPYSEIK
DLVGGQRKPRSNESELDKPSEFFDSLEFK"
.1671 /locus-tag="RB8284" 

.1671 /locus-tag="RB8284" 

/note="PMID: 98196666 best DB 
hits: BLAST: pir:F70440; conserved 
hypothetical protein aq-1630 - 
Aquifex; E=3e-18 pir:D71323; 
conserved hypothetical protein 
TP0436 - syphilis; E=3e-18 
pir:F69999; conserved hypothetical 
protein ytql - Bacillus subtilis; 
E=1e-16 COG: aq-1630; COG0618
Exopolyphosphatase-related
proteins; E=3e-19 PFAM: PF01368;
DHH family; E=2.4e-12 PF02272; 



/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein-putative a
phosphoesterase or an
exonuclease"

/protein-id="CAD78560.1"

/db-xref="GI:32445829"

/translation="MGVNWKAFVDQIQHYQSFVL

VSHIRPDCDALGSELGMAEVLRAV 

GKDVRIINAHRTPPALSFLDPAGNIEVLGDSVEA 

EDVKADCIMILDTSAWAQLGDMGD 

VIRASRADKMVLDHHVGEDELGARMYKDYQSEAT 

GHLWQAADALNVPLTRTMATPLF 

AAIATDTGWFRFPSVTSDTYRTIARLIDAGWPS 

EVYGDLYERDTLGRLKLRGLILSR

TTTEMDGALVYTSVNKEDFELCGAQPNDTEDAIN 

LTLAIRGTQAGVIFVGQVRGGFKL

SFRSRCSLDCNEVARQFGGGGHKAAAGAFLEGTL

DEVQDRVLPIVREALAVALG"

complement(167183..1681 /gene="ribF"

54)

/locus-tag="RB8286"
complement(167183..1681 /gene="ribF"
54)

/locus-tag="RB8286"

/EC-number="2.7.1.26"

/function="central metabolism"

/note="PMID: 7772835 PMID: 3023344 

best DB hits: BLAST: pir:T35984; 

probable riboflavin kinase (FAD 

synthetase) -; E=3e-46 pir:C83077; 

riboflavin kinaseFAD synthase 

PA4561 [imported] E=4e-42 

pir:C82293; riboflavin kinaseFMN 

adenylyltransferase VC0681;

E=2e-41 COG: PA4561; COG0196 FAD

synthase; E=4e-43 PFAM: PF01687;

Riboflavin kinase / FAD

synthetase; E=3.3e-32"

/codon-start=1

/transl-table=11

/product="probable riboflavin

kinase (FAD synthetase)"

/protein-id="CAD78561.1"

/db-xref="GI:32445830"

/translation="MTSLIHLSDLAGDSDLHLSL

QGGAVSIGNFDGVHLGHRALLERV

RWHAQRVGGKAIAIVMDPHPASVLRPHGAPPSLT 

SLSRRAELMSGSGIDHLLVCEATR 

DFLNQTAEEFFQRLIVEQLSAKAIIEGPNFFFGR

DRAGNTTRLKELAAERQIDVEIW

PSVRDERMVSSSRIREAIASGDIVLANQMLGSRY

RLTGWTTGEQRGRTLGFPTANLG 

GVTTLLPEHGVYAALACGDECDGPTNHPAAVHIG 

PNPTFNERQETKIEAHLLDYDGDL 

YGKSLSLTLVSQVRGVIKFADASALKQQLQIDLQ 

TVRETVSSLQSRP" 

complement(168273..1691 /locus-tag="RB8287"

87)

complement(168273..1691 /locus-tag="RB8287"
87)

/function= "unspecified kinase or 
ATP dependent regulatory protein" 
/note="PMID: 20406833 best DB 
hits: BLAST: pir:H82489; conserved 
hypothetical protein VCA0191 
[imported] -; E=2e-23 pir:G81111; 
conserved hypothetical protein 
NMB1197, NMB1159; E=4e-21
pir:C72317; conserved hypothetical 
protein - Thermotoga maritima; 
E=9e-21 COG: VCA0191-2; COG0063 
Predicted sugar kinase; E=8e-24 
PFAM: PF01256; Uncharacterized 



/codon-start=1
/transl-table=11

/product="conserved hypothetical

protein-putative sugar kinase"

/protein-id="CAD78562.1"

/db-xref="GI:32445831"

/translation="MPSIETQSTSPPPLRIPHRE

ASAHKGNFGRVLLIGGSRGMAGSI 

ALSSIAALHTGSGLVAAAVPDCILDCVAGFHPAL

MTIGLGDDGEKFADPAWQSLKDRL 

SAQAAIGCGPGMTTGSGGATIVEGLLAKKDLPLV 

LDADALNIIAQNDWLSDGRFERSK

ADAACVLTPHPGELQRLTGASAKDVDAQVEAAAE 

LACRLGLTIWKGGPSHVAYQNGD 

GGKQVWQNTTGNPGMATAGCGDVLTGIVTSLLGQ 

GLSGPDAAKLGWIHGRCGDEAAA 

RWSHAGMTSLHALDALALVADEMTQPAD"

169350..170591 /locus-tag="RB8290"

169350..170591 /locus-tag="RB8290"

/function="other cation
transporters (Na(+),K(+),Ca(2+),
NH4(+), etc.)"

/note="PMID: 8170937 best DB hits:
BLAST: pir:S76018; hypothetical
protein - Synechocystis sp.
(strain PCC; E=2e-61
swissprot:Q58752; YD57-METJA
PUTATIVE POTASSIUM CHANNEL
PROTEIN; E=1e-36 swissprot:Q57604;
Y13B-METJA PUTATIVE POTASSIUM
CHANNEL PROTEIN; E=2e-34 COG:
sll0536; COG1226 Kef-type K+
transport systems, predicted;
E=1e-62 MJ1357; COG1226 Kef-type
K+ transport systems, predicted
NAD-binding; E=1e-37 MJ0138.1;
COG1226 Kef-type K+ transport
systems, predicted; E=2e-35 PFAM:
PF00520; Ion transport protein;
E=0.45 PF02254; KTN NAD-binding
domain; E=5.2e-39 PF02080;
Potassium channel; E=2.2e-09"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein-putative potassium
channel"

/protein-id="CAD78563.1"

/db-xref="GI:32445832"

/translation="MSGKKISRPFLPSRSTYQSE

RMKCKWRANVAPLNRILIHESRGF 

PEPNFSTFNSSQFPFVMQPTPLRRILLGLGVFSL 

ICAIAIIAYIQMGWSVSDAIYMW 

ITIFGVGYGEVKPVDTPALRTLTIAIIVLGYGAA 

IYTVGGFIQFLVDGELQSLLRNRK

MSQGIASLRAHTIVCGFGRMGVRVAEELQELGQP 

FWIDENAARVEEAQQAGMLAMVG 

NATDEDILMAAGIDHARGLATLLPDDAANAFICV 

TARDLAEKVEIVSRAENHSAQKKL 

IRCGANYWMAATIGAMRVTQLLVRPTASAVLES 

HGLSHGISEELSAIGLNLEELRLT 

SSSPLVSKPLAEIQVRGNRGFLIVGVKRGEDPIQ 

MNPSGTLILEVNDIVIWGHQNDI 

AELCLAHKVQRQEILYRGVKG" 

complement(170605..1714 /gene="purE"

53)

/locus-tag="RB8291"
complement(170605..1714 /gene="purE"
53)

/locus-tag="RB8291"
/EC-number="4.1.1.21"
/function="purine-ribonucleotide
metabolism"

/note="PMID: 98049343 PMID: 
8809759 best DB hits: BLAST: 



conserved hypothetical protein; 
E=4e-29 pir:S76797; hypothetical 
protein - Synechocystis sp. 

(strain PCC; E=7e-28 
gb:AAD29318.1; AF117208-1

(AF117208) circadian phase
modifier; E=2e-24 COG: -AF1275;
COG1691 NCAIR mutase

(PurE)-related proteins; E=4e-30"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein-putative

phosphoribosylcarboxyaminoimidazole

mutase (purE)"

/protein-id="CAD78564.1"

/db-xref="GI:32445833"

/translation="MNLPSNQPPSPALLQILQDL

ASGQTDVEAAAESIQASSIGSAAD

MQTVPGATVDLGRLARCGFGEVIYGEGKSTDLMT 

QIVQTQVAVGQGCLITRIDATAAA 

QLRRVFENTRYNPSARTLRIGCGDENLDPIGVEG 

HSTHVAWTAGSTDAHVAEEAAET 

LAWMGVACERIDDIGVAGPQRLLAAVPRLRAAAA

IVWAGMEGALPAALAGHVATPVI 

AVPASTGYGANFAGLTPLMGMLTSCAANVAWNI 

DAGFKGGYFAGLIASGIANAKSEAASAEEA"

gene 171457..172791 /gene="argG"

/locus-tag="RB8293"

CDS 171457..172791 /gene="argG"

/locus-tag="RB8293"
/EC-number="6.3.4.5"
/function="urea cycle, 
biosynthesis of polyamines and 
creatine; assimilation of ammonia, 
biosynthesis of the glutamate 
family" 

/note="PMID: 20158877 PMID:
90046714 PMID: 3174461 best DB
hits: BLAST: pir:H72210;
argininosuccinate synthase -
Thermotoga maritima (strain;
E=2e-97 pir:D75490;
arginosuccinate synthase -
Deinococcus radiodurans; E=3e-96
pir:C83204; argininosuccinate
synthase PA3525 [imported] -;
E=2e-87 COG: TM1780; COG0137
Argininosuccinate synthase;
E=2e-98 PFAM: PF00764;
Arginosuccinate synthase;
E=2e-190"
/codon-start=1
/transl-table=11
/product="argininosuccinate
synthase"

/protein-id="CAD78565.1"

/db-xref="GI:32445834"

/translation="MLLDFPARILLTVLACFAPD

NLNARPSNIADSPAFGCRSEMKSC 

VLAYSGGLDTSVILGWLQDQGYEVHCVYVDLGQP 

CEDRDAIMEKARTCGAKSSRLVDV 

REELCRDFAFPVLAWQAKYEQIYLLGTSIARPLI

SKVCLEVAREVGATAYAHGATGKG 

NDQCRFQLAAEALDPNIEMIAPWRIKSFRDAFPG 

RTELIEYCDVKRIPVKASTAKPYS 

SDENVLHISYEAGKLEELDVNGVELVEFGMGVSP 

QDAPDKPEEVTIGFESGVPKTLNG 

KAVNALEMVEQLNDIAGRNGVGRIDMVENRFVGM 

KSRGVYESPGMTVLYDALMYVEQL

TMDRDLMHLRDRIVLAPEVAEMVYYGFWYTPKMDAL 

MSFIETAQRPVTGEVTLQLYKGNI

MVSSRTSPNSLYDEEIATMEGGGSYNQDDAEGFL 

RIQGLPSRVQGRVSPRKF" 

gene complement(172907..1738 /locus-tag="RB8295"



CDS 



complement (172907 
30) 



gene 
CDS 



173814. .174359 
173814. .174359 



gene 
CDS 



complement (174384 
12) 

complement (174384 
12) 



gene 



175602. .176399 



1738 /locus-tag="RB8295" 

/codon-start=1

/transl-table=11

/product="hypothetical

protein-transmembrane prediction"

/protein-id="CAD78566.1"

/db-xref="GI:32445835"

/translation="MTFLFTCPHCQSQTEVEDEY

SGRTGDCWCGREITMPEFAGSRR 

MGNRPGKRNKSAIWFVAAGLALLLIGAGLIAAVQ 

VGSRTAKKIRTGRQRLSSIKNLEK

IASALNAYAADHGVYPAPYTVDAAGRKLHSWRVT

ILPYLDEDGLYNQIDKDVPWNEGE 

NQMLLYSQTPAVYRHPESSSWGTGTVYHLVTGAG 

TLFPSTGPLGPRQVTDGATKTILL

AEGQMNTMTESWMEPYDLDIGSVGGLINPPSGNG 

LGGATDGGVCVATVEGSGYFLPDT 

TPPLTVQALITPTGGEPLSDDVLDEWASTQP"

/gene="dtd"

/locus-tag="RB8300"

/gene="dtd"

/locus-tag="RB8300"

/EC-number="3.1.-.-"

/function="other detoxification"

/note="PMID: 20459059 PMID:

10383414 PMID: 10918062 PMID:

11568181 best DB hits: BLAST:

gb:AAF82117.1; AF276071-1

(AF276071) conserved hypothetical;
E=3e-37 ddbj:BAB04962.1;

(AP001511) BH1243 unknown
conserved protein; E=3e-36
pir:E72338; conserved hypothetical
protein - Thermotoga maritima;
E=9e-36 COG: BH1243; COG1490
D-Tyr-tRNAtyr deacylase; E=3e-37
PFAM: PF02580; Uncharacterized
ACR, COG1490; E=3.8e-5
/codon-start=1
/transl-table=11
/product="D-tyrosyl-tRNA(Tyr)
deacylase"

/protein-id="CAD78567.1"
/db-xref="GI:32445836"
/translation="MNRNVIRSFHLISDAPRVRP
ATNLAHLATTWIANHPMKIVLQRS
QHASVSVDGKIVGQIERGLVALIGIGHEDTEATA
SALADKTAGLRIFSDDNGKMERNV
IDAGGDVLAISQFTLLADCRKGRRPAFTDAAPPD
RANELYEHYVSELRKTGLSVPCGI
FAADMAVSLTNDGPVTIILEL"
.1754 /locus-tag="RB8301" 

.1754 /locus-tag="RB8301"

/codon-start=1

/transl-table=11

/product="hypothetical

protein-transmembrane prediction"

/protein-id="CAD78568.1"

/db-xref="GI:32445837"

/translation="MRDRPAPPSWLGGLAVGVSG

NLVGPPFPARRLLSVRVSLAGFKT 

HISTSTWGCAYGYWGVFDQGMSMESALLAGGLC

SVSGMLPDLDSDSGVPLRETSLFL 

AAWPMLMIDRWRDLGLSHEAMALAAMIVYIAIR 

FVAVEGFRKFTVHRGMWHSIPAAL

VAGLIAYMAMPCASEAVRVYKSCAVLVGFLTHLI 

LDEIWSLDFSRGSMRVKKSFGTAL 

KFFGSSPLANIFVWGQLGLFIYLAWGDHEILDRL

RQRVRMDRDRYAVPSQDEINPDYT

NPSLFAPWESREPPQWQPRVTSPQESPSNFGWPS 

PAQPGATQPGWQPGMGQRVARPNWPPALPQR"

/locus-tag="RB8306"



/function="biosynthesis of peptide
antibiotics"

/note="PMID: 2542219 best DB hits:

PFAM: PF02674; Colicin V

production protein; E=8.4e-14"

/codon-start=1

/transl-table=11

/product="hypothetical

protein-putative a gene in the

purF operon involved in colicin V
biosynthesis"
/protein-id="CAD78569.1"
/db-xref="GI:32445838"
/translation="MQTYDILMTVILVGATLLGA
IRGFAWQLASIASIWSYCVAYHY
REPFSQNIHAAPPWNQFLAMFILFVGTSFVIWVA
LRMVSGMIDRMRLKEFDRQIGALF
GLAKGALLCTIITLFAVTLFGERTQRAIVASESG
RLIARVLAESNSIMPPELDSWRP
YLDQFSDEELGEPSASEGSWLSQTPIAPNIDPNW 
SHSNAPTANNFAPQNGFQQAQNDP 
RSQSPFGGFGGSSTPTPNADFNSAAGSQQPAPTW 
RQSATPLWQTPRR" 
176354..176554 /locus-tag="RB8309"

176354..176554 /locus-tag="RB8309"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78570.1"
/db-xref="GI:32445839"
/translation="MASIRDAAVANATTMIARHL
GIAMTANDRGGAYSRSPGHSDLQR
HQAKAQTRKTPETRVLRTSNTT"
complement(176695..1776 /gene="accA"
54)

/locus-tag="RB8310"
complement(176695..1776 /gene="accA"
54)

/locus-tag="RB8310"
/EC-number="6.4.1.2"
/function="fatty acid
biosynthesis"

/note="best DB hits: BLAST:
gb:AAK02376.1; (AE006064) AccA
[Pasteurella multocida]; E=4e-72
swissprot:P43872; ACCA-HAEIN
ACETYL-COENZYME A CARBOXYLASE;
E=6e-72 ddbj:BAA77860.1; (D83536)
Acetyl-CoA carboxylase (EC
6.4.1.2),; E=1e-68 COG: HI0406;
COG0825 Acetyl-CoA carboxylase
alpha subunit; E=5e-73"
/codon-start=1
/transl-table=11

/product="AcetylCoA-Carboxylase"

/protein-id="CAD78571.1"

/db-xref="GI:32445840"

/translation="MAGPGLEFENEIADLEEQIA

SLERNTDRSEEIDSAIRSLRLARV 

AKLKETYSSLDPWQTVQVARHKNRPYTRDYLNLA 

FDEFVELHGDKHFGDDRAMLSGFA 

KLDRFKVMVIGHQKGRTYKERAACHFGCAHPEGY 

RKAMVKMKMAEKYRLPVICFIDTP 

GAYPGIGAEERGQAQVIAESMFMMSDLKTPVICV 

VIGEGGSGGALGIGVGDRVAVLQH 

AYYSVISPEGCAGILWKSHEHAPKAAAALRFTSD 

HLLRLGWDDVLEEPLGGAHRDHH 

QMATRMKTYLSRQLSELEEMPVDLMLEQRYEKFR 

KLGVFLEES"

complement (177658. .1787 /locus-tag="RB8311" 
52) 

complement (177658. .1787 /locus-tag="RB8311" 

/EC-number="2.7.1.-"

/function="unspecified kinase or



gene 
CDS 



complement (178844 . 
88) 

complement (178844 . 
88) 



/note="PMID: 7774814 best DB hits:
BLAST: pir:H69878; probable
protein kinase (EC 2.7.1.-) yloP -
Bacillus; E=2e-27 pir:T36717;
probable serinethreonine protein
kinase - Streptomyces; E=1e-26
pir:T36501; probable
serinethreonine protein kinase -
Streptomyces; E=2e-26 COG:
BS-yloP-1; COG0515
Serine/threonine protein kinases;
E=1e-28 PFAM: PF00069; Protein
kinase domain; E=3e-39"
/codon-start=1
/transl-table=11

/product="probable protein kinase
yloP"

/protein-id="CAD78572.1"
/db-xref="GI:32445841"
/translation="MAVSRLPPSDLRCRAHVFRL
WGRLDYHAMPQNRFDRSTRSSRFR 
QSPSRSMSKSRDFLGPYRLARLIRIGSTAEVWEA 
IDEGDQKRYALKILRQSMSNDKAE 
LASLKHEYNVGKALNSPRIIKMIDHRVENGRPFL 
VLELFSEMNMKQALRKGPESLAYM
LDKIIEQAAEGLYYMHTKNWIHRDIKPDNYLVSH 
TGETKLIDFTITEPKRTGLSKMFY 
KAKNIQGTRSYMAPEQIRGKLCDERSDIYSFGCV 
LFEACTGKPPYTGQTPNDLLSKHL 
SASIPSPIVYNDNVTKDFADLVKSMMAKSPDARP
PSVWEFLKVFRNIEVWRKRPRRPE 
VTIFDDDTSMKSPEDMIKRPEPTSDEEEEI"
1800 /locus-tag="RB8313" 

.1800 /locus-tag="RB8313"

/function="glycolipid
biosynthesis; protein modification

(glycosylation, acylation,
myristylation, palmitylation,
farnesylation and processing)"
/note="PMID: 10531250 best DB
hits: BLAST: gb:AAD50490.1;
AF172324-8 (AF172324) WbnE

[Escherichia coli]; E=7e-21
pir:C70036; capsular

polysaccharide biosynthesis
homolog yveN -; E=2e-19
pir:T35514; probable glycosyl
transferase - Streptomyces
coelicolor; E=2e-17 COG: BS-yveN;
COG0438 Predicted
glycosyltransferases; E=2e-20
PFAM: PF00534; Glycosyl
transferases group 1; E=9.5e-43"
/codon-start=1
/transl-table=11
/product="putative glycosyl
transferase (WbnE)"
/protein-id="CAD78573.1"
/db-xref="GI:32445842"
/translation="MRIAHVITRMIIGGAQENTL
HNCQDLVHLHGDEVLLVTGPAVGP 
EGDLLSERVRKDSSDIGDSANDSKGRAGRFELDS 
LPIELIDSLRRAIHPLHDWNATRD
LRRTLLDFDPDWHTHSAKGGLLGRHVGWGLKRA 
STGKRPVWHTVHGAPFHEYQSKL 
AHDFFVRCERWAASRCHKLISVADAMTDLMVEAG 
VAPREKFVTIHSGMNVDPFVHAVD 
HREAVRQRYGLRDEHVWGKIARLFHLKGHADLV 
PAAKLVADRHPNVRFLLVGDGILR 
GELEQQIESLGLKEHFIFTGLVPPSEVPAMIGAM 
DILVHASYREGLARALPQALIAGR 
PAISYDIDGAREWIDDQTGYLVGAGQVADLADR 
MIRLVGDGELRLRMGEEGRTRFTD 



gene 

CDS 



complement (180097 , 
15) 

complement (180097. 
15) 



misc-feature

gene 

CDS 



180953 1809 

complement (181037. 
18) 

complement (181037 . 
18) 



gene 
CDS 



complement (181308 
21) 

complement (181308 
21) 



gene 
CDS 



complement (181485 < 
79) 

complement (181485 



.1809 /gene="whiG"

/locus-tag="RB8316"
.1809 /gene="whiG"

/locus-tag="RB8316"

/function="transcriptional

control"

/note="PMID: 2507166 PMID:
94259284 best DB hits: BLAST:
pir:S29615; whiG protein -
Streptoverticillium griseocarneum

; E=5e-44 gb:AAB41960.1;

(U58281) sporulation sigma factor
[Streptomyces; E=9e-44
prf:2013252A; RNA
polymerase:SUBUNIT=sigma factor
[Streptomyces; E=1e-43 COG:
TP0709; COG1191 DNA-directed RNA
polymerase specialized sigma;
E=8e-41 fliA; COG1191 DNA-directed
RNA polymerase specialized sigma
subunit; E=2e-38 BS-sigD; COG1191
DNA-directed RNA polymerase
specialized sigma; E=7e-34 PFAM:
PF01399; PCI domain; E=0.026
PF00140; Sigma-70 factor;
E=1.3e-59"
/codon-start=1
/transl-table=11
/product="RNA polymerase sigma
factor whiG"

/protein-id="CAD78574.1"
/db-xref="GI:32445843"
/translation="MIITEDSMPATATADDPILK
VWTSFKSTTKDAPDYEPLRNQLVE 
RYMQLVRYNGERIWQRLPDGVELDDLISAGMFGL 
MDAIDAYDTERGVKFETFCVPRIR 
GAMLDELRKMDWVPRLVRSKASKLATATKTLEAR 
FGRAPTVQELSAHMEIDVKEVEKM 
QSEANAVGWSLNKKWYETDSYKDVREIDILEDK 
KGEDPTRRVQKNDLMRLVTKGLNR 
NERLIIILYYYEELTMKEIGATLDLSESRVSQMH 
TAIVNRLQQQLGVRRLEFGA"
/note="cosmid pircos-c1g10/ cosmid
pircos-c4e02 joining point"
.1813 /locus-tag="RB8318" 

.1813 /locus-tag="RB8318" 

/codon-start=1
/transl-table=11
/product="hypothetical
protein-transmembrane prediction"
/protein-id="CAD78575.1"
/db-xref="GI:32445844"
/translation="MDGDVTRSLALSLGSFAMGL
WLRGIWHGEMAGDVASEAIGTLI 
VFMGIGGLAGAIADQLIRDGVEDLYRKRVKWFQE 
GVAETASEETENQTK" 
.1814 /locus-tag="RB8319" 

.1814 /locus-tag="RB8319"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78576.1"
/db-xref="GI:32445845"
/translation="MIDESHLRLFLKCPFNEFGI
EEFDPFLFSGLPPLFGR"
.1828 /gene="flhF" 

/locus-tag="RB8320"
.1828 /gene="flhF"

/locus-tag="RB8320"
/function="biogenesis of
flagellae"

/note="PMID: 1447978 best DB hits:
BLAST: pir:B72319; flagellar
biosynthesis protein FlhF -
Thermotoga; E=9e-32
ddbj:BAB06156.1; (AP001515)
flagella-associated protein
[Bacillus; E=3e-22
swissprot:Q01960; FLHF-BACSU
FLAGELLAR BIOSYNTHESIS PROTEIN
FLHF; E=2e-21 COG: TM0907; COG1419
Flagellar GTP-binding protein;
E=8e-33 AF0622; COG0541 Signal
recognition particle GTPase;
E=3e-09 PFAM: PF00005; ABC
transporter; E=0.018 PF00931;
NB-ARC domain; E=0.2 PF00004;
ATPase family associated with va;
E=0.13"

/codon-start=1
/transl-table=11

/product="flagellar biosynthesis
protein FlhF"
/protein-id="CAD78577.1"
/db-xref="GI:32445846"
/translation="MMHIRTFRAANLQAALADIR
DQMGPEASVLHTRQVRNGWMGWLG 
RTHVEVTAGLHSGPEDGTSHDASGSRVSANYADE 
PLQPADVRSVNSVAASSGYGTSPT 
GTPNINAGTINPGAAPVSSGQNPMDAAYQNYDAS 
HQHSSSTGIQGFGNAGNGAGQGGF 
YGAGYASGELSPADSLSLRLLQAGVEESTARRWM
ASASSFAAGIANSESAVQSDQRWM 
EHLQRAVARELNLCGPIRTQPGDRHWALVGPTG
VGKTTTVAKLAAGFRIEARRRVGL 
LTIDTYRIAAVQQLKAYAEIMDLPMQWEKPEQM 
ETALSALGDVDLVLIDTAGRSPRS 
DARIEQLSEFLRAAHPDETHLVLSATSSGENIRT 
TLEGFAPVRANAVTLTKLDETPCM 
ASVLAALTGRDRWAPPLSYLTNGQQVPDDIAVA 
DASGLVAQLLPHMPTAGLGEMGFE 
TMDGYESLHYGEAA" 
gene complement(182876..1849 /gene="flhA"

39)

/locus-tag="RB8323"
CDS complement(182876..1849 /gene="flhA"

39)

/locus-tag="RB8323"
/function="flagellar organization;
type III protein secretion system"
/note="PMID: 8097015 best DB hits:
BLAST: ddbj:BAB06157.1; (AP001515)
flagella-associated protein
[Bacillus; E=1e-119 pir:C72319;
flagellar biosynthesis protein
FlhA - Thermotoga; E=1e-115
swissprot:P35620; FLHA-BACSU
FLAGELLAR BIOSYNTHESIS PROTEIN
FLHA; E=1e-114 COG: BH2438;
COG1298 Flagellar
biosynthesis/type III secretory
pathway; E=1e-120 PFAM: PF00771;
FHIPEP family; E=1.6e-261"
/codon-start=1
/transl-table=11
/product="flagella-associated
protein"

/protein-id="CAD78578.1"

/db-xref="GI:32445847"

/translation="MQLLMRYRDLVLPIGIIACL

WILVPLPPFLMDLLLATNITVGV 

IVLLTTVYVSTPLEFSIFPSLLLATTLARLVLNV 

ATTRLILTSTESGSGGAAGGVIQG 



ISEVAARFALDGMPGRQMAIDADM 

NAGLIDEKEAQRRREEIANQADFYGAMDGASKFV 

RGDAIAGIVITLVNVAGGLYIGVM 

RYGMTVSEAAELFTKLTIGDGLVSQVPALLISLA 

AGLLVTRSARKASLPEQFLQQLFS 

NPKALAVAGGFLSLLILTSLPALPMATLGASCIG 

LAWMNRQNKQDEQDELDAEEAEQ 

AAASAPAEKRVEDYLNVDPMELAIGLGLLSLADP 

NRGGDLMQRITGVRHSIAGDIGW 

LPKVRIRDDMNLHDYEYEIRIAGNPLTKQRVLPD 

RLLAIDSGHTTGVIDGEPTKDPTF 

GEPAVWIDPIRREQAAIYGYTTVEPGAVLATHLQ 

ELARRHADELLTRDATKHLIDELK 

DAAPAWEELIPGMLKISDVQQVLQTLLREDVPI

RQLAIILETLGDHAGRTKDPIWLS

EYVRHRLARTISTRYRDELGQLHWALDPAMEDR 

IAAGIEHTERGLFVRMSPQAVDST

CDKISQAVKKLITLGHTPIILVSPRIRPGLRQII 

AGSMPRVRVLSYNEITQDTKIQSHGWSD"

complement(184997..1851 /locus-tag="RB8324"

13)

complement(184997..1851 /locus-tag="RB8324"
13)

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78579.1"

/db-xref="GI:32445848"

/translation="MKLVRSKRTCALSWHLPNL

SRILHFRRKKSFDTVGLG" 

complement(185139..1860 /locus-tag="RB8325"

29)

complement(185139..1860 /locus-tag="RB8325"
29)

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78580.1"

/db-xref="GI:32445849"

/translation="MNPPAVTHPNTSTGTDASAR

QEQTKSANAGDLSASSLTGSSEVR

LSRSRQTWTGAGSAAGFVLRYLPPMRRLLTDVLG 

SESDADRALAILISHLVKAGYSGH 

AKGRLRDFLILGLRSAAKARVTEIESKAKKAGES 

CRPRDLDLDSAKTESRRWLSYWRE 

GLLQRTWRALEREQHRERTRSHQIESGEVDAGDA 

DADADWADEPINDLVHDVMRMVA 

DHSGESSEVIAGRVAEIAERSVSAAEVKRQLPIA 

RSLFAQLLADEISLTLENADAKAI 

QAEISKLGLQKAFSGLQVKA" 

complement(186147..1884 /gene="pknB"

89)

/locus-tag="RB8326"
complement(186147..1884 /gene="pknB"
89)

/locus-tag="RB8326"
/EC-number="2.7.1.-"
/function="unspecified signal
transduction; unspecified kinase
or ATP dependent regulatory
protein; cellular

communication/signal transduction"
/note="PMID: 1835671 best DB hits:
BLAST: swissprot:P54744;
PKNB-MYCLE PUTATIVE
SERINETHREONINE-PROTEIN; E=2e-35
pir:T36501; probable
serinethreonine protein kinase -
Streptomyces; E=2e-34
embl:CAB94054.1; (AL358672)
putative serinethreonine-protein;
E=4e-34 COG: Rv0014c-1; COG0515
Serine/threonine protein kinases;
E=2e-33 PFAM: PF00069; Protein
kinase domain; E=3e-52 PF00191;



gene 
CDS 



complement (188338 . 
07) 

complement (188338 , 
07) 



gene 
CDS 



188636. 
188636. 



189601 
189601 



/codon-start=1
/transl-table=11
/product="probable
serine/threonine-protein kinase
pknB"

/protein-id="CAD78581.1"
/db-xref="GI:32445850"
/translation="MCQCLPCRVGFCRTSWAGRL
ASRSTASTTGGMHWFPIATRHLMT
DSPPAPDSPPEFLDPPEAAGELGRLGPYRVIELL 
GQGGMGHVYRAQDTRLQRTVALKV 
MNERFAKSPNSRKRIVEEARSMAAVHHDNVATLF 
EVGQRNGTPFLAMELLQGGTLEQL 
LQSGRQFSSEEVIAIADQVALGLEAAHDRGIIHR 
DIKPANLWIQSPSERVKILDFGLA 
VAGSAVQGLARGDSWGTPGYLSPEQARNETVDE
RTDLYSLGWMYEMCAGKVPLLAS 
NTPTQLVNILCREPLTLSQRKQNGESVEVPEPLE 
RLIMRLLSKEPGQRYETAGHFRKA 
LLAVSEQIQQSKQAGLAIDIDANSSNGSSVPAGA 
EPAATTKTTSGGTGKAFFAAVGLA 
VAALIAGGVWWMLRDGTSKSTSMESVKSASTPDT 
SPAVLAAELGALQLTGKVIADPEI 
PVGEMARFRLQLTNQADSRESDPRIRYSQAGQVA 
RITTYLQQEGQLKRKAPAFPRSFS 
PRQLPGPGETQSIEIQFLTNNLAQERFDVIFELQ 
SPAGAHVDSLTSQFEVTENLQTGD 
LLGFETIRTRAGKGADTYVREGSKEDFGKQPAVM 
AHRDGANDQNVLAIAFLRFDLTQL 
PPPSGAGADSASAVLKRIDRSALLLSMAPNSHPG 
VGRILVHGWPSDLTTPGGIGSLDW 
SETGEASLTFETSPVANGVGDLVPLGELKFDNSG 
EGLKDQPDGLRFVSRALDDFLRSS 
PGWTLVLSQEGWLGDPTRFNSRDRSPELAPGLA 
VRWKTD" 
1886 /locus-tag="RB8327" 

1886 /locus-tag="RB8327" 

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78582.1"

/db-xref="GI:32445851"

/translation="MFRAGQGRAWSAQSSGQSY

QDPAWIPWGQISVAEELLGDVPVF 

TLQGWILPNELGGAAGQPVNRIDDRWNALVSHRN 

PAPHDRFTART" 

/locus-tag="RB8328"

/locus-tag="RB8328"

/function="DNA synthesis and

replication"

/note="PMID: 97158661 best DB
hits: BLAST: gb:AAB68787.1;
(U97022) orf; similar to the
Staphylococcus aureus; E=6e-35
gb:AAK06020.1; AE006422-9
(AE006422) HYPOTHETICAL PROTEIN;
E=2e-29 ddbj:BAB04894.1;
(AP001511) CMP-binding protein
[Bacillus; E=1e-28 PFAM: PF01336;
OB-fold nucleic acid binding
domain; E=1.4e-05 PF01966; HD
domain; E=1.1e-10"
/codon-start=1
/transl-table=11
/product="CMP-binding protein"
/protein-id="CAD78583.1"
/db-xref="GI:32445852"
/translation="MSRTAIHDLADGMTLAQPFQ
AADKQLRVNRQGGKYILLKLSDRS
GTIVGMMWNADETTFDTFDRGDYIHCTGRTQVHN 
GSLQVILTEIQRMDPAEVDQSDFD 
RFDANQADNNFTRLRELLATISQPALHALAQSYL 
NDDSFVQSFRQAAAAVSNHHAYPG 



189583..190335
189583..190335



complement (190457 . 
13) 

complement (190457 , 
13) 



190489..190839
190489..190839



190919..191278
190919..191278



HDLGKIDELRASGEISYTDRGQLV 

GHIVIGVQRLGEKIAETEASTGQKFSTGLRHQLE 

HL I VSHHGVLE YGS PKI PVTLEAV 

ALHHIDNLDAKLAS YSS IIDSDI AADGNWTNYTP 

SIGRKLWKGDS" 

/locus-tag="RB8329"

/locus-tag="RB8329"

/function="transcriptional

control"

/note="best DB hits: BLAST:
pir:E83522; conserved hypothetical
protein PA0976 [imported] -;
E=6e-63 embl:CAB50783.1; (X74218)
hypothetical protein [Pseudomonas;
E=3e-62 pir:B82628; transcription
regulator XF1858 [imported] -
Xylella; E=1e-60 COG: PA0976;
COG0603 Predicted ATPase (PP-loop
superfamily), confers; E=6e-64
XF0999; COG0137 Argininosuccinate
synthase; E=3e-04 PFAM: PF02568;
Thiamine biosynthesis protein;
E=0.087 PF00733; Asparagine
synthase; E=0.26 PF00764;
Arginosuccinate synthase;
E=0.00022"
/codon-start=1
/transl-table=11

/product="probable transcription
regulator"

/protein-id="CAD78584.1"
/db-xref="GI:32445853"
/translation="MERRLVNSTSQSKTESAQQD
AGKAWLLSGGLDSATCVAIARDQ 
GFEVHAISFRYGQRHDGELDRAAKQASLMGWSH 
RVIDIDLAQLGGSALVDSSIAVPK 
SDHVDKIAGDIPVTYVPARNTIFLSYALAVAETL 
GSRDIFIGVNALDYSGYPDCRPEF 
IDAFQTMARLATKAGVEDEHSLTIHTPLLHWTKA 
EIIQRGIELGVDYSQTLSCYDPQG 
SSDEMRPCGQCDACLLRAKGFAENEIADPAIG" 
1908 /locus-tag="RB8332" 

1908 /locus-tag="RB8332" 

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78585.1"

/db-xref="GI:32445854"

/translation="MPKPHRNQVQFQPFASCFST

TRERSRRPHPGRLMVLLVRSRGFH 

PELRTTAPSGAVWRDCHERNGETGRLQIAKWTLQ 

IGSCRSALSRQLPSRKYFAICIDQ 

FALCNPLTESSPGQCR" 

/locus-tag="RB8333"

/locus-tag="RB8333"

/note="best DB hits: BLAST:

embl:CAB77251.1; (AJ249487)

accumulation-associated protein;

E=0.75"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78586.1"

/db-xref="GI:32445855"

/translation="MDCKVQIDQCKLQSISGSAT

DVTTQTCNCQFAMSTLQFAIAPSR 

HSSHGSHARPPRTEPSSLARGGNPETEREEPSVA 

PDGAWSVPALLKSTTRTVGTGLG 

CDVAWALRVAQRNS " 

/locus-tag="RB8335" 

/locus-tag="RB8335"

/codon-start=1

/transl-table=11



/protein-id="CAD78587.1"

/db-xref="GI:32445856"

/translation="MDCKVQIDQCKLQSISGSAT

DVTTQTCNCQFAMSTLQFAIAPSR 

HSSHGITPDRPGRGRRDCSGIVQPHDVSGWTWTW 

LRCALAVESGPTGQLRTTVLLCCG 

PFQRDRDSPTSLLLRSR" 

complement(191448..193547) /locus-tag="RB8337"

complement(191448..193547) /locus-tag="RB8337"

/codon-start=1

/transl-table=11

/product="hypothetical

protein-transmembrane prediction"

/protein-id="CAD78588.1"

/db-xref="GI:32445857"

/translation="MNFQRVQVADKRVINPTRTY

FSNFDGRTQACRLAVLHVMTAVLA 

VCLIGSHSIAQDAAPAKEAGLEKNAGPAKQAGAE 

QGGAKPVLMTALKLLPDTAAGWR 

I PNLTVLCDSWKKTTLSGITDDPSMKPLIDANFG 

SQGAVWEKLGDKVGLRPKELYE lA 

TGEVIAVWLPFENDNRRPSSVCWADIRGKHAEA 

EAVLKRVDQDLKADGATRTDVTHA 

GETVRVYRPKTRPGQLKIEQWVCLTKDRVIASD 

RDSWLEILDAIANDGREDSLDKA 

PLLRTVWKQALSRSEIEVEGSQVEWFVKPLTMGR 

I IRDVAKVDRGNKVRILNLLENQG 

FDALQAAGGIAWGQGKFDLLHRGYVHAPPVTDK 

PSRYRLAARMLQFPNTPLEGI PAW 

VPANTASITRWWKIEEAFWAVETLVDEAFDNEI 

FRPSIEGIRDDEEGPQIDIEKNVL 

PNLSEHLLLLTDNTLPATVDS ERVLVAI EVLNAD 

AIGTAVRKAMEVEPDVFKVDTVPG 

VEIWQVKHGSGDDELDDEFFDDLGFEEEIDEEQT 

PLLNTWAIAMVPAGKGSKKSHLVF 

SSHVEWLVDVAKRMQTGEGDKLADLDEVKDLLKL 

TSEMGADEVAFQRWRLRQSLRAK 

YELQRQGELKNSDSVLASLVRRMFEGDEPKADEP 

lERLNVSKWPAFSEVEKYFRNAFG 

FVETTETGWNLNGFLLK " 

complement(193565..193696) /locus-tag="RB8342"

complement(193565..193696) /locus-tag="RB8342"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78589.1"

/db-xref="GI:32445858"

/translation="MVSCKGTNQAMDGDIGISAR

NGRVYDIGRLSQSRRDAGSDGFR" 

193677..195134 /gene="purB"

/locus-tag="RB8343"

193677..195134 /gene="purB"

/locus-tag="RB8343"

/EC-number="4.3.2.2"

/function="purine-ribonucleotide

metabolism"

/note="PMID: 96015068 PMID:
2111814 best DB hits: BLAST:
gb:AAB60684.1; (U20225)
adenylosuccinate lyase [Mus
musculus]; E=1e-144 gb:AAC83935.1;
(AF106656) adenylosuccinate lyase
[Homo sapiens]; E=1e-140
gb:AAH00253.1; AAH00253 (BC000253)
adenylosuccinate lyase [Homo;
E=1e-140 COG: YLR359W; COG0015
Adenylosuccinate lyase; E=1e-132
PFAM: PF00206; Lyase; E=2.5e-57"
/codon-start=1
/transl-table=11

/product="adenylosuccinate lyase"



/db-xref="GI:32445859"

/translation="MPLQETKMTETQETPYQNPL

lERYASREMAFHWGPQRRFASWRK 

VWIALAEAEQELGIAITDAQIEQLKSFENKLNLD 

AAAKYERELRHDVMAHVHAYGDQC 

PDARGI IHLGATSCFVTDNADLLLIREALELTAK 

RLAATIDQMAKFAAEHRDLPCIiAF 

THFQPAQPTTVGKRACLWIYDLVLDLEAIEHRLE 

TLRARSAKGTTGTQAS FLELFSGD 

HDKVRALEKRIAEKLSFESVYAVTGQTYPRKVDA 

QLLDALSGIGQSLHKIATDIRLLA 

GRKEVEEPFEKKQIGSSAMAYKRNPMRSERICAL 

GRFVMSLQSSPAMTAATQWMERTL 

DDS ANRRLVI PQAFLAIDAALVLMQNVADGMWY 

PATIAKNLGAELPFMATENILMQA 

VAAGGDRQDLHEQ I RVHSQAAALEVKQNAGDNDL 

LERLKGDENFAGIDLEAAIDPHAY 

VGRAPQQVDEFMEAI ITPIRQRYSGSDSLSVEVT 

V" 

195118..195243 /locus-tag="RB8344"

195118..195243 /locus-tag="RB8344"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78591.1"
/db-xref="GI:32445860"
/translation="MKSRSDFDGRLPLNAKISPA
CHIAGEIKCLSWIVCSRNNKF" 

complement(195240..196583) /locus-tag="RB8345"

complement(195240..196583) /locus-tag="RB8345"

/note="best DB hits: BLAST:
gb:AAD19720.1; (AF124349) unknown
[Zymomonas mobilis]; E=0.54 PFAM:
PF00076; RNA recognition motif,
(a.k.a. RRM,; E=0.54"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78592.1"

/db-xref="GI:32445861"

/translation="MKSKRAALWLALGGLVGLGS

LNGASAADRFVADASYQGGMPTSQ 

AEWPVGHFDASCDESCDTFPEMECGCGELACDG 

GCGDMGCGDMGGGLFDDGGCGMFD 

CDLGDPFTLFGEACGYSVGGWVQMGYHNKALPLF 

NNRPDEYNLHQAWFYAEKAIDTSN 

GFDFGGRIDYVYGIDAQDTQAFGIDNGHWDTGWK 

HGEYGSALPQVYGEVGYGDLSVKF 

GHFFTIIGYEWAAPDNFFYSHAYTMYNSEPFTH 

TGAIiATYNASDDLTVYGGYTLGWD 

SGFEDNGDSFLGGVSLAMSDDVTVTYATVAGRFA 

DEPAGTTSQGYMHS I VTDI ALTDQ 

LQYVLQSDYMDAENGNGGNFKETYGUSnSTYLIYTV 

SDCLALGSRFEWYDADAGVFNGNV 

DVYALTLGANVKPHANVLIRPEIRWDWADGADVT 

GILENDQDDQTTFGIDTIFLF" 

complement(196580..196780) /locus-tag="RB8348"

complement(196580..196780) /locus-tag="RB8348"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78593.1"

/db-xref="GI:32445862"

/translation="MPDFPVIRAGVSMAMVSIAS

RREDDTSQQGLASTWRSTRLTGLL 

PWKVAVTVDAANPFE IDQGRAR " 

complement(196938..198383) /locus-tag="RB8350"

complement(196938..198383) /locus-tag="RB8350"



198374..198622
198374..198622



198577..198735
198577..198735



198802..199011
198802..199011



complement (199004 
74) 

complement (199004 
74) 



199178..199453
199178..199453



/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78594.1"

/db-xref="GI:32445863"

/translation="MGAQLPVLTASILNLGDTKR

PNWIGISTEKAMKLSKLALIAAVA 

CGVNATNLTGAS AND I GLVS CTS S LCD CGEPVCG 

CEPVGGCDTGCENEVTCGCETVGC 

LGDCGCNAGCDSGCDSLCGGGLGSCLPGLGDCCL 

DDPYTLFGDVCGIEVGGWAS IGYH 

TKALPLFNSRPDNVQLHQAWLYAEKAIDTSCGFD 

IGGRIDYVYGTDGPDTQAFGINNG 

HWDNDWDNGGD YGHAI PQLYTEMGYGDLS VKVGH 

FFTI IGWEWAAPDNFFYSHAYTM 

YNSEPFTHTGAIiATYNVSDDLSVFGGYTMGWDSG 

FEDNGDNFLGGFSAGLTDDLTLTY 

ATT IGRFGEDNAGGVEQGYMHS I VAD YAVSDNLQ 

YI IQSDLLDTEDETGATARDTFGI 

NQYLVYSVNDCLSLGGRFEWYQNEGVFDDGNNTG 

SDVYALTTGINYRPHANVLIRPEV 

RWDWVDGDTTGILENDDDDQTTFGFDTIFTF " 

/locus-tag="RB8353"

/locus-tag="RB8353"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78595.1"

/db-xref="GI:32445864"

/translation="MPPKCGEGRSLAHNDETNAV

HRFFNAHAACRRWHIETSRQSLRN 

SNQSPRKISSELKAIHHWGRIHRWVCEICPQPV 

LKTT" 

/locus-tag="RB8355"
/locus-tag="RB8355"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78596.1"
/db-xref="GI:32445865"
/translation="MGLRDLSSTGAQNDLETARG
VRNAPVNPHLRVAPNVPSGQTLTL PDPNTPRI " 
/locus-tag="RB8356"
/locus-tag="RB8356"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78597.1"
/db-xref="GI:32445866"
/translation="MTVSNHIHDNHRERWRRIRW
HDSAKRAAKKSKWATVSKPGVKRS 
TWAVSVPRSPAVPAVNLSPTCKKFT " 
.1991 /locus-tag="RB8357" 

.1991 /locus-tag="RB8357" 

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78598.1"

/db-xref="GI:32445867"

/translation="MDVDRAAVATAGTGHQDFLL

PLTSNGFVFTVLRTTPLRMHCVQT 

RIDLLVPFGNVT" 

/gene="gatC"

/locus-tag="RB8359"

/gene="gatC"

/locus-tag="RB8359"

/EC-number="6.3.5.-"

/function="tRNA-synthetases; other

protein-synthesis activities;

other tRNA-transcription

activities"

/note="PMID: 9342321 best DB hits:
BLAST: ddbj:BAB04384.1; (AP001509)
glutamyl-tRNA (Gln); E=2e-13



GLUTAMYL-TRNA (GLN)
AMIDOTRANSFERASE; E=6e-11
swissprot:O06492; GATC-BACSU
GLUTAMYL-TRNA (GLN)
AMIDOTRANSFERASE; E=7e-10 COG:
BH0665; COG0721
Asp-tRNAAsn/Glu-tRNAGln
amidotransferase C subunit;
E=2e-14 BS-yerL; COG0721
Asp-tRNAAsn/Glu-tRNAGln
amidotransferase C; E=8e-11
PA4482; COG0721
Asp-tRNAAsn/Glu-tRNAGln
amidotransferase C subunit;
E=2e-08 PFAM: PF02686; Glu-tRNAGln
amidotransferase C su; E=3e-15"
/codon-start=1
/transl-table=11
/product="glutamyl-tRNA (Gln)
amidotransferase subunit C"
/protein-id="CAD78599.1"
/db-xref="GI:32445868"
/translation="MARLARLQLTEQEQTDFGPQ
ITDILGFVEQLSELDTSGVEPMTS 
ALDVENRFRDDVPDQSLSAEVATRTAPASQDGTF 
LVPPVLGNTSSKK" 

199580..199726 /locus-tag="RB8361"

199580..199726 /locus-tag="RB8361"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78600.1"
/db-xref="GI:32445869"
/translation="MNDAYGLDKNPGCLITQAKR
HLDFIRERRRHQTRIFYRSTTVSL AESS" 

complement(199687..201144) /gene="Pph1"

/locus-tag="RB8362"
complement(199687..201144) /gene="Pph1"

/locus-tag="RB8362"
/EC-number="3.1.3.16"
/function="key phosphatases"
/note="PMID: 11298281 best DB
hits: BLAST: gb:AAF81068.1;
AF223364-3 (AF223364) protein

Phosphatase 1; E=9e-32 pir:E71538;
hypothetical protein CT259 -
Chlamydia trachomatis; E=4e-28
pir:H75265; conserved hypothetical
protein - Deinococcus radiodurans;
E=5e-28 COG: CT259; COG0631
Protein serine/threonine
phosphatases; E=4e-29 PFAM:
PF00481; Protein phosphatase 2C;
E=4.9e-09"
/codon-start=1
/transl-table=11
/product="probable protein
phosphatase 1"
/protein-id="CAD78601.1"
/db-xref="GI:32445870"
/translation="MSGSWRPGVKFAEQTHVGMR
RANNQDS S AVL I AES AERLNQRGH 
LF WADGMGAHAAGELASQ I ASER I SMHYYQSRE 
EAPEHAISEAVHLANAAIYERGQS 
NPEFLNMGTTASSLVLVNGAAYVAHVGDSRVYRL 
RQGVLEQLTFDHSLVWEMQASGKV 
HSDSVFSKSIPKNVITRSLGPSPEVLVDLEGPFD 
LEVGDRFMVC SDGLTGQ I EDDELA 
VLLDSLD VDKAARVMI DLANLRGGPDN I TL I WE 
VDDEHLTQNDGAPRKARSLGADEV 
SSRALIGTTAFCWTGAVGFGLAAALVSPRFVGSA 
IVAFILGS I SAAVWASGVFKPNER 
RRPLKTRRVRDRNSAGASSNPGMPGQSGAGQGGS 



LGTGPYRRFKANSHEAFLLRMDDVITQLRQTSEE 
RNWMLDWQNVDQLLQQARQATKEQ 
AWGTAVSSYCDATLAAMEQLRKHQDDSASDTWD 
L" 

complement(201387..202592) /gene="pepT"

/locus-tag="RB8363"
complement(201387..202592) /gene="pepT"

/locus-tag="RB8363"
/EC-number="3.4.11.-"
/function="cytoplasmic
degradation; proteolysis"
/note="PMID: 8978088 best DB hits:
BLAST: pir:B82211; peptidase, M20A
family VC1343 [imported] - Vibrio;
E=1e-34 swissprot:P54542;
YQJE-BACSU HYPOTHETICAL 39.7 KD
PROTEIN IN; E=2e-32
ddbj:BAB05188.1; (AP001512)
BH1469~unknown conserved protein;
E=7e-32 COG: VC1343; COG2195 Di-
and tripeptidases; E=1e-35 PA5390;
COG0624 Acetylornithine
deacetylase/Succinyl-diaminopimela
te; E=5e-07 HI1348; COG2195 Di-
and tripeptidases; E=5e-06 PFAM:
PF01546; Peptidase family
M20/M25/M40; E=6.8e-2
/codon-start=1
/transl-table=11

/product="peptidase, M20A family"

/protein-id="CAD78602.1"

/db-xref="GI:32445871"

/translation="MS WAAIDESAALDRFLELTR

IPGKSGDEAAISAAIEKMLVDAGV 

DPDWIQSDDAGTKTRLDGNAGNLIVTLPGDDSLP 

RTLLS AHMDTVP I CVGCNPQLRDD 

DQLGRIWADGPTGLGADDRSGCAVILSAIVERL 

SRQAKNPDLKLAPAAVTFLIQEEV 

GLEGARHLDVSKVGHVDRAFNFDGGALDKIRHGA 

IGGERVQVTIHGHAAHAGVAPEKG 

VS AI VI AS EAI S SLHRNGWLGLVEKDGRRGTANV 

GVFQGGDATNVITPEVQLRAEARS 

HDAEFRAE I VSQMRTAI ENAVQSVKDVQGRTGSF 

EFTSRVDYEAFRLAEDHPSVLAAT 

EL I S QLGRS PQCEVANGGLDANWLMLHGI EAVTL 

GCGQAS IHTVDEYLMVDEFLDACR 

LAAELIAPA" 

complement(202631..202903) /locus-tag="RB8365"

complement(202631..202903) /locus-tag="RB8365"

/codon-start=1

/transl-table=11

/product="hypothetical

protein-signal peptide and

transmembrane prediction"

/protein-id="CAD78603.1"

/db-xref="GI:32445872"

/translation="MSRSLIGSIAALVFAVPASA

HPGHGSTEGLTHYATEPAHWPML 

ASFAASGLLLAVGWFAMGRLNQSQTGTAAQPIRV 

EDQSPRRANRTR" 

202905..203057 /locus-tag="RB8367"

202905..203057 /locus-tag="RB8367"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78604.1"
/db-xref="GI:32445873"
/translation="MNSQESREMRRADSPADFTF
GDQTQCTCQPMSGIVEPDQPTDYT PWSSCC " 

complement(203030..204001) /locus-tag="RB8368"




/note="best DB hits: BLAST:
pir:F82221; conserved hypothetical
protein VC1259 [imported] -;
E=7e-86 pir:D83539; conserved
hypothetical protein PA0858
[imported] E=4e-81 pir:G75436;
conserved hypothetical protein -
Deinococcus radiodurans; E=2e-67
COG: VC1259; COG1054
Uncharacterized enzyme related to
sulfurtransferases; E=6e-87
S110765; COG1054 Uncharacterized
enzyme related to; E=5e-67 RP125;
COG1054 Uncharacterized enzyme
related to sulfurtransferases;
E=2e-49 PFAM: PF00581;
Rhodanese-like domain; E=1.5e-17"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78605.1"
/db-xref="GI:32445874"
/translation="MTESHASSVPAELPAESPAE
AGAADLPVAVAALYCFTPLPQFES 
LREPFRQRMSDDGIRGSLLLAGEGINGTIAGPRE 
LMEPFIDWLRSMELDDQATPFRGM 
DVKWS YCEE I PFRKTKVRLKRE I VTMGVSS IDPL 
RSVGTYIEPQDWNALVDDPDVTLI 
DTRNDYEIEIGTFEGAINPHTESFREFPKFVEEN 
LDPQKHPKVAMFCTGGIRCEKSTA 
YLKERGFEEVYHLRGGILNYLEKMPEEDSRWQGE 
CFVFDNRVAVDHQLQAGAYELCHG 
CGWPLTPEMKQHSDYERGVACSRCAAEVTYEQRK 
RRRMRQQQLDQGV" 

gene complement(204020..206089) /locus-tag="RB8370"

CDS complement(204020..206089) /locus-tag="RB8370"

/note="best DB hits: PFAM:

PF00602; Influenza RNA-dependent

RNA poly; E=0.35"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78606.1"

/db-xref="GI:32445875"

/translation="MADTRTKMLVTMIQWSKGI

S I VSKRNVRRPARRI FAAAMLLAG 

TAPVTFPGGIAPAAGQETAPRSVAKPLDARSRTI 

VQSVDMQVKRAGAAYAGGDYETSG 

DALRKAIKQIEVAGNAESSDLHDALLPMMRRIVN 

ARAMLELEGVTLPPFRI PERPKAT 

AEPMAEASGSDPSSRSATDGVSFVSEVAPILVDK 

CGGCHIRGSKGGFAMPTFAKLMEG 

PPEGWI FPGDVIASRLIETIETGDMPRGGGKVS 

KEQLDKLKQWVTEGAKFDGPDPTA 

MLTSLKSGGEPPVTPATPEPAQPAMVGKPTGNET 

VSFSKDVAPLLVKNCTGCHLDAMQ 

TRGGLRMDTLAQMLRGGDSGSVMKPGDGEASLLV 

QKLRGSAGDRMPAGGRPPLSDSD I 

QL I S TWI S EGAKVDEALVQTPMKWTAQAWLS AA 

ASSEVSQRRAEIAEDEFRLAGADV 

NRMQNHQSDHFAVWGEVS PATLETVAESAEAALQ 

QARKFLPQADGS PEDFFHGKAS I Y 

VLPKRYDYSEFAKMVEGRNVPAEWQSHWKYDGIR 

AYVAWASERASEEEIAERWAPV 

ASLAWSRGEGVPRWFAQGLGTAISGGESKRDRN 

ENMRRQAELVTAVGSLKSGKDFLN 

GKLPPERADRMAVAIAESLLSRQNRRGFDAVLRN 

LQSGQPFPQAFQAGMNMTPIAYVD AWLQWVK" 

gene 206164..207927 /gene="guaA"

/locus-tag="RB8374"

CDS 206164..207927 /gene="guaA"



/EC-number="6.3.5.2"

/function="purine-ribonucleotide

metabolism"

/note="PMID: 3894345 PMID: 2982857
PMID: 8548458 best DB hits: BLAST:
pir:D82282; GMP synthase VC0768
[imported] - Vibrio cholerae
(group; E=1e-163 ddbj:BAB04326.1;
(AP001509) GMP synthetase
[Bacillus halodurans]; E=1e-162
pir:G83173; GMP synthase PA3769
[imported] - Pseudomonas
aeruginosa; E=1e-158 COG: guaA-2;
COG0519 GMP synthase - PP-ATPase
domain; E=1e-104 PFAM: PF00117;
Glutamine amidotransferase cla;
E=8.1e-56 PF00958; GMP synthase C
terminal domain; E=6.5e-70"
/codon-start=1
/transl-table=11
/product="GMP synthase"
/protein-id="CAD78607.1"
/db-xref="GI:32445876"
/translation="MPATGITLPSPSGPPLPCSA
AIFVLDHRSRLPWPFPVSPLLFYT 
AALAFPAISGKNTMTSSANTTSPAIAETTWLTDQ 
RI LVLDFGSQ YAQL I ARRVREQNV 
YCQIVRHDISAERIAELAPKGIILSGGPNSVYEE 
GAPKCDEGLFDLGIPVLGICYGMQ 
LACQALGGKVDNTPSREYGRAMCEFTDRDSIFRG 
MQESEQVWMSHGDQVSQIADQFTA 
MAKTSTCPYAAIRHNERPVFGMQFHPEVTHTPHG 
GQILRNFVIDVCGCDGSWKLGDFA 
DAAIESIRKQVGNKRVICGLSGGVDSSWAALLY 
KAIGPQLSCILVDNGLLRKNEQQI 
VLEEFSNHFKTDLHIVEAEDRFLADLAGIDEPQE 
KRRRIGHAFIECFKDEAAKIEDAH 
FLAQGTLYPDVIESGADKDGPAATIKLHHNVGGL 
PEELGFELIEPLRDLFKDEVRRLG 
LELGLPEQLVWRHPFPGPGLAVRCLGEITRDKLV 
VLREADAI WEE I ENAGL YRETSQ 
AFAVLLPVQSVGVMGDARTYDNAVAVRCVNTDDF 
MTADWSHLPYELLARISTRIINEV 
KGVNRVCYDISSKPPATIEWE" 
/gene="guaA"
/locus-tag="RB8374"
/note="cosmid pircos-c4e02/ cosmid
pircos-a3c02 joining point"
/locus-tag="RB8375"
/locus-tag="RB8375"
/function="ABC transporters"
/note="best DB hits: BLAST:
ddbj:BAB20562.1; (AB041227) ABC
transporter [Myxococcus xanthus];
E=1e-165 pir:D71714; ABC
transporter ATP-binding protein
(abcT1) RP060 -; E=1e-162
embl:CAB71215.1; (AL138538)
putative ABC transpoter
ATP-binding; E=1e-156 COG: RP060;
COG0488 ATPase components of ABC
transporters with duplicated;
E=1e-163 Rv2477c; COG0488 ATPase
components of ABC transporters
with; E=1e-154 yjjK; COG0488
ATPase components of ABC
transporters with duplicated;
E=1e-146 PFAM: PF00005; ABC
transporter; E=2.6e-47"
/codon-start=1
/transl-table=11
/product="ABC transporter"
/protein-id="CAD78608.1"
/db-xref="GI:32445877"
/translation="MAGQFIYQITDLTKKHGQRK



KSTLLKIMAGMDTEFEGTARLGKGFTVGYLEQEP 

PLDPTKTVFENVQEAVAERQAIVD 

RYNE I SGLLGE VTDDDEMQKLCDEMATLQD I I DA 

NNLWELDRFVEMSMAVMNLPPKDA 

EITNLSGGEKRRVALCQLLIRQPDLLLLDEPTNH 

LDAESVSWLEQHLANYKGTWAVT 

HDRYFLDNVAQWILEVDRGRGI PFEGNYSAWLEN 

RAKRMALEERQQKAREKNLARELE 

WIRMSPKARQAKSKARIKSYEQMSAEQFEDRPDD 

LEIQIPSGKHLGSLVIEAQHVNKA 

FGEKTLMTDMNFRLPPGGIVGIIGPNGAGKTTLF 

KMLTGQEPVDSGEIKIGETVDLGY 

VDQSRDKLDPNKT I YQE I SGGHDS FEMGGRVMHA 

RTYVSRFNFKGPDQEKKVGVLSGG 

ERNRVHLASLLRKGCNVLLLDEPTNDLDVDTLRA 

LEEAIANFAGCVWTSHDRWFLDR 

LATHIIiAFEGDGKWWCEGNFDTYERNRRERMGE 

DADDDS PKSRYKS IHAG " 

/locus-tag="RB8376"

/locus-tag="RB8376"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78609.1"

/db-xref="GI:32445878"

/translation="MTILRTGTNEKYAEGFEAAF

GGGTPASKKTAKKATKKAAKKTSK 

KAKTQPAQSKKKSAKKSVKKAAKKSVKKAVKAKA 

TPKKAVKKAAKKSVKKSVKKTAKK 

SVKKSAKKAKKKS" 

/locus-tag="RB8378"

/locus-tag="RB8378"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78610.1"

/db-xref="GI:32445879"

/translation="MGSGRESQAFSRAPRHNRHV

LHNDVAPAKRPAFQLSSVPLMPAS 

PSREPNGNLPDHQLAAAAWSPPSQDVRTALQSLI 

GYLNFSSGPAPTAVFSAWNQVHDE 

ASGGDVLNGPPPFLIMRDWLEET^QSLRKSS PAF 

ANIDRALGLINLLWSGLLPAYLDF 

HRDLLFHQPPELLFNGFFLGRCAEVMVQQIGEDE 

S IGTVDDESARLRQI IGTLNDYVG 

YRPVAVLENRACQPYAHEFVRPI PQMVQGAGITA 

GP YREL I ERAVRAI EEAPED I LRS 

AAMDPKQLRELCLDPRAYDFDHPVNRRPNYHFGG 

WDEHAIDSDGRYDRFILRSVTLDS 

LIiARVNETPELPRDELLEEAASVLAGTMLMASGI 

SGWGPGCYSSDVTLRSLMVPIANF 

RDEFYNWRISRIGGEHGQRLLDEVKTRHQAFGAA 

RQNLNAALARRRAIQLQHVQLARI 

YARLGYPDSATKQADWPATSARLMCRIDCGMTL 

GLRALRTDRFPDAVEVPEQTFDLI 

RRAIECGALMDPWDILGFTGNFSLYPGIENSIHD 

SRLDELLYIIENLFGYIARVWSEA 

AARDDEVAYERMDRQYREVAQWWRTFAAHTVESV 

EATDPWESYESARLVAQALRLWHR 

GGAQRGDIAFWAPHADLFDSPRAYMLVISALLER 

KDF I PAQALL IHWLCQADRVGLRS 

GANSLPRLTERWLLQLRDHMREDSHELWPRVNKF 

FDYLEANAESFWSAPRFEVGDDEP 

RDRHRDNWEQELAEANEDPFGSPEEDDEAGLFDA 

AYENVS YRDTTDDGNEGS I FDTS S 

DSSSVDELEAEVKRLiASRLDFLQSLARMWAVGAD 

VALTEIHAATESTDSADSSDPQRR 

QQQIEALSAWARRAVANRIGLLELLGDVRAYKIK 

PAGTDKDSMRIYDRRRVLRDSLME 

RVITTAVEMADARRLIASALGALQGKLPEDVGEE 

FAEDDAGAI ELFS AL I AGDAERAR 

ERFPTFIEAILNRSLLYIPLSRGGDPVKIYVTRL 

RERVLRHLLHWLPRRGLFAETCRL 

I ETARQMEQRHP IGAGAVTEFDGLYRAGYRSLVG 

SIAEAVLHATAGEPPSPAPTEPQS 



HSQTLRLSPLEAVSDPAKWERLVE 

F IKLYGDP I FTQGFLQLSNVRAVLHQDVSSWLER 

AMTDGDDLLEDTQLFRDLEIGKIK 

INEASRWITLVFESLLDHHAEYQDYNSTTTQSDR 

GELIYTFLDFLRLRTRYERVAWNL 

KPIMWTHEVLVRMGLDQTSLGWRESLNERIAAEA 

ELYVQKLRELQRRHSMRMPTIADR 

lEERFVQPMTIDRMRSLVEPAMGDAEADRFSAAF 

EQLEAEANELSRTPVGVGLDLPPW 

LSALEDEVEKI GKRNL I SE IDPQELMT I PVS PLT 

LDELKTQLATAQTQGRRLPHMRRR KS" 

/locus-tag="RB8381"

/locus-tag="RB8381"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78611.1"

/db-xref="GI:32445880"

/translation="MRVPSGSRSALRVTAARVNE

PQTSSPPGCVHLAFESDKQNGDDF 

SAESRKVKKTLTQVRLKMASRGEVSEVNPKD I AG 

TQGNDPLL" 

/gene="glgP"

/locus-tag="RB8383"

/gene="glgP"

/locus-tag="RB8383"

/EC-number="2.4.1.1"

/function="C-compound and

carbohydrate metabolism"

/note="PMID: 7916624 PMID: 2424788

best DB hits: BLAST: pir:S77252;

phosphorylase (EC 2.4.1.1) 2 -

Synechocystis sp. (strain; E=0.0

swissprot:P73511; PHSG-SYNY3

GLYCOGEN PHOSPHORYLASE pir:;

E=0.0 gb:AAD30476.1; AF124787-1
(AF124787) muscle glycogen
phosphorylase; E=0.0 COG: slr1367;
COG0058 Glucan phosphorylase;
E=0.0 PFAM: PF00343; Carbohydrate
phosphorylase; E=0"
/codon-start=1
/transl-table=11
/product="phosphorylase 2"
/protein-id="CAD78612.1"
/db-xref="GI:32445881"
/translation="MSNTLSAPPNASPTDTSFPH
LDLSSELARHLTITLGHDFTGDST 
DKQNSEYLYQALAITVRDRLVPIWLETWKKTCLS 
EDRKVYYLSLEFLIGRSLTNAVEN 
LDLDEDVRKALRAYSVGMEEVADKELDAGLGNGG 
LGRLAACFLDSCANLQLPWGYGI 
RYE YGMFHQH I EDGRQVED PDRWLRDGNPWE I KR 
PEDTRRVRFYGRTENYYDEHGTLR 
PRLVDSYDVLAVPFDMPVPGYRNDTVNTLRLWKA 
STTDVFNLSEFNAGSYPEAVAAKN 
DAEQISMVLYPNDASENGKELRLKQQYFLVSASL 
QDVIARWVEQHGEDFSDFGRKNCF 
QLNDTHPACAVPELMRLLMDEHGLEWDDAWDWT 
RCMAYTNHTLLPEALERWSVGLFS 
RLLPRLLD II YE INARFLKLVDQQWPGDVAMRRE 
MSLIEEGDNPHIRMAYLAIVGSFS 
VNGVAGLHTQLLESGLFKHFNTLWPRKFNNKTNG 
VTQRRWLSHCNPGLRDLLNETIGE 
GWQKDLTKIKELTPYATDADFRKKWIDVKQENKA 
RLSDLWAETGVRFDTSFMFDVQV 
KRIHEYKRQLLNVLHIVHLYDRILRGETAGMVPR 
CVLIGGKAAPGYHVAKLIVKLIND 
VAKKVNNHPAANDLLKWFFPNYRVSSMEVICPA 
TELSEQISTAGKEASGTGNMKFMM 
NGALTIGTLDGANIEIRENAGAENFFLFGLDAAE 
VTELKKDYRPNEIIAADEDIVRIM 
NLLESGHFNPDNPGLFDLLTSGLRNPQDPWVTIA 
DLRAYIDAQAEVGKAYQDVDHWNQ 
MSILNTAGSGWFSSDRTIQQYADDIWDVRPLS " 



/locus-tag="RB8384"
217623..218924 /gene="insA"

/locus-tag="RB8384"

/function="retrotransposons, viral

and plasmid proteins;

recombination and DNA repair"

/note="best DB hits: BLAST:

gb:AAD45539.1; U70376-1 (U70376)

InsA [Streptomyces netropsis];

E=5e-33 gb:AAG60893.1; AF322013-12

(AF322013) ID460 [Bradyrhizobium;

E=8e-33 ddbj:BAA19966.1; (D63923)

transposase [Acetobacter

pasteurianus]; E=1e-26"

/codon-start=1

/transl-table=11

/product="probable InsA

(transposase)"

/protein-id="CAD78613.1"

/db-xref="GI:32445882"

/translation="MLHGELGMDVAELEQLEDRL

NAYLARFGDCFRRSDTRAHLTTYV 

RGQLSDLDAKSVEPIALQAGTPVRTLQEFIAQHR 

WDEDGLRRRLIHIVRDEHVNKNTV 

AIIDETSDVKKGDKTPGVQRQWCGKVGKQENCIV 

TVHLAAANEDFHCMVDGELFLPES 

WSNDRERCAAAGI PDEMVYRPKWQIALELLDRSK 

EEGIEYPWLTADEGYGGKPGFLEA 

LADRDQKFVLEVPRTFSVWEKHPEVTEQPYRKGG 

RGRGRKTPRVKSGESSPRSVETVF 

WHGEAMKAKRWKRYRVKDGEKGPIIWEAKRVRVT 

LKGSDGLPGLSLWLWARNVLDGE 

LKFFVSNASEFASMAMLLQVAFQRWRVERCFEDQ 

KQEVGLDC YEGRRYLGLKRHL II T 

SLSYLFLSQTCQQEREKKSGVDNSANSRRGRRNR 

CQLVTPS " 

218935..219087 /locus-tag="RB8385"

218935..219087 /locus-tag="RB8385"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78614.1"
/db-xref="GI:32445883"
/translation="MLLDQVASRINYHQSRNVAA
RISHTQTKLDRYAELGIDPESITR CRWPKT" 

complement(219229..219744) /locus-tag="RB8387"

complement(219229..219744) /locus-tag="RB8387"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78615.1"

/db-xref="GI:32445884"

/translation="MIGQMIADSGQLCIGYAKRL 

MTGVTQEQFARFAWGDTVIESNH 

PAFILGHLSLYPCQVLEQVGANTSAVKPSEKYDA 

LFSKHAKCLDDPNGTLYPPMEEW 

EKFFSSHGAAIEALLNADDAVFSTPNPTEHLREK 

FSTVGSLHGFYLSGHMMLHIGQFS 

AWRRAMGLGPA" 

complement(219960..220880) /locus-tag="RB8388"

complement(219960..220880) /locus-tag="RB8388"

/note="PMID: 11481431"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78616.1"

/db-xref="GI:32445885"

/translation="MSMVRRKPRRGTAPQLNSIM

DGEI IMATWCTRWLDAFPASYVT 

VLRNAVAANLNRPHRFVCVTDNVAGLTDGVEGVQ 

MPDLGI PMQRQQKGCWPKLS ILAP 



FHALREWNPTLWNLAPLKMRPDRG 

VQGSILGFFPGEQAQLFHRFNENQQECFERFPLD 

QDFLTENVEDVRYWPYEWTASFKW 

HCLKYYPFNQLFPEIKRPAKAKWVFHGNPRPID 

WPQGDYYWGTKRKFGRGPVDWVR 

AYWLRHDASWAENLWWRKSSHQHQAESQAA" 

complement(220915..223734) /locus-tag="RB8389"

complement(220915..223734) /locus-tag="RB8389"

/codon-start=1
/transl-table=11
/product="hypothetical
protein-putative transmembrane

protein"

/protein-id="CAD78617.1"

/db-xref="GI:32445886"

/translation="MKFSCSIRSVHLSSPVTRLS

NTRQIGEIPENSDVDTGLDPYRCQ 

SEAPDDVRLIRTLGNGRAATAELVEASWNDGRRH 

LYVEKVFAPGLLTRS I YRLAFAAP 

FAYQSNQHAILACFYRRRWSRMLEASAIRDERD 

SSVDVALPAYVRYDRDREAYVLAA 

S FVEGRGP I PACANQNPATVENEMAGLVAEMRRV 

ERQLFDFGLVGSGWQVS PGAMVST 

ANLLLSEHAGLRWHSDESEADSPRSLRYTIIDL 

ESGIPAILVWRYLWQSWRRGAVFP 

FDDLDSQKLRASCADLVDQLVQRGEQNLAAELSA 

DVEALVAADEQWKAGEIAPFRRPW 

TWFSAQRRAHYRRTCIDRWQQQRLVDSENVPALE 

TSVWFFTLMWLLGVCPPGAIGRWF 

QS CLANRSHRERVRRFATDKRVRS AFWMKYRRRQ 

VKALREQQRCSNRTRPASSLGFIG 

HRLLRYTTTAGVHRYLVDPTRRARRNNQMRALLF 

RGSYQIAFGRKAFRAATRRWQSRE 

WLTDDEAMRLDSQFSGPQLAIYSRGLGMHLAIKF 

FYPLITPLKVGGIAMALTGGSVWF 

AILPIMILPAARTLVTLFSLLANRSHDVPHRHAM 

FVGAI PWGSAAFI VQMWTANPAV 

SKFLLRDFASRFARKIPIYGGPDSRTEHAVLAIG 

ERLLRLVAVFHPTSNEVAEAMPSR 

SESKDVQTS I AKEQSHWMRWP YLKVMRDSLLATA 

VCCGGLWAANYMLADSLSNQWLTG 

ENGALES VQLLVLVCTAI AGCWALLRVKAS SWQS 

I AVALVCVAVAGAARE I PSAVSQA 

SLSDGAYQTAEATGFAISAAWKHGVIAVAVCVWL 

SRVAYAWLKFPEDRKLWFSPMFVW 

PAIPFASCFVLAELFERSKWVTAEETVEVFGYTM 

MMTVAIWMIRNANVLQDWRSEEAN 

RDGRSADMIVSERMDPSAEGMKRAA" 

complement(223815..224939) /locus-tag="RB8394"

complement(223815..224939) /locus-tag="RB8394"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78618.1"

/db-xref="GI:32445887"

/translation="MSTSASLPSESQTSPRFSYA 

LAS VGD I QVGTVHRS PGGRI S LRE 

LEIDGQPWATRRFWRSFFTRFGIAENVFRYFRP 

DEVFSRIQSQRDDVTFRYCIARSS 

SAS S KRSDQLLAVTNPNRP 1 1 RYDD I RGLLDGNG 

GEDIRYHNGWTSTHAPRGGSRQF 

AIGGDQFRDRFCLETPVDGYGHPRLFLSMLRLVC 

SNGMIGHAKAFRSDVPVGKNMEHC 

IVRAIESFDNGDGYAALRQRFESSQTSWASVHEC 

LRLNELLESLRRDKQLTQDGLIGR 

FRSLAGDLNELYGLANLDALSDKRRRILPSKARV 

YDLINYASEVATHHSHADGANRIQ 

AWLGTLVSEEYDLEGTANSGGDFDAFFVTSDDGL 

PAQSRN" 

complement(225129..226310) /gene="deaD"



complement(225129..226310) /gene="deaD"

/locus-tag="RB8396"
/function="phage proteins"
/note="PMID: 8472961 PMID: 7836307
PMID: 10452953 PMID: 1901561 best
DB hits: BLAST: embl:CAB52498.1;
(AJ131519) putative DEAH-family
helicase; E=1e-11

swissprot:P24125; V51K-BPL79 51.5

KD PROTEIN pir:JU0445;

E=5e-10 embl:CAB63670.1;
(AJ251789) hypothetical protein
[Lactobacillus; E=1e-08 COG:
VC0812-2; COG1061 DNA or RNA
helicases of superfamily II;
E=1e-08 DR1624; COG0513
Superfamily II DNA and RNA
helicases; E=5e-04 PFAM: PF00270;
DEAD/DEAH box helicase; E=0.024
PF00271; Helicase conserved
C-terminal doma; E=5.5e-07"
/codon-start=1
/transl-table=11
/product="probable DEAH-family
helicase"

/protein-id="CAD78619.1"

/db-xref="GI:32445888"

/translation="MSTAILPAIPSEPSSELDLD 

GFAIQPRPYQHRI IRRTLNAFGCG 

EHEFCSAIDGAGQGVSRPSSVLIESPTGSGKTVM 

GLAIAAIMQRTFGIRVGWVAMRRN 

LLAQAQAENIRRKFGLDMQLISMFDKDPPDVDLL 

VIDEAQHDAATSMANLHGQIQPRW 

TLGLSATPYRTDRVKLCFENVIRDAGIAHLIADG 

FLSAYHHYTIEEYSPQSLSELLID 

EPERWGKSLVFFHKQTECDELKSRLDQAGIRSDV 

VTAKTNRDRQLADFIEGHTDVLIN 

MAILTEGFDCPTLKTVFCRPSGQGTTIQMAGRVL 

RKCESEPIKQWQCRHTRHPMMRT 

AI PAEQYLWTEYGWRS I AGNRQLDAMAMAARQRV 

AKANVSLPKLIAAHRPRRRMHRLA R" 

complement(226470..228596) /gene="uvrB"

/locus-tag="RB8397"
complement(226470..228596) /gene="uvrB"

/locus-tag="RB8397"
/function="DNA repair (direct
repair, base excision repair and
nucleotide excision repair)"
/note="PMID: PubMed=10601012 best
DB hits: BLAST: swissprot:P56981;
UVRB-BACCA EXCINUCLEASE ABC
SUBUNIT B; E=0.0 pdb:1D9Z; A Chain
A, Crystal Structure Of The Dna
Repair Protein; E=0.0 pdb:1D9X; A
Chain A, Crystal Structure Of The
Dna Repair Protein; E=0.0 COG:
BS-uvrB; COG0556 Helicase subunit
of the DNA excision repair
complex; E=0.0 VNG2390G; COG0556
Helicase subunit of the DNA
excision repair; E=1e-179 TP0116;
COG0556 Helicase subunit of the
DNA excision repair complex;
E=1e-176 PFAM: PF00271; Helicase
conserved C-terminal doma;
E=2.3e-19 PF02151; UvrB/uvrC
motif; E=1.2e-12"
/codon-start=1
/transl-table=11

/product="excinuclease ABC subunit
B"

/protein-id="CAD78620.1"



/translation="MSITKLAPAAFDLHQPFPPS

GDQPAAIAKLIEGIQSGKTAQTLL 

GATGTGKTYTMANVIASVQRPALILSHNKTLAAQ 

LYGEFKEFFPNNAVHYFVSYYDYY 

QPEAYIPQRDVYIEKDSSINEEIDRLRLATTSSL 

VSRRDWI VASVSS I YGLGS PDDY 

RQLWDLHQGEQTRRDHLLLKFVDLQYQRNDIQF 

ERGKFRVRGDS lELWPSYEEFAYR 

lEMWGDEIEKISLIKPTSGETIKTVEHLYIYPCK 

HFVMPEDRIQRAIRLLREELTQQL 

EIFQSQGKLLEAQRLSARTKFDLEMLAEVGHCPG 

lENYSRPLSGKEPGATPDTLYDFF 

PKDFITFVDESHVTVPQVRAMYAGDRSRKITLVE 

HGFRLPCALDNRPLKFDEWEERTG 

QICFVSATPSDYELERTGGEWEQIIRPTGLLDP 

EVEIVSARGQVTHLLEQVRIRAER 

DERVLVTALTKRLAEDLATYFQEQGVKCRWLHSE 

LNAFERVDLLQELRAGQFDCLVGV 

NLLREGLDLPEVSLVAILDADKEGFLRSETSL I Q 

TIGRAARNANSRVILYADKVTDSM 

QMAIDETERRRVIQMEYNAKHGI VPKTVRKS IRK 

GIDTEAANHKESTRKAQDSGEAIY 

ITIEYVDKLEQEMLAAAEDLEFERAARLRDRVLQ 

LKEHIGKPLSEVEIVDEKSAGKSG 

GRGRGRRGAKKKGASKGTKI PRPKRG " 

gene 228679..229314 /gene="tdk"

/locus-tag="RB8399"

CDS 228679..229314 /gene="tdk"

/locus-tag="RB8399"
/EC-number="2.7.1.21"
/function="nucleotide metabolism;
pyrimidine-ribonucleotide
metabolism"

/note="PMID: 2060797 PMID: 2041474
best DB hits: BLAST:
gb:AAK03320.1; (AE006163) Tdk
[Pasteurella multocida]; E=2e-66
swissprot:P44309; KITH-HAEIN

THYMIDINE KINASE pir:;

E=3e-65 swissprot:P23331;

KITH-ECOLI THYMIDINE KINASE

pir:; E=2e-64 COG: HI0529; COG1435

Thymidine kinase; E=3e-66 PFAM:

PF00265; Thymidine kinase;

E=4.9e-88"

/codon-start=1

/transl-table=11

/product="thymidine kinase Tdk"

/protein-id="CAD78621.1"

/db-xref="GI:32445890"

/translation="MAKLYFYYSTMNAGKSTVLL

QSSYNYRERGMNTLILSPEIDTRF 

GSGKVASRIGIESESVSFNTSDNLLNLVRNETRI 

NPLHCVLVDEAQFLTRTQVRQLSD 

VCDDLDIPVLAYGLRTDFQGNLFEGSEHLLAWAD 

TLTELKTICHCGRKATMVLRVSES 

GQVIRDGEQVQIGGNERYQTVCRLHFKEAIYQRA 

EDELPLLDSNEPRQSER " 

gene 229393..231687 /gene="rnr"

/locus-tag="RB8402"
/note="synonym: VvacB"

CDS 229393..231687 /gene="rnr"

/locus-tag="RB8402"
/EC-number="3.1.-.-"
/function="rRNA processing; mRNA
processing (5'-, 3'-end
processing, mRNA degradation)"
/note="PMID: 9603904 best DB hits:
BLAST: ddbj:BAB07272.1; (AP001519)
ribonuclease R [Bacillus
halodurans]; E=6e-88
swissprot:O32231; RNR-BACSU
RIBONUCLEASE R (RNASE R) (VACB
PROTEIN; E=8e-84 gb:AAK05040.1;
AE006328-8 (AE006328) ribonuclease



BH3553-1; COG0557

Exoribonucleases; E=1e-76 PFAM:

PF00773; RNB-like protein; E=2e-87

PF00575; S1 RNA binding domain;

E=3.2e-08"

/codon-start=1

/transl-table=11

/product="ribonuclease R"

/protein-id="CAD78622.1"

/db-xref="GI:32445891"

/translation="MQVSQELIDRVLRLVHAAEY

RPSKPKQIAALLELDADGYREVRR 

VIKQLVLEGRLIYGGNHLWAAAAVGGPTDQIRG 

TFRRAMGGGFGFVRPS SGGGNADA 

DVPEDVFVPTGMTAGALEGDLVAVTIEPSRRGGI 

EGKWEVLQRARRQFTGTFFS S PQ 

PDQPGSDTIEGPWYLDGVHYEAPVSVGDVRGLP 

LQDGDKIFVEIVDFPDEESGGGEA 

VILERLGSSKNPAIDTLTIMRQYALPDEFSEDVL 

DEAREQADAFDDDWPTDRKDLTD 

MLTITIDPFDARDFDDAISLQREDGRWRLWVHIA 

DVSHFVPPGGKLDVEARRRGTSVY 

LPDRVIPMIPEIISNHLASLQPERMRLVKTVEIE 

MLDDLTITHSEVHNAAIRSDKRFN 

YEQIDQFIASPAAFQKDWGDSICELLTHMHTLAM

QIRKRRFKDGALSMDMPDIKLELD

RTGKVKGAYQTENTESHQIIEEFMLLGNEAVATW

LDDQELNFLHRIHAPPERRKLRQL

TSFVKDLGLGFDNVESRFEIQAVLDKVAGTTLEN

AVNFAVLKSMSKAVYGPHREGHYA 

LDKEHYCHFTSPIRRYPDLSVHRLVQRLIEQKST 

PDESFAELVKLGHECSDAERNAAQ 

AERELIQLKLLHFLKKKQGETLEAVISRVFADGI 

HARCLKLPVDGFIPVTELPSDQYR 

FERRGQSLTGFKSGNRFRLGDHLTVRIGKVDLQD 

RQLYLEWKNHSAAKSDPRGPSGS 

KSKKSPLRHQKKSDRREKKKRRRR " 

/gene="cobT"

/locus-tag="RB8404"

/gene="cobT"

/locus-tag="RB8404"

/EC-number="2.4.2.21"

/note="best DB hits: BLAST:

pir:E64856; hypothetical protein
1120 - Escherichia coli;

E=2e-60 gb:AAG55866.1; AE005321-9
(AE005321) putative nicotinic
acid; E=2e-60 gb:AAC78722.1;
(U89687) putative nicotinic acid
mononucleotide: 5, ; E=4e-60 COG:
ycfY; COG0846 NAD-dependent
protein deacetylases, SIR2 family;
E=2e-61 PFAM: PF02146; Sir2
family; E=3.1e-57"
/codon-start=1
/transl-table=11

/product="probable nicotinic acid
mononucleotide:5,6-dimethylbenzimidazole
phosphoribosyltransferase"
/protein-id="CAD78623.1"
/db-xref="GI:32445892"
/translation="MNVLILTGAGISAESGIPTE
RDANGLWEGHAVEEVATPQGFARN 
PNLVHEFYNQRRRALLNPEIQPNAAHVALADFER 
EHLENGRGDFLLVTQNIDNLHQRA 
GSQNVLAMHGQLLQVRCVYSEEIFDWTGDLSVDT 
PHPEAPDDDSMRGCLRPNWWFGE 
MPIGLTQIEKAATKADLFIAIGTSGWYPAAGIV 
AQTPPHCRRIEVNLDDTPASSAFD 
ETIRGAASVEIPKLLNHFSAM"
2336 /locus-tag="RB8406"

2336 /locus-tag="RB8406" 



/codon-start=1



complement (233700. .2345 
45) 

complement (233700 . .2345 
45) 



234547, 
234547- 



.234840 
.234840 



234746 . .235063 
234746. .235063 



/product="hypothetical protein"

/protein-id="CAD78624.1"

/db-xref="GI:32445893"

/translation="MLACVCEKVKTSIVMLLLGE

FARISASVRQSPCRDEFTMPAFHI 

QRSQTIDADTRDVYDAVRDYSTWTRWSPWLQVDP 

DAEVTVSDPSNELGATYHWKGELV 

GEGSMTHRQLQAPKSLSANASVQADLAFVKPFKS 

QSKVEFEIEPVMTDGRPGSKITWH 

MRGKLPWFLFWMRSMMETFVGMDYERGLLMFKQF 

VETGEVLSKLEIKGWSEPDRMII 

GQRGGCTMDDIGCHMAATLERVKSHYNADDERVH 

EWASLYHTTSDLRKRWFDYTAGYL 

ADAGTPVPDGCVADTVPAGKFLLVRHIGEYAHLG 

NAWSGGIQYIRYKKLKMAKAIGCE 

VYRNDPETTETKDLITDVYIALK" 

/locus-tag="RB8407"

/locus-tag="RB8407"

/note="best DB hits: BLAST:
pir:T37061; probable secreted
protein - Streptomyces coelicolor;
E=2e-27 pir:T35483; hypothetical
protein SC6C5.12c SC6C5.12C -
Streptomyces; E=2e-16 pir:T35506;
hypothetical protein SC6E10.19c -
Streptomyces; E=9e-11 COG: DR2293;
COG3021 Uncharacterized BCR;
E=7e-04 PFAM: PF01260; AP
endonuclease family 1; E=0.88"
/codon-start=1
/transl-table=11
/product="probable secreted
protein"

/protein-id="CAD78625.1"

/db-xref="GI:32445894"

/translation="MHWMLILFVRKMLFLAIAFA

SFAFMLRSPLVLAQEQETNSTIRL 

RVLSYNIHHGRGTDGKIDLDRLANVIRSVDPDLV 

AVQEVDQNTRRNGMVNQVETLAVQ 

TSLHGKFAKQIDYDGGEYGQAVLSKYPIESLEVH 

WLPGDPIRERRIVGVAEILIHKTR 

LRFATTHLHHSRADLREKQATELNRLLACGSTPV 

IIAGDFNAKPASLAMQTLQAKWRI

ATTESMHTFPARSPNRQLDYVAMYPANSWRIVES

EVLDEPVASDHRPLLVEIELETHSGSLPK"

/locus-tag="RB8409"

/locus-tag="RB8409"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78626.1"

/db-xref="GI:32445895"

/translation="MFDSKSKPSETAATVRRPVW

THQDFSGGGYFGTIPKSTDRCGVF 

TAGVLTADVDLRCIGLKRSTNEDDALFSTEQSRC 

SIQNWRSQKIPHQPENWDR"

/locus-tag="RB8410"

/locus-tag="RB8410"

/note="best DB hits: BLAST:

embl:CAB57873.1; (AJ132579) coat

protein readthrough [Oat golden;

E=0.10 embl:CAB57874.1; (AJ132579)

coat protein [Oat golden stripe;

E=0.10"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78627.1"

/db-xref="GI:32445896"

/translation="MTLCSLQSSRDAQSKIGDRR

KSRTSPKIGTGERSIGALKTREQL 

NRLRFAGQPSYTEGSTRRPLPPSAGQLAGSSESQ 

SLTCETMPRNRIANDDSPRMERLA IKR" 



235092..236180 /locus-tag="RB8411"

/function="transcriptional
control"

/note="best DB hits: BLAST:
pir:H83475; probable transcription
regulator PA1359 [imported]
E=0.067 embl:CAA64677.1; (X95394)
aldehyde dehydrogenase-like
protein; E=0.52 COG: PA1359;
COG1396 Predicted transcriptional
regulators; E=0.006 PFAM: PF01381;
Helix-turn-helix; E=7.3e-11"
/codon-start=1
/transl-table=11

/product="similar to transcription
regulator"

/protein-id="CAD78628.1"

/db-xref="GI:32445897"

/translation="MAQTASFLTRKIGFGMAQAN

AQLKQARQSLGLSQLQLAMRAGVS 

SRTVQFAESGQNVS IGTMRRI AGALGMQADQLVR 

IDPGTGQDGFAELPWSVADKFRSN 

RGFDEGSMCRNEADVIEWRQLRENFSVQIQKWG 

TSHDQQQALQKNSAIDEVYFRYEQ 

RYVDLWRRNPECIRLDRFEDTVGGVS IVLPLTAE 

S FHAFRDGKLAWLD I S ADDLADQS 

QYLLLDSVTEFTKQCRRPWYQVTKSLSLITFNQV 

ASLAQSPNQSDFEMVSFSASPLNE 

RRLGTIGFIPEPTKEPEFSYPIYWFGEDPRILAK 

EEYSNWATFKHFAMLIKSVDKAGL 

RRRMIRNLLSMVKRLQRPAERSISRQAA" 

complement(236128..2372 /locus-tag="RB8414"

04)

complement(236128..2372 /locus-tag="RB8414"
04)

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78629.1"

/db-xref="GI:32445898"

/translation="MLRFVWRNRLATTGRLTWIG

GGSMNGKILSYSAGSTSLDPFKFR 

VIDSQKPTDKMKRLATIFPDLQPFFVDPSQSLW 

DAYPAALGGMEAMTRVITYLHPPT 

CIRALRLAAAENRRWF I AQPLAGADLLLQAMET 

QMDWPTELLWATGGYPLPASLERS 

VEAWLANRGCRLTVLQAYGVAELDHTLMASMHRG 

SDSHPIYQLIDPRLELDSFEDGCS 

LNKHVRFQGIRTANQDRIESCGSGYRIHGNPSLY 

GDGALRWLEKWQPNDWWNCTGYLS 

DRDGAI ALQQRRGRTTNAEVS INCLSLPAMASLP 

RGVACLPVEHFDFMSDDGMSWMEK 

PKWNPAAFKQLDAKS I ARRAAAVA " 

complement(237132..2377 /locus-tag="RB8415"

58)

complement(237132..2377 /locus-tag="RB8415"

/note="best DB hits: PFAM:

PF00583; Acetyltransferase (GNAT)

family; E=0.72"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78630.1"

/db-xref="GI:32445899"

/translation="MDVLHCFGTADPLFDWQEI

RRDGSEQTLYHYDEYSNHYILKWQ 

GQPLGTLTMLAAANGRLDCEDHYPTVLLDQHGDE 

LFASCKLRIRRGPSTPIAALRTLV 

RGS WAHQLANGKRI S I ANAD VRMKAFYRRMGFT Y 

LPGFDFIHPELKTQSIVLLMAADA 

RGRSYCQDMFAELAKQSDQPALVEMCCGSFGETG 

SRPRVGSLGIAAVA " 

237883..237999 /locus-tag="RB8417"

237883..237999 /locus-tag="RB8417"



gene 
CDS 



complement (237986. 

21) 

complement (237986 . 
21) 



/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78631.1"
/db-xref="GI:32445900"
/translation="MRMRSNAKAYRISDRHKARR
LRDSTIEMLICFRFITSN"
,2404 /gene="priA" 



2404 



gene 
CDS 



240382. 
240382, 



,240843 
,240843 



/locus-tag="RB8418"
/gene="priA"



/locus-tag="RB8418" 
/function="DNA synthesis and 
replication" 

/note="PMID: 9086272 best DB hits:
BLAST: swissprot:P94461;
PRIA-BACSU PRIMOSOMAL PROTEIN
(REPLICATION; E=1e-130
embl:CAA71348.1; (Y10304) priA
[Bacillus subtilis]; E=1e-123
swissprot:P74397; PRIA-SYNY3
PRIMOSOMAL PROTEIN N'
(REPLICATION; E=1e-112 COG:
BS-priA; COG1198 Primosomal
protein N' (replication factor Y)
-; E=1e-131 PFAM: PF00005; ABC
transporter; E=0.66 PF01443; Viral
(Superfamily 1) RNA hel; E=0.33
PF00271; Helicase conserved
C-terminal; E=7.8e-09"
/codon-start=1
/transl-table=11
/product="primosomal protein N"
/protein-id="CAD78632.1"
/db-xref="GI:32445901"
/translation="MGYLAPAGKKRSSHVMLSDS
SLSNLGDPTLNFPPFAAAPVPADS 
SSSEPTQNELFETDPPPWELTVGEDVQLASIVFA 
RSPHGPYDYRIPDDLLDVLRPGMR 
VGVPLGHRKKPTPGWCVSIKTGNAAQKKLRDVSE 
VIDDEPLCDAALVRLVMFIAHYYQ 
VPAGQVFDTLIPASVRDNAGTRKTTYFRPAPGLT 
EEQIAKLPSKQQSAMRFLIAQDRP 
MTAAEIAIMAECTEDPIRRLRKKELLVPEVRREL 
SQNIRIRAQANDGETRKTHDLTAQ 
QENALSRINSAVDSGRGRTLLLHGVTGSGKTEVY 
IQAI EHWKQAGSAI VLVPE I SLT 
PQTRGRFEDRFQNVAVLHSQMSASERHFHWQRIR 
RGEVQWIGPRSAVFAPLPNLGLI 
VIDEEHDTSFKQDKQPRYHARKVAHARAMALGIP 
LVLGSATPSMEAWHATQTGHAELV 
TMSERVGNRPMPDVQLVDLRVKEERGKGGAISRP 
LHAAVLETLKEKGQAILLLNRRGY 
ATTIQCPACGTVCACPDCDMPLTHHRDGGKAMCH 
YCDYTIPTPPWCPACRFDGMRYGG 
LGTQRLEMEAKARFPDARIARMDSDTMKRAGSHQ 
RVLSEFRAGEIDVLLGTQMIAKGL 
DFPNVLLVGVINADSALHFPDFRAAERTFQLVTQ 
VAGRTGRGDRGGRVIVQTFTPEHP 
AIQAAARHDYLKFVEDEMVNRKKFNYPPLGSVAR 
IIIRGPLEDKTESVADAIVDRLEK 
ARDLLKAEVRILGPAPPPIVKISGKYRFHLLLQA 
TEAAWGEVIRRGLADFKVDPKEE 
IEFLVDIDPVNLM"
/locus-tag="RB8420"
/locus-tag="RB8420"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78633.1"
/db-xref="GI:32445902"
/translation="MSSFSSRRAQDNPSIIAHPV
TRLVTRLNCPALPRPESPSLSRSF 
HPELLTQTPPEPLVPIRLGRGRRLGSHRGAAKDG 
NRGSRNAHPERRRPHQTAPHRVKE 



240840. .24 
240840. .24 



241270. .24 
241270. .24 



complement 
76) 

complement 
76) 



AIRIPSSHYTGSLKAMR" 
1310 /locus-tag="RB8421"

1310 /locus-tag="RB8421"

/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78634.1"
/db-xref="GI:32445903"
/translation="MTEHRPSPMHPRSRSVPTIL 
RNILAWAGLFLGSVLNMAIVTIG 
PILIPLPDGVDMSDMDQFAENLKLLKPANFFAPW 
LAHAFGTLVGAFVAAKIAASHKMK 
FALGIGVFFLLGGITMAMTFGGPLWFIVLDLVGA 
YLPMGYLGGTLARATRPQPT " 
1428 /locus-tag="RB8422"

1428 /locus-tag="RB8422"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78635.1"
/db-xref="GI:32445904"
/translation="MELSREQRDRNPRDVRYIST
APSGAAGWLLNRPLGASTPSYERR LLRSHPAI" 
(241421..2433 /gene="atsG"

/locus-tag="RB8423"
(241421..2433 /gene="atsG"



/locus-tag="RB8423"
/EC-number="3.1.6.-"
/function="sulfatase"
/note="PMID: 9634230 best DB hits:
BLAST: pir:F70837; probable
sulfatase (EC 3.1.6.-) atsG -
Mycobacterium; E=1e-40
gb:AAF55607.1; (AE003724) CG14291
gene product [Drosophila; E=1e-20
gb:AAF29467.1; (AF156255)
N-sulfoglucosamine sulfohydrolase
[Mus; E=3e-20 COG: Rv0296c;
COG3119 Arylsulfatase A and
related enzymes; E=1e-41 BH1718;
COG1413 Phycocyanin alpha-subunit
phycocyanobilin lyase and; E=0.010
PFAM: PF00884; Sulfatase;
E=5.2e-22"
/codon-start=1
/transl-table=11

/product="probable sulfatase atsG"

/protein-id="CAD78636.1"

/db-xref="GI:32445905"

/translation="MWQAAIRRFQDARVPMMVAL

LLTAMSLFQTVEAGDRPNILWLSA 

EDISPHIGCYGDPHAITPRIDQLASEGIRYSNAF 

TTAGVCAPCRSGI ITGMYQTTLGT 

QHMRCQAKLPES IRPFSTYLRESGYFCTNNSKQD 

YQFETPKGSWDQSSSKAHWRDRPD 

QDTPFFSVFNFTGCHESGIENDAKYKSVTKGLSE 

RERQDASKLSTFPPYYPDTPAARE 

DWKRNYELITALDHWVGGLLDQLNADGLDEDTIV 

FFWSDHGVGLPRAKRWLYDSGTHI 

PLVIRMPAQFRSTDNVAGWDDRLVSS IDFGPTV 

LNLAGLDVPEPMQGKPFLTSPLSK 

LAAADRDYVYGARDRMDERYDIIRMVRDQRYKYI 

RNYEPLKPYFQYMNTPEKGQTMRS 

IREAEQAGTLPEAAMPFFRGTKPTEELYDLENDS 

HEIHNLASSSDHTEVLHRMRAAHE 

QWVTRTKDLGLIPEPILAERASELGSQYAVLRQS 

DDSELANRVAAAALAASEGPRALP 

EMRSALDDIDSAVRYWGATGIGNVFARGEIDKLP 

YLMDLQERLADESVTVRVAAARAL 

CHSGDPEATSKALSVLAEALADGAQWERLQAAIV 

LDEIDEKALPVIDSMQDALEPRQE 



complement (243479. 
53) 

complement (243479 . 
53) 



244767 
244767 



.245624 
,245624 



transporters 



245160 2451 



.2447 /gene="mntH"

/locus-tag="RB8430"
.2447 /gene="mntH"

/locus-tag="RB8430"
/function="metal ion
(Cu(2+), Fe(2+), etc.)"
/note="PMID: 10844693 PMID:
10712688 best DB hits: BLAST:
pir:T44945; hypothetical protein
[imported] - Natronobacterium;
E=1e-48 swissprot:Q9RPF3;
MNT1-PSEAE MANGANESE TRANSPORT
PROTEIN MNTH1; E=1e-18 pir:B83105
probable transport protein PA4334
[imported] -; E=1e-18 COG: PA4334
COG1914 Mn2+ and Fe2+ transporters
of the NRAMP family; E=1e-19 PFAM:
PF01566; Natural
resistance-associated macrop;
E=0.00026"
/codon-start=1
/transl-table=11
/product="Manganese transport
protein mntH1"
/protein-id="CAD78637.1"
/db-xref="GI:32445906"
/translation="MSFATLAGMKLPKFFRRFGP
GLLVTAAFIGPGTVTKATTAGANF 
GHTLLWAIGFSVIATIVFQEMASRLGIVTGRGLG 
EAIRPT i PNALARGLAI VLWSAI 
I VGNAAYQAGNI AGAAVGVAAATGMQHQTLVS IV 
IGLTAWCILMIGHYRSLQRILVAL 
WTMSCVFLLTALSVPIDWRSIALGWIQPTIPDG 
GLKEVLAI IGTTWPYNLFLHATA 
SAEKWSSDKKEPVSDENIRDAIQHSRGDTILSVG 
LGGLVTAAVMATATAAFFQSNTGF 
TNLADAARQLEPLLGNHARWLFGVGLFAAGLTST 
ITAPLAAAYAAAGCFGWPIDLKDW 
RLRTVFTTVIVFGTSFAASGSKPTDI ITFAQIAN 
GLLLPLLAI FLLAVMNNAALLGKH 
RNHWIANTLGVLTWWSVLGLRSLVSVSFAG" 
/locus-tag="RB8432"
/locus-tag="RB8432"
/note="best DB hits: BLAST:
pir:F70844; probable fusion
protein - Mycobacterium
tuberculosis; E=1e-07
embl:CAC04235.1; (AL391515)
conserved hypothetical protein;
E=2e-04 pir:E83611; conserved
hypothetical protein PA0269
[imported] -; E=0.020 COG:
RV3327-2; COG2128 Uncharacterized
ACR; E=1e-08"
/codon-start=1
/transl-table=11

/product="probable fusion protein"

/protein-id="CAD78638.1"

/db-xref="GI:32445907"

/translation="MNSHHSSSGYPVMPSRSELV

HTSATTIANNDDSNANQRVWNFCA 

AALILRPPITHPSEVLMTTGLKPLTDAEAREVEL 

VFRLTKSHLGFVPNSMRTMARQPA 

ILSSFTLMVGNILGQPSDAKSPIWLGIRLVIKNV 

IWSLRNMRSKDRLPLALKNLVAHV 

TSGAAGCRYCQAHTIGEARDQGVPIEKLEAVWEF 

DRSDLFDEAEKSALRFALAAGSHP 

NGVTADHFADLRKHYTEDQIVELGATIALFGFLN 

RWNDTFATTLEPESAAFANQHLSA 

SGWEIGKHG" 

/locus-tag="RB8432" 

/note="cosmid pircos-a3c02/ cosmid 

pircos-cle07 joining point" 



CDS 245660..245869 /locus-tag="RB8434"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78639.1"
/db-xref="GI:32445908"
/translation="MGSADSMERTSKAGACDSPP
WKHSRDFCPFRLHYFNRKWIAAM
RPRLADSLPTQPFGFLECVPSLACY"
gene 245842..248907 /gene="cti"

/locus-tag="RB8437"
CDS 245842..248907 /gene="cti"

/locus-tag="RB8437"
/note="PMID: 98027367 best DB
hits: BLAST: embl:CAA04438.1;

(AJ000978) cistrans isomerase

[Pseudomonas; E=0.030
gb:AAD41252.1; AF110738-2

(AF110738) Cti [Pseudomonas
putida]; E=0.030 gb:AAD41255.1;
AF110739-2 (AF110739) Cti

[Pseudomonas putida]; E=0.030"

/codon-start=1

/transl-table=11

/product="similar to cis/trans

isomerase"

/protein-id="CAD78640.1"
/db-xref="GI:32445909"
/translation="MRSLLGLLLTIAFAVGFVER
WSAEPSDTAQITVPATNRPLSFN 
RDIRPILTDRCFACHGLDANTVEAGLRLDVREAA 
LEGGAIVPGDAESSEIFLRITSDD 
DDLVMPPSELHKPLSDAEIDLVRRWIQEGAPYEP 
HWAYTPLPRNGVPSSVNTTWGENW 
IDQFVARRLQDASVTPSPQADPTTLIRRLSFDLT 
GLPPTPEEVEAFKQDNSNESYERL 
IDRLLSSSRFGERMAIYWLDLVRYADTVGYHGDQ 
NVSHSPYRDYVIDAFNSNMPYDQF 
VREQLAGDLLPNPTIDQLVASGYNRLNQTTEEGG 
AQPKEYLS I YFADRVRNVSQVFMG 
ATVGCAQCHDHKYDPYTAKDFYSLGAFFADIEER 
GKYSQRERPPQIPVPNEEQQAQLD 
ALRLKVAEAEANLES ITQKQLAFLPEWEAQAIES 
INQSKFRTEEHTWVDDQLAAGGSK 
SGEWNHVTADQHPVHSGETSRRQTGKGWQHLFT 
NAKEPIEVKAGTELFAWVYLDPQS 
PPKTIMLQWHGGPDNSSGWDHRAFWGDDKVGFGA 
KPQSWHGHKRMGDLPETGKWIRLS 
VAADDVGFAPGKTVHGMAFTQFDGIVHWDQAGWN 
ETDGFPSEIRAALLiADVKARTKNQ 
IKQLRDYFVEHEPLITQQAKELKRARTQVSEFEN 
SIPTMVISRAVAPREIRILDRGNW 
MDESGE I VQPAI PEFLGTLQTSNRRATRLDLANW 
LCDSENVLTARTMVNRLWYLMFGR 
GICSSVDDLGGQGTFPSHPELLDHLAVEFIESGW 
D I QHLLKQ I AMS AT YRQASNGS PE 
LNALDPFNELFARQGRFRVSAEMVRDTALMI SGL 
LVEEIGGPSVKPYQPEGYYSQLNF 
PRRNYVPDQGDNQYRRGVYTHWQRTFLHPMLKAF 
DAPSREECTAIRARSNTPLQALTL 
LNDPTFIEAATMFASRIMREGGTTVDERIEWAYR 
T AVS RP I D AWAE ELRT I QTEHLR 
HYAQEPDLAKALVSVGEQSSTTALDPAEWAAWTS 
VARVILNLHETINRY " 
gene 248914..250311 /locus-tag="RB8440"

CDS 248914..250311 /locus-tag="RB8440"

/note="best DB hits: BLAST:
gb:AAK03766.1; (AE006204) unknown
[Pasteurella multocida]; E=0.72"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78641.1"
/db-xref="GI:32445910"



AGVGSVALASMLNERTSLASEGIH 

HPPRIKRVIHLCMAGGPSHLETFDYKPELARLDG 

QPMPESLTKGQPIAQLQNKALRVM 

GPQHAFAKHGKSGLEISSVLPHLSKMADDLCWK 

SMHTEQINHDPAHTFFNTGTAISG 

RPSMGSWVLYGLGAETQDLPGFIVLTSEGGGQSQ 

PISSRQWHSGFLPSRVQGVQLHAS 

GNPVHYVGNPAGVNHSQQRDIVDAVARINMHRNE 

TLQDPEIATRISAYEMAFRMQTSV 

PELTDMTDEPDHIVKMYGCTPGDGTFGSNCLMAR 

RLAERGVRFIQLYHRGWDHHGGVK 

RGTEKTGSLVDQGSAALIQDLKQRGMLDDTLIVW 

GGEFGRTPMAQGDGRDHHIKGFS I 

CLAGGGIKGGTSYGATDEFGYNAVKDPVHVRDFH 

ATMLHMLGIDHNRFTYKFQGLDFK 

LTGVEEAHWDGILS " 

complement(250315..2530 /locus-tag="RB8441"

38)

complement(250315..2530 /locus-tag="RB8441"
38)

/note="PMID: 87307953 best DB
hits: BLAST: pir:139942;
penicillinase antirepressor penJ -
Bacillus; E=1e-07
swissprot:P12287; BLAR-BACLI

REGULATORY PROTEIN BLAR1;

E=2e-07"
/codon-start=1
/transl-table=11

/product="probable penicillinase

antirepressor penJ"

/protein-id="CAD78642.1"

/db-xref="GI:32445911"

/translation="MIWAALVISSIVSLWMSPA

MIAGRKRSHFSATWNDLWRMGIV 

LMLILPIALTLRPSSMALWGFAPSSLGTNLPSAI 

SADDVSVATGLNSEFSTAMAGGDV 

SDSTNVDRSFVGKDRSMPVSALNSAALESSPEMF 

VRPWAI AET I SNLVMAVGWVWFAI 

ASALLIRMLIGYVKVWSMVRAATEVPDPKAMFQL 

IRLPMKTTLSRARCFHSRDVEVPC 

WPCWKPTILLPHDFERWSSDEQTAILQHELCHL 

LRHDVGWQFLANLLWLYWFHPLA 

HHARRQLRRTAEFAADDAVLNAGFSADNYAENLI 

RIARGLTAHSTVAHLSAAVSEHSV 

RERVFEVLDLDKSRNPTNPRRRVIGWS I S AATLL 

LLFSFAPSFATDPTEEPNWSQRIA 

MIRQAHPWIRLSDSFADEIERRQKIHRGERIDVR 

GQVLAKDQTPVAGALVLLRQRTNT 

TWMRGFQSYQQATFAYSWTDQNGAYDLRSIPTPW 

RGWNQSTAWETLILSPDHCVGTRV 

HPPGDLKFDDKTILLEPYQTISARFVDANNQPLP 

MQLNLAVLVDGRETSQSFNSLDFY 

DMNLMGSELPGLMQANEAGEVTIGGLPPNRLAVF 

NCFSPTNELFSFASQSTTMLRIAT 

SPAGVRDPLIPLSNSRGSGEVHPFVENGWMQTN 

ETIEQLGGLREERLQLVPEVFGAT 

DRRWGMQSDWVRGRSKDDITVDSRDYRWKVKW 

DAETKRPLPSVYVDWCDERRRPIR 

GPEDLDGHFPHTLTDDEGHAELWPIEDVKVFAA 

GRYLGYQTAYDRLHANPAEPTSNP 

PLEQWVRSLTASTEDNEVTFELQPVTPLTVRCMD 

TNGDPVSADIAATDLAGSVNGFWK 

SATNQSGIASVPLLPTFESVRIDVQAGDLAPQSR 

TVDIGPENSIIEFILAP" 

complement(253050..2534 /locus-tag="RB8445"

75)

complement(253050..2534 /locus-tag="RB8445"
75)

/note="best DB hits: BLAST:
pir:S74550; hypothetical protein
slr0240 - Synechocystis sp.
(strain; E=2e-07 swissprot:Q47839;
COPY-ENTHR COPAB ATPASES
METAL-FIST TYPE; E=0.047



HYPOTHETICAL 16.5 KD PROTEIN IN
PTSI; E=0.13"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78643.1"
/db-xref="GI:32445912"
/translation="MPDPTMISLGKRQRQIYETV
HRLGNASVTSVRDGLEDPPSYSTV 
RAILNRLVDQGWLLSEQDGPRLVYRCTQKQVAAK 
RSMAKQVLQNFFQGS PAEALAAFL 
DASERPLTEDEVDQMIEEVQRARTQI SKKSQAKP 
KRPTS " 

/locus-tag="RB8448" 

/locus-tag="RB8448"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78644.1"

/db-xref="GI:32445913"

/translation= "MIVHFGGADARDKPVRFPTL 

DRFPRSKTQVIRDHRNTDSTSRTF 

GVFASAMDNSHSRAPVLYTNQPCLACHSACHVAA 

NMKWVHGSRSNPRGRERFVSRSTH 

WFDVHEHGLHAPGLHAFRFPSSVCSRHFEC " 

/locus-tag="RB8450"

/locus-tag="RB8450"

/note="best DB hits: BLAST:

pir:S75290; hypothetical protein

S111289 - Synechocystis sp.

(strain; E=0.007 pir:G70687;
hypothetical protein Rv1732c -
Mycobacterium; E=0.037
ddbj:BAB05296.1; (AP001512)
cytochrome c biogenesis

(thioredoxin); E=0.53 COG:
S111289; COG0526 Thiol-disulfide
isomerase and thioredoxins;
E=7e-04"
/codon-start=1
/transl-table=11

/product="conserved hypothetical

protein"

/protein-id="CAD78645.1"

/db-xref="GI:32445914"

/translation="MVPDQIQGAAKGSFRGRRIG

LMFTNMVCMLLVCMLFASHRAFAQ 

DTSNADGLKGLDLDGRLIRLGQSNETKAVAWFL 

STHCPISNGYLPLLNGYASEYGTQ 

GIELIGVISDPSVTRSDAKKHSVDYRVEFPVLFD 

GSGELRLALSPTHTPQAMLIRPTG 

EMLYNGAIDDRHVQVGRKKDHAANS YLRDAI RAV 

I ARRAVDVPRTKP I GCLLED P PNK 

AIEGEVTFARDVAPVIQANCAGCHCQNGSAPFPL 

LTYNDVSGHANQILEVTHSRFMPP 

WKPAAGFTRFQDELRLTNHEMSLLNVWVRDGKPA 

GDPKDLPAPLPPSNGWPLGEPDMI 

LEMSETFS I PSSGPDVRQYFVI PTGLKEHRLVNA 

IDFHPGTPQSVHHASFFLDTQRKG 

RRLDQSDPAPGYSSFGGPRFEPEGTLSSWFPGMS 

PRPLPEGMGRLVPKGSDIVAEIHY 

VTTGKPHQDKSKIGLYFAPPSARKMVTEIQVGNK 

RIRIPPGAEHHLQRATYTLPVDTV 

LLDTVPHMHLFGREMKVWSKSPDGETKPLLWIKD 

WDFNWQGKYSFAEPVRLKKGTI IH 

VDAWYDNSANNPLNPNSPPKPVSWGADSTDEMLI 

CHFQCTCETMGELNELIRHQKKHI ADRQ" 

/gene="nadD"

/locus-tag="RB8455"

/gene="nadD"

/locus-tag="RB8455"

/EC-number="2.7.7.18"

/function="biosynthesis of

vitamins, cofactors, and



gene 
CDS 



256483. .257115 
256483. .257115 



gene 
CDS 



complement (257117 
58) 

complement (257117 
58) 



gene 
CDS 



257496. .257663 
257496. .257663 



/note="PMID: 10894752 best DB
hits: BLAST: swissprot:O66452;
NADD-AQUAE PROBABLE
NICOTINATE-NUCLEOTIDE; E=2e-23
swissprot:P52085; NADD-ECOLI
NICOTINATE-NUCLEOTIDE; E=7e-23
gb:AAG54973.1; AE005243-2
(AE005243) orf, hypothetical
protein; E=1e-22 COG: aq-036;
COG1057 Nicotinic acid
mononucleotide

adenylyltransferase; E=2e-24"
/codon-start=1
/transl-table=11
/product="probable
nicotinate-nucleotide
adenylyltransferase"
/protein-id="CAD78646.1"
/db-xref="GI:32445915"
/translation="MSASETTPQSNHGIGILGGS
FDPVHVGHLWMAESALEQLPIEHV 
RWI PAATSPLKPHGPVASNEHRLQMLRLALSGQS 
GLVIDDWELRQDSVSYTLLTLEYL 
QEQFPDRPLYLI IGADSLASFDRWREPEQILKRC 
HLAVIARGGDPPPDYSILDGMTDE 
TQIQRIRESQIQMPQIEISSSDLRNRIATGRSIR 
FRVPHPVATL IDNEKMYRVR " 
/locus-tag="RB8456"
/locus-tag="RB8456"
/note="best DB hits: BLAST:
swissprot:P72873; Y925-SYNY3
HYPOTHETICAL 23.0 KD PROTEIN
S110925; E=3e-25 gb:AAC26156.1;
(AF035751) hypothetical protein
[Synechococcus sp.; E=2e-24
swissprot:P72821; YC90-SYNY3
HYPOTHETICAL 21.6 KD PROTEIN
SLR1290; E=5e-24"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAD78647.1"
/db-xref="GI:32445916"
/translation="MPAIPRVSKLRSAMSTNPIP
LLSEQEYLDQERKASFKSEYFGGE 
TFAMGGAKRAHNLIVTNLIRVLSNALLDGPCEVY 
PCDMRVQIEATGLYTYPDVWACD 
EPKFRDDQLDTLQNPTLLIEVASESTEAYDRGTK 
SRHYRRI PSLKGYVLVAQDRPAVE 
SFTRGDDHTWTLREAGGLENELRIDALKLNLSMT 
EVYHRVQFDRPDPLVK" 
.2574 /locus-tag="RB8457" 

,2574 /locus-tag="RB8457" 

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78648.1"

/db-xref="GI:32445917"

/translation="MRRVRLRTRERLRARLRVLI

GRGGMKLFENIEGEWTDSAGKPTS 

PARQWCPVNAKRSRSRATQCAEIGVSTQSLLPIL 

PDKAFLGRSVSAQDAEPEPGCSVQ 

SRLAGEGRIEL" 

/gene="rpsR"

/locus-tag="RB8459"

/gene="rpsR"

/locus-tag="RB8459"

/function="ribosomal proteins"

/note="PMID: 6806564 best DB hits:

BLAST: swissprot:P21475;

RS18-BACSU 30S RIBOSOMAL PROTEIN

S18 (BS21); E=1e-08 pir:S66013;



Bacillus subtilis; E=1e-08

pir:R3BS18; ribosomal protein S18

- Bacillus stearothermophilus;

E=1e-08 COG: BS-rpsR; COG0238

Ribosomal protein S18; E=1e-09

PFAM: PF01084; Ribosomal protein

S18; E=2e-16"

/codon-start=1

/transl-table=11

/product="30S ribosomal protein

S18"

/protein-id="CAD78649.1"

/db-xref="GI:32445918"

/translation="MYVDYKDLELLSKMVNRQGR

IMGRRKSGCTAASQHAVTAAI KRA 

RFMALLPYVGE" 

257681..258142 /locus-tag="RB8462"

257681..258142 /locus-tag="RB8462"

/EC-number="3.1.2.23"
/note="PMID: 9837940 best DB hits:
BLAST: ddbj:BAB05718.1; (AP001514)
4-hydroxybenzoyl-CoA thioesterase;
E=3e-07 swissprot:O67466;
YE94-AQUAE HYPOTHETICAL PROTEIN

AQ-1494; E=6e-06 pir:B83525;

conserved hypothetical protein
PA0968 [imported] -; E=7e-06 COG:
BH1999; COG0824 Predicted
thioesterase; E=3e-08"
/codon-start=1
/transl-table=11
/product="probable

4-hydroxybenzoyl-CoA thioesterase"

/protein-id="CAD78650.1"

/db-xref="GI:32445919"

/translation="MNDSASDGAPTQRAPVYQTQ

RRVEFRDTDAAGIVHFSAFFPMME 

AAEHEFLRSVGIPVMPHHESEERLTWPRVAASSD 

FHGPARFED I LQ IDLHVDRIGQS S 

ISYRFELTCEGRRIATGKITAVCCDLKAGGKLVK 

TNVPDDLRQRLSGPQND " 

complement(258130..2586 /locus-tag="RB8463"

75)

complement(258130..2586 /locus-tag="RB8463"
75)

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78651.1"

/db-xref="GI:32445920"

/translation="MRRLDWPRSRSPSFRLRRLS

FFGKFGHSNLLQDRVEDLFGGFEI 

RHFGLELSQDASRDGHLFASRGRGVQQHHARHHD 

EVQWEPEVNGAFAFRGTRQCDW 

TFQVFVLASEHLVDFTEDDFVFSGDRQRFQRHKS 

AAGQIERGSLVAFVFGSFDQLPW 

RVQQFQAEAFESCGHVFVQS F " 

258151..258621 /locus-tag="RB8464"

258151..258621 /locus-tag="RB8464"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78652.1"
/db-xref="GI:32445921"
/translation="MPATFEGFGLKLLYPDNWKL
VERAEDEGDQGATFDLPGGGFVSL 
ETLPITREDEWLGEIDQMLRGQYEDLERDNVTL 
PGATEGERAVDLRFYYLDLWMSR 
WLLDAPAPTREQMPIAGRILAQLQAEMSDFEAA 
EQVFNAILQQIRMAELPEEA" 

complement(258682..2592 /gene="fur"

36)

/locus-tag="RB8465"
complement(258682..2592 /gene="fur"
36)



complement (259533 .
99)

complement (259533 .
99)

/function="transcriptional
control"

/note="PMID: 8083178 PMID: 7590316
PMID: 9765558 best DB hits: BLAST:
pir:B70423; transcription
regulator FurR family - Aquifex
aeolicus; E=2e-15
swissprot:P48796; FUR-CAMJE FERRIC
UPTAKE REGULATION PROTEIN; E=7e-13
swissprot:Q51008; FUR-NEIGO FERRIC
UPTAKE REGULATION PROTEIN; E=8e-13
COG: aq-1418; COG0735 Fe2+/Zn2+
uptake regulation proteins;
E=2e-16 PFAM: PF01475; Ferric
uptake regulator family;
E=1.5e-22"
/codon-start=1
/transl-table=11

/product="probable transcription
regulator FurR family"
/protein-id="CAD78653.1"
/db-xref="GI:32445922"
/translation="MACKHFFSLSQRSVLSDVSS
VPESLERLEVALTPQERFEEYLQS 
KGQRQTKPRKFLVEKIFSQHAHFDADELIEKLPR 
KGEPNYVSSATVYRSLREFVDAGL 
LNCFQLDGRTVYELDYGYPPHDHLYCTRCRKLIE 
FRSEELIATRDEAAAKHGFRVSGH 
RMLVSGVCRECGKNRRKKRKQDLV " 
,2597 /gene="csyQ" 



2597 



/locus-tag="RB8468"
/gene="csyQ"



259779. .260975 
259779. .260975 



/locus-tag="RB8468" 
/note="best DB hits: BLAST:
swissprot:P46726; CYSQ-MYCLE CYSQ

PROTEIN HOMOLOG pir:;

E=0.66"

/codon-start=1
/transl-table=11

/product="similar to CysQ protein
homolog"

/protein-id="CAD78654.1"
/db-xref="GI:32445923"
/translation= "MPPIGDQGQLRNEIQSKLRS 
PVASSEGSDGTLADFFFAVIEPRP 
GRNNVLTLGRSQWRLRHLAVWQRCAFGVGDSARK 
RPANTLPPRP" 
/gene="odc"
/locus-tag="RB8469"
/gene="odc"
/locus-tag="RB8469"
/EC-number="4.1.1.17"
/function="amino-acid metabolism;
urea cycle, biosynthesis of
polyamines and creatine"
/note="PMID: 20528325 best DB
hits: BLAST: ddbj:BAA24923.1;
(AB011029) lysineornithine
decarboxylase; E=5e-59
ddbj:BAA83427.1; (AB031066)
ornithine decarboxylase
[Nicotiana; E=1e-46 pir:T03035;
ornithine decarboxylase (EC
4.1.1.17) - common tobacco;
E=1e-46 COG: aq-728; COG0019
Diaminopimelate decarboxylase;
E=8e-44 PFAM: PF02784;
Pyridoxal-dependent decarboxy;
E=4e-55 PF00278;
Pyridoxal-dependent decarboxy;
E=1.8e-17"
/codon-start=1
/transl-table=11



decarboxylase"
/protein-id="CAD78655.1"
/db-xref="GI:32445924"
/translation="MVPNWRHPSRMIASTILRAP
NARTAQPNLS FDLAKELAEKHGTP 
LL WS RS KAI ET YWAMKNALPGVDLYYAAKANPD 
LHFLS TLNGENAF IDVCS PGELKA 
ALAAGFTPDRMLHTHPCKTDANLWECYEAGVNWF 
VFDNP I EAEKI RRLTPDVNLLLRL 
ATTGASSRINLSAKFGCPMHEALELLATAKAKGL 
KVRGFSFHVGSQCLNPQDYVEVLR 
S VREVWDEATKAGHHLEVLD I GGGFPAP YREDAP 
SIEAFGEVITNGLREIFGDLPIRL 
IAEPGRGLCTESVTLITRVIGKSRRWELPWFFLD
DGIYGSFSGKIFDHTDFPLLVEND 
GSRETVPCWAGPTCDSTDIVSRDQWLPDLEVGE 
LVLVPSMGAYAAASASPFNGLPMA NSVAID" 
/gene="cht1"
/locus-tag="RB8472"
/gene="cht1"
/locus-tag="RB8472"
/function="ion transporters;
transport facilitation"
/note="best DB hits: BLAST:
gb:AAG36945.1; (AF276872) sodium
and chloride-dependent; E=8e-26
embl:CAC03719.1; (AJ401467) high
affinity choline transporter;
E=1e-25 ddbj:BAA90484.1;
(AB030947) high-affinity choline
transporter; E=2e-25 COG: PA1418;
COG0591 Na+/proline,
Na+/panthothenate symporters and;
E=7e-20 panF; COG0591 Na+/proline,
Na+/panthothenate symporters and
related; E=3e-10 PFAM: PF00474;
Sodium:solute symporter family;
E=4.5e-05"
/codon-start=1
/transl-table=11
/product="high affinity choline
transporter"

/protein-id="CAD78656.1"

/db-xref="GI:32445925"

/translation="MGLIAAVLAYLLLTIAIGLL

AARRVGNAQDFMVAGRSLPLYMNF 

ACVFATWFGAETVLSVSATFAGQGLRAIPGDPFG 

FS ICLVLVALFFARAFYRMDLLTI 

GDFYRKRYGRSIEVLTSWISASYLGWAAAQLTA 

LGLVISVLGKGIGYETLTINNGIV 

IGFTIVAFYTVMGGMWSVALTDMIQTFVI I IGLL 

WSVYMT^ilAAGGVSWIESARESG 

RLQVFPDWGQSGQWWIYIGGFLTAALGSIPQQDV 

FQRVTSAKDERTAMTGTLLGGMFY 

CMFAFVPMFIAYAAWIDPDHLQQFNSDDLREVQ 

RTLPHAVIQSTPFWVQTVFLGALV 

SAILSTASGTLLAPSSLIVENVIRPFRSDLDDKN 

MLRWLRIVLLMFGALALHQALTSN 

NTMYEMIQQAYSVPLVGALVPLAVGLYWKRATTR 

GAMAS I VSGVATWLAFEYMLPEFL 

I PSQLMGLAAS FLAMVWSLLDKFENIRPDAVQR 

II 

/locus-tag="RB8473"
/locus-tag="RB8473"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78657.1"
/db-xref="GI:32445926"
/translation="MKTFGLMLCSVDLLIACVRH
FCERVHDSLCSKASAGTSGTT"
2630 /locus-tag="RB8474" 

2630 /locus-tag="RB8474"



/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78658.1"

/db-xref="GI:32445927"

/translation="MIKTFGQAYKFVLKSKVCTV

FGSKNSPYPSLWDNTDLSEDKPKA 

GGWSPKVTAVWDWKTRI PQTYPAEVFYGKVRGGD 

AVLMEMQHFREVHYAEAYQPVHEL 

DVLCQE I FEL I RLEADYTGPLRKRAI ERLACTKS 

QFDTALKKLQ I SLNWRSNDPKMK 

NDFWLPMREVHLDIVQQHER" 

complement(263113..2633 /gene="ppiC"

94)

/locus-tag="RB8476"
/note="synonym: parvA"

complement(263113..2633 /gene="ppiC"

94)

/locus-tag="RB8476"
/EC-number="5.2.1.8"
/function="protein folding and
stabilization"

/note="PMID: 7925971 PMID: 8163020
best DB hits: BLAST: pir:C83123;
peptidyl-prolyl cis-trans
isomerase C2 PA4176; E=5e-28
gb:AAG33124.1; AF247689-1
(AF247689) putative parvulin;
E=1e-22 gb:AAG33121.1; AF247686-1
(AF247686) putative parvulin;
E=1e-22 COG: PA4176; COG0760
Parvulin-like peptidyl-prolyl
isomerase; E=6e-29 PFAM: PF00639;
PPIC-type PPIASE domain.;
E=3.1e-24"
/codon-start=1
/transl-table=11
/product="peptidyl-prolyl
cis-trans isomerase C2"
/protein-id="CAD78659.1"
/db-xref="GI:32445928"
/translation="MATASARHILVETEEACQDL
KDQIEKGQDFGAIAAEFSSCPSGK 
SGGALGTFSPGQMVKEFDEWFS6ELGKYHGPVK 
TQFGYHLIEITQRVD " 
complement(263334..2637 /locus-tag="RB8478"
59)

complement(263334..2637 /locus-tag="RB8478"
59)

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAD78660.1"

/db-xref="GI:32445929"

/translation="MGRDFIEPATLSLRWMVSRH

SNLSHIETMLQAGGKQQRGTPLYQ 

AAAMSRWPLGPVASWKLVRTHFGEDSWRQTVRP 

ERRLQQGFSNCSGQATMVAEAGER 

LSTSAGPLTTPRRJSTTTQETSWPLQVHDIFSLKPK 

KPVRI" 

263840..265150 /locus-tag="RB8479"

263840..265150 /locus-tag="RB8479"

/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78661.1"
/db-xref="GI:32445930"
/translation="MISRLVFQPLLISLVTLLTS 
QGLSSAKEWSVPVAGNAFRTQPEP 
GGRSMRRGGSLRLRDSSEVHSVYFHVDDAHTLEL 
GLKGNASDHPATILATVEETKLSC 
DLQPGSAKTYPLGEIQVENAGYVRVDLQLAPAAG 
GNPSGSTQITDLWTSDDADLQLD 
YVQTNDGNMFYWGRRGPSVHLSYRLPRGTDLTYA 



GYFGMQVNSPTERKVLFSIWSPFRTDNPNEIPDE 
DRVETLAKGDGVIAKDFGNEGSGG 
QSFFVYPWEAGKTYRFLTEVKPDGDGNTLYTSWF 
GDKAKNEWRLVATFRRPKTDKHLT 
GFHSFLENFSAEQGHLERSAHYGNQWVCDVNGKW 
HEITRAKFTGDNTARGRHRLDYAG 
GSQDEHFFLKNCGFFSETVDLDQTFERKSTAGQQ 
PEINWDQLPR" 
/gene="psd"
/locus-tag="RB8482"
/gene="psd"
/locus-tag="RB8482"
/EC-number="4.1.1.65"
/function="phospholipid
biosynthesis; biosynthesis of
secondary products derived from
glycine, L-serine and L-alanine"
/note="PMID: 3042771 PMID: 2406271
PMID: 1495415 best DB hits: BLAST:
pir:F72029; phosphatidylserine
decarboxylase (EC 4.1.1.65)
precursor; E=3e-65 pir:D81745;
probable phosphatidylserine
decarboxylase (EC 4.1.1.65);
E=7e-65 pir:E71482;
phosphatidylserine decarboxylase
(EC 4.1.1.65) precursor; E=9e-61
COG: CPn0839; COG0688
Phosphatidylserine decarboxylase;
E=3e-66 PFAM: PF02666;
Phosphatidylserine decarboxyla;
E=1.9e-61"
/codon-start=1
/transl-table=11
/product="phosphatidylserine
decarboxylase precursor"
/protein-id="CAD78662.1"
/db-xref="GI:32445931"
/translation="MRPDSRPRPLPFKQESLMDE
IVYHDRYTGEDRIEKVYGDKALRW 
TYGTIAGRLSLSLWKRTLFSHWYGWRMDQPKTR 
EKIAPFIKEYELDSDEFVRDVDEF 
ANFNEFFFRKLKPEARPIDAAADSWFPADGRHL 
CIPDLSKCEGLFVKGEMFDLPTLL 
QDSALADRYASGSLLLSRLCPVDYHRFHFPAAGV 
PGESRLINGPLYSVNPIALRQNIH 
ILTSNKRCLTQLETESFGTVLLLEIGATCVGSIQ 
QSYSPGETISKGDEKGYFRFGGSS 
TMVLFEPGRIQFDADLIENSRQHRELYARMGDHL 
GNRPANSI" 

/locus-tag="RB8484"
/locus-tag="RB8484"
/note="best DB hits: BLAST:
pir:T35585; hypothetical protein
SC6G4.34 - Streptomyces
coelicolor; E=4e-08
embl:CAB88460.1; (AL353815)
hypothetical protein SCD6.31c;
E=8e-05 pir:C83475; hypothetical
protein PA1354 [imported] -
Pseudomonas; E=0.002"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"
/protein-id="CAD78663.1"
/db-xref="GI:32445932"
/translation="MFVYRGGCDEMENASPDEMQ

KVMQRWMDWIQDGMKAGWLLDGGD 

GLKPEGATWPDLSVTDGPFTESKELVGGYSMVQ 

AENLAAAVELAKTSPIIHHKGAIE 

VRELANVGQPEEA" 

/locus-tag="RB8485" 

/locus-tag="RB8485"
/function="transcriptional



gene 
CDS 



complement (267983 
58) 

complement (267983 
58) 



gene 
CDS 



269305. .269841 
269305. .269841 



gene 
CDS 



complement (269812 . 
47) 

complement (269812 . 
47) 



/note="best DB hits: BLAST: 
pir:T35586; hypothetical protein 
SC6G4.35 SC6G4.35 - Streptomyces ; 
E=2e-47 pir:G83477; probable 
sigma-70 factor, ECF subfamily 
PA1351; E=8e-45 swissprot:Q52997;
YDNY-RHIME HYPOTHETICAL 47.0 KDA
PROTEIN IN DNAA; E=1e-35 COG:
PA1351-1; COG1595 DNA-directed RNA 
polymerase specialized sigma; 
E=2e-17" 
/codon-start=1
/transl-table=11
/product="probable sigma-70
factor, ECF subfamily"
/protein-id="CAD78664.1"
/db-xref="GI:32445933"
/translation="MTDSSGHLVEHFFRHESAHL 
IAVLTRAFGIRRINLIEDMVQVAM
LEAMNAWKQGGVPDNPTAWIHRATKNRILDALRR 
EKTLAQAISLAGPSLDVQEDTVDH 
WLSEEQLPDSLLRMMFVCCHPTLNRETQIALTLK 
TLCGFGYAEIAAGLLLPLETIKKR 
IARAKRSLCEANVAVELPSPDEIAQRRGWHDVL
YLMFNEGHNTSHGSQPIRDDICEE 
AARLCHLLSESEHGSRATKALLALMLFQAARLES 
RTDEQGNVILLEDQDRSKWDRRLI
TIAQHELAGSKIDSPTTYHWEAAIAMQHCIAPSV 
EATDWSTIVRFYNRLLEIHRSPIY 
ELNRAIARAQAGQINEALKELQTLQQSGAMQTYL 
LLDCAIARVHELGNHTSQAIDAYL 
IALSKAKADHEKQLVETKLRKLQQA"
2691 /locus-tag="RB8487" 

2691 /locus-tag="RB8487" 

/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide prediction"
/protein-id="CAD78665.1"
/db-xref="GI:32445934"
/translation="MRRKYFGLAMAAIATLGPAQ
AFGGDREIAQQVMQRLKVSRDAGQ
LKNFNLDMKVNDGWLFRGTVDGSEQQDLVLAAL 
KDVDGWNWNELEVLEAKIYKPK 
AAAIAAVEPAEAFDFKGALETEAQEWPGTVSPV 
AGEAAADSQTRTVAAWENAGGEAN 
SDEAITQSVAAALGKAQSVGHLKDFGVDVNAYDG 
IVEITGEAASASQRELIAEIARHA 
PGARGVRDLMSVKAGSNTGLVATPASHRTGGLQP 
LPPANRGQVQARPVSYGQAGGQMI 
NGEMWPGSMVTHGAGVPAAPAPMAHAPVMGQPV 
PMAPAAPVGAPRYDSPNLPNYAWP
GYAAYPNYAALTYPQQYSPSAFPYIGPFYPYPQV 
PLGWRKVSLEWDDGWWFLDFTDK" 
/locus-tag="RB8490" 
/locus-tag="RB8490"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78666.1"
/db-xref="GI:32445935"
/translation="MWINRRAPRSTAKEKPRRKR
AQFKFNRPRKIACSSRHCQSESPS 
SRRTWGIRRDTRFAGRCGKCDHRFGNIGTECLRT 
AVGFPAFRHVPKPNRTRWTDRVRR 
AQHRDVCQRLGAEWDRPWLSPQDRARNGRERGVS 
CLLPEQAEILASCWVEKPARNRLV 
HKPLRQPTGSGARFARGF" 
.2704 /gene="sigW" 

/locus-tag="RB8492" 
.2704 /gene="sigW" 



/function="transcriptional
control; transcription"
/note="PMID: 8432708 best DB hits:
BLAST: swissprot:Q45585;
SIGW-BACSU RNA POLYMERASE SIGMA
FACTOR SIGW; E=3e-23
gb:AAF18265.1; U22895-1 (U22895)
alternative sigma factor AlgU;
E=6e-18 ddbj:BAB03982.1;
(AP001507) RNA polymerase ECF-type
sigma factor; E=2e-17 COG:
BS-sigW; COG1595 DNA-directed RNA
polymerase specialized sigma;
E=3e-24 PFAM: PF00776; Sigma-70
factor (ECF subfamily); E=9.9e-15"
/codon-start=1
/transl-table=11
/product="RNA polymerase sigma
factor sigW"

/protein-id="CAD78667.1"
/db-xref="GI:32445936"
/translation="MDEASSNRNERTAANLAPTD

EGALIEAALSGDASAFEGLVIRHQ 

DRLHHAMIHVTGSVHDAEEVTQEAFIRAFVKLDT 

FQQNSQFFTWLYRVAFNIALSRKR 

RQKVRLSLDQQREEIGEEWCGGEAVDANMIRQD 

DVSLVQLALRQLSDEHRSILVLRE
MNESSYEEMAEILELSIGTVRSRLNRARKQLRLA
IEQLREPESESSGEPSS"

complement(270481..271854) /locus-tag="RB8494"
complement(270481..271854) /locus-tag="RB8494"

/EC-number="2.1.1.-"
/EC-number="3.4.99.
/function="proteolysis; protein
secretion systems"
/note="best DB hits: BLAST:
pir:A75321; prepilin peptidase,
type IV - Deinococcus radiodurans;
E=4e-16 swissprot:P72640;
LEP4-SYNY3 TYPE 4 PREPILIN-LIKE
PROTEINS LEADER; E=3e-15
ddbj:BAA83726.1; (AB017809)
prepilin peptidase [Escherichia
coli]; E=3e-14 COG: DR2065;
COG1989 Signal peptidase, cleaves
prepilin-like proteins; E=4e-17
PFAM: PF01478; Type III leader
peptidase famil; E=8.9e-28"
/codon-start=1
/transl-table=11

/product="prepilin peptidase, type
IV"
/protein-id="CAD78668.1"
/db-xref="GI:32445937"
/translation="MASRVCGHAKGHAILSRRRS

PYRILAVCLLIGYLLASAAYVWLS 

ALWISGAHPHFRWDLLMPRANDWIFTFFLWG 

ASVGSFLNVWWRLPQRLNVNGHS 

FCPRCRNRLRARDNVPIFGWLWLSGRCRDCRLPI 

SSRYPIVETLVGITFALVGATELT 

RWSLPYFGDSVRPNWMWTPIVDADLLTLILYHW 

ALATAWAMGLIRFDGKALPSRLTL 

FAAVTLIGGMFAFPQAMWPWQLVGESMPAGIGT 

WPPSGWMTTGAEWSVTLVQIGLRL 

LTSLVAAGFFARVLAKSFCPSADMKMDPLGSSTK 

RLIDLMILIAVPAIWGWQAVMGV 

ILLAAVFAKILGMCEFAQSRDSLGRFAVSLPVAT 

SVQLLLWRGLVASQVWPSEQASQV 

GLIVSFFGVLTIPLWLDENGESFPPMGDESGNGG 

VSDHFDEDVREAQASRKDPDALQQ
GDSFELN"

271893..273416 /gene="phr"
/locus-tag="RB8498"
271893..273416 /gene="phr"



/EC-number="4.1.99.3"
/function="DNA repair (direct
repair, base excision repair and
nucleotide excision repair)"
/note="PMID: 7813451 PMID: 2668276
best DB hits: BLAST:
swissprot:P12769; PHR-METTH
DEOXYRIBODIPYRIMIDINE PHOTOLYASE
(DNA; E=3e-58 gb:AAF81696.1;
AF246697-1 (AF246697)
CPD-photolyase [Fowlpox; E=1e-54
swissprot:P34205; PHR-CARAU
DEOXYRIBODIPYRIMIDINE PHOTOLYASE
(DNA; E=6e-54 COG: MTH904; COG0415
Deoxyribodipyrimidine photolyase;
E=3e-59 PFAM: PF00875; DNA
photolyase; E=0.0034"
/codon-start=1
/transl-table=11
/product="deoxyribodipyrimidine
photolyase"

/protein-id="CAD78669.1" 

/db-xref="GI:32445938"
/translation="MTSTLPPQRHQFANDHPPRH

EGDYWYWLIAQRRLHWNFALQHA

IDRALEFEKPLLIFEPLRVRYQWASDRIHRFIIE 

GMRDNAAEAESLPWYYPYVEPKP 

GVGSPLLHRLAKHACTWTDEYPCFFLPHMIDAV 

KDKLPARLELIDSNGVLPLRAAEK 

TYTVAHSYRRFMQKNILPALEQLPLANPLDRNSN 

RPALPCPDGESMAERRKKLLPSKI 

LSKWPAADLEKLLGQDGLSEIPIDHSVRPSQATP 

GGTVEATRRWQEFLNKKLSRYNDD 

RNHPDEHATTGLSPHLHFGHISAHEMVSSILEHE 

GWSADQTSKPNGKNHSFWNVGENA

EALLDQLLTWREIGFNWSFQNPDTYDSLESLPDW 

AQKTLNETRDDERPHLYTLAEFEN 

AETHDEIWNAAQRELVETGEMHNYMRMLWGKKIL 

HWTQTPEEALEFMIHLNNKYGLDG 

RDPNSYCGILWVLGRTDRAWGPKRPVFGSVRYMT 

SDSTRKKLKLTKYLQRFAGSNKK" 

complement(273503..274069) /locus-tag="RB8499"
complement(273503..274069) /locus-tag="RB8499"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78670.1"
/db-xref="GI:32445939"
/translation="MSALASADENQRRGCASSSS

VAERRHPASLGFQPKDHQRITISP 

VAERRQLWCLNHLSPLRGFSNGRNDLNLGLKPE 

ADRFHRSAWPPRPVAERRQLFAA 

RRQAIPVSRARKVSAVYRLPFHVADGGPPSANES 

QRRQSRWLTRCHVLAEGPSLAAMV 

VDTPTEDLSRLRADRFFCRNSLHSVRSL"

complement(274044..274367) /locus-tag="RB8500"
complement(274044..274367) /locus-tag="RB8500"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78671.1"
/db-xref="GI:32445940"
/translation="MRDMMENSKRLPWKSEPNLT

ETRRPIWGVFLSMSPTEALRRQTN 

LSDARAVGSLWTCLLRGHRLQRWWWIRRLRASV 

GYGGLATNDLAKEVKAMKCRQWGIGVSR"

274391..275536 /locus-tag="RB8501"
274391..275536 /locus-tag="RB8501"

/note="PMID: 99397451 best DB 
hits: BLAST: ddbj:BAB09588.1;
(AB018118) gene-id:MRI1.6~pir



gb:AAG35309.1; AF274868-1
(AF274868) sialidase NanB
[Pasteurella; E=0.050 pir:D64592;
hypothetical protein HP0580 -
Helicobacter pylori; E=0.42 PFAM:
PF02012; BNR repeat; E=0
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"
/protein-id="CAD78672.1"
/db-xref="GI:32445941"
/translation="MKVSLAVFLALTLTISIQAE

ELPDVAVAGEDALVSSQQIFDLAD 

KPTKESHASTIVETPTGLVAAWFAGTRERDPDVG 

IRVSRHENGQWTESVEWSGVQSS 

TLRYPTWNPVLFQPSEGPLMLFYKVGPNPREWWG 

MLTTSQDGGKTWSWPTKLGEAHTI 

GHLLGPVKNKPVELADGTILCPSSTEIEYADGSS 

HWRVHFEVTKDLGKTWEVIGPIQT 

GETFHAIQPSILTYPDDRLQILCRSKEKRIVESW 

SKDGGKTWSTLTATDLPNPNSGTD 

AVTLQDGRQVLIYNHSEGLVRRKDAPDLKPRRIL 

NLAISSDGKTWKPVLTLEHETGPH 

PKDSDRRRHFGEYSYPAIIQTSDGMLNMVYTYNR 

EGVKHAWDPSKL" 

complement(275479..275637) /locus-tag="RB8503"
complement(275479..275637) /locus-tag="RB8503"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78673.1"
/db-xref="GI:32445942"
/translation="MAGDCCEIVAAQQFFLIWRT

SLGESFARSKDGCAQSLLGSTTAC
LTPSRLYV"

complement(275661..275996) /locus-tag="RB8504"
complement(275661..275996) /locus-tag="RB8504"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78674.1"
/db-xref="GI:32445943"
/translation="MRTASPLTLANNANAIKAMR

EYFGRSTVANWEGKREGTSEEEPK 

LNSKPWPPQATRARSRERPSNRRRLQVADGGPPS 

ANEHLRRPKRLLTCYHVLAGGQSL 

AAIDSGFAD" 

complement(276127..277446) /locus-tag="RB8505"
complement(276127..277446) /locus-tag="RB8505"

/note="best DB hits: BLAST: 
pir:H69064; serinethreonine
protein kinase related protein -;
E=4e-04 swissprot:P77774;
YFGL-ECOLI HYPOTHETICAL 41.9 KD
PROTEIN IN; E=0.004 gb:AAG20856.1;
(AE005154) Vng6189h [Halobacterium
sp. NRC-1]; E=0.007 COG: MTH1485;
COG1520 Uncharacterized proteins
of WD40-like repeat family;
E=4e-05 PFAM: PF01011; PQQ enzyme
repeat; E=0.03"
/codon-start=1
/transl-table=11
/product="probable
serine/threonine protein kinase
related protein"
/protein-id="CAD78675.1"
/db-xref="GI:32445944"
/translation="MFLTLERASACPGSLSNFGD



PSATAGDAWTGWLGPHRNGWVADFSPPESWPEQL 

SLAWKINVGEGYGTPLVSGDRVYQ 

HARQGEDEVITCFDIiASGGEIWKRSWPVPFEIGG 

GGERHGAGPKANPVMADGRLFTAS 

INGWTAWDAESGDQLWQRDERKRFSINHPKWGA

SCSPWDGNRLLIHLGNDETGALY 

AFDVASGETIWSQGDKPISYSSPVWELHGVRQV

IDWNRDDVAGVDVETGKWLWSYAL 

KHVDPDQNSPTPVQHRDTIIIGAENRGIRSLRPT

KTDDGWIVEEAWVLEDLAMNMSTA 

VMNDGRLFGLSHYDSGRLFCINADDGSVPWQGRP 

RLGDHATFLAIPGHIITLTDSGEL 

RVIDAAAEEYQAKATYTVSDSPTWAAPVLGANYL

LIKDREHLLRWNW" 

complement(277454..277720) /locus-tag="RB8507"
complement(277454..277720) /locus-tag="RB8507"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78676.1"
/db-xref="GI:32445945"
/translation="MPESDVRPSPKTRLPSESIL

HFAIWFTFSIHHAPWHSPPNPR 

FQTPWTELGRPEVSGSSRSRVWLPKTGWLVDRAA 

WSFECSSLSM" 

277651..278046 /locus-tag="RB8509"
277651..278046 /locus-tag="RB8509"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78677.1"
/db-xref="GI:32445946"
/translation="MQNAKLIRWAVAFSGKDGRP
IPAKVIGPGGLRSHAVFGCRIAKR 
LSRCNPGIKLAPAIRASMATACRLNSLSGLPPRA 
KHDGPIRGETQTLAAIYFSNVRRE 
DEANLHDLFQRARFTRGCETRISKCRPNS"

complement(278096..279442) /gene="hom"
/locus-tag="RB8510"
complement(278096..279442) /gene="hom"
/locus-tag="RB8510"
/EC-number="1.1.1.3"
/function="biosynthesis of the
aspartate family"
/note="PMID: 1333566 best DB hits:
BLAST: pir:F69031; homoserine
dehydrogenase (EC 1.1.1.3) -
Methanobacterium; E=4e-81
gb:AAF21129.1; L78665-3 (L78665)
homoserine dehydrogenase; E=2e-80
swissprot:P29365; DHOM-PSEAE
HOMOSERINE DEHYDROGENASE (HDH);
E=1e-77 COG: MTH1232; COG0460
Homoserine dehydrogenase; E=4e-82
PFAM: PF02737; 3-hydroxyacyl-CoA
dehydrogenase; E=0.85 PF02254; KTN
NAD-binding domain; E=0.19
PF01408; Oxidoreductase family,
NAD-bind; E=0.73"
/codon-start=1
/transl-table=11
/product="homoserine
dehydrogenase"
/protein-id="CAD78678.1"
/db-xref="GI:32445947"
/translation="MNATSTKQKSSDRTNVAIIG
MGTVGAGVARLLIDHGDRTARHAG 
KTIWLSKAWRDLSKPRGIDLPEGVLTDSIDDVL 
NDPEIDWIQLMGGLSPAREVMLR
AMEAGKDIVTANKALLAEHGAELFTRARELGRSI
AFEAAVAGGIPIIANISQCLSANQ 



279385 2793 



279419. 
279419, 



,279760 
.279760 



complement (279573 
12) 

complement (279573 
12) 



279912. .280316 
279912. .280316 



280316. .281917 
280316. .281917 



ELGYAEADPAMDVDGTDAAQKLAI 
LSHLAFGATVDWRDIPRQGIDGLDPADLVYARQL
GYRIKLIiATARLVDGELELSVAPT 
LVAIGTPMAEVRDAFNAIRWGDAVGPVFYHGLG 
AGQMPTASAWADVIDTAVGRTPI 
TFETLEYFSAERPPRTAQRDVDRLRGRYYLRLRV 
ANDPGTLAAIAGVLSSHSISIASV 
IQHEPKSKDEANAAENTVPLVIMTHEASEGAASR 
ATTDIEALPAISGQWRMPVRE"
/note="cosmid pircos-cle07/ cosmid 
pircos-alg06 joining point" 
/locus-tag="RB8511"
/locus-tag="RB8511"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78679.1"
/db-xref="GI:32445948"
/translation="MLGGCSVHVGDGNQSPRSLN
TGVIQIPRRSPRYRTEIPVGRVRS
APKKASAIRMRRICWQSKENGGATCPNAAPPFSF
SIARNMQPGLGVAIPHHDTCPRLS 
VSAIASGPPLG" 
.2798 /locus-tag="RB8512" 

.2798 /locus-tag="RB8512" 

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78680.1"
/db-xref="GI:32445949"
/translation="MHSLGARLCRTLRLAGWNQ

VEDPTLWPKRTDGGTCRGVGLRHR 

DPAACFLRLKMRTAGQRLGTLHRRFLLIASKSAS 

F" 

/locus-tag="RB8513"
/locus-tag="RB8513"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78681.1"
/db-xref="GI:32445950"
/translation="MTDPIALRNLSKIRNFPSRI

FPTNRFRRCKDAPMAIGPLTDTTT 

LSIDRLYDLYHAIAERDHAFRLQAQYGSTPPPKG 

HCEFRPLGRQTFVQRVLHYDSLPA 

EVGAAFRARLSRQAEAYGVDPLSKTLNKTNAA" 

/gene="yqjH" 

/locus-tag="RB8515"
/gene="yqjH"
/locus-tag="RB8515"
/note="best DB hits: BLAST:
swissprot:P54545; YQJH-BACSU
HYPOTHETICAL 47.0 KD PROTEIN IN;
E=0.15 PFAM: PF00817;
impB/mucB/samB family; E=0.87"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAD78682.1"
/db-xref="GI:32445951"
/translation="MSDRGQTLRQILRAAHCRST

HHHFALDATELVQTDSGQRLVSQL 

LRHHTRYLAGAKDPDVRFRDFQNHWHVDDGYWG 

GAPRVAHQWYDRLQRYLHQSRFDD 

AAHAAGVLSHYVTDPIQPLHTAQTPMEKVLHRPI 

EWCITQNYLDLLAQWRQDPTRIVF 

QLSDQPGWLGEAILQSARLAHSHYQTLLDNFDLA 

ASEHDPKQGLNEVSRRIVSQLVGL 

SVTGWARILERAAADAEQRSGRPIPTAGLSLPMV

FSAIRVPDRMWLKRITYKIEREKV 

QRLVDEFRRTGDVEQNLPSEVRVLQRVRTIYHRE 

KEYRQAKAKLAAAREALVASEAAE 



GKKTAARFAGIGIHTVGDFLKADE 
KTMTRRLATRWITVDTIHAWRCQTMLMCQLPSML 
AREVQLLVGAGYTTTDALAKTDVT 
TLQSAIETYSATYSGRRYLRGANPPTLDRLEIII
GDAAHAMIKLKRDNAMHHSDSEST 
ESSSAAQSQLQSAGQPQQSPQRRAA" 
2840 /locus-tag="RB8516" 

2840 /locus-tag="RB8516" 

/note="PMID: 10984043 best DB
hits: BLAST: pir:D83024; probable
secretion protein PA4974
[imported] -; E=0.28 pir:A82540;
outer membrane export factor
XF2586 [imported] - Xylella;
E=0.31 pir:B83071; hypothetical
protein PA4592 [imported] -
Pseudomonas; E=0.45 PFAM: PF02321;
Outer membrane efflux protein;
E=8.2e-07"
/codon-start=1
/transl-table=11
/product="similar to secretion
protein"

/protein-id="CAD78683.1"
/db-xref="GI:32445952"
/translation="MDRQTSLSSRKQTRRRRLAS

WLGVAVAASTLTSGCHWAQKIPDG 

PHDTKTSYHDHSAMRIEYPEVNQCATPVISAAVS 

AVQPNTLEDPANLPSVELTLEQAV 

QQALRSSPVLRTIGGAWSAPQSATTVFTPGLAH 

ADPLGGVEAALANFDAQYSQSLFW 

NKLDQPQNIATGGLGAAFTPTTFQATQATFANEL 

SKRTATGAQFALRSNVGYDRNNRP 

FRQFQSDFIGFLEAEYRQPLMRGAGTTYNRIAGS 

AGASGAIGQYNGVLIARVNEDVAL

ADFEAAVIQLVSDVEQAYWDLTTSYRLLEANAKG 

RESALQTFQYQQVRLEVGTGRRDE 

EAQARSQYYQFEAQVKSALAGETGLYAREQALRY 

LIGLPATDGSLLRPTSDPLNAKW 

FDWQSALGQALDRRVEVRRQKFTVKRREMELLAS 

RLNRRPQVDFVGQYRWRGLGDNLI 

GPGDQGLNDNLVDSITGGNYQEWTAGVEFLAPIG 

LRAASSAVAHAKLNLQRERAVIAE

TELRISHDLSDAARAIDLTFELVETNYNRYLADL 

RQVEVLRRRYRDGTDNINFLLQAQ 

RQWTSESEFYQSISNYNLAIRDFHREKGSLLAY 

NQVDLTEGAWCPGAERDAYELGRF 

LEPRRHPEKVSAPRPLTSGQFDPAAVQSTMGAAS 

YDWQPAPATSAIAQPVPMTDDGM 

IEVDVTPAE"

/locus-tag="RB8518"
/locus-tag="RB8518"
/note="best DB hits: PFAM:
PF00559; Retroviral Vif (Viral
infectivity) p; E=0.47"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide prediction"
/protein-id="CAD78684.1"
/db-xref="GI:32445953"
/translation="MWLRTCLSFFVAPACTDFQ

SHLLVLERLQRRLTTFRIRFQQTI 

RGRQVPVGLLDIGNQLDHSGFEISQGNVFVDSSN 

QNTWLADGSAGPSRSGDPWCRS 

GTSHQRLTILGFEETDEIALKLAEWTWAWSDI 

ATQSELGTGRGAFGQFVGERCLGG 

LKCGRCESRTKPTGCDVLRLIELVPEQRLAVLSV 

EVGQSCFDTPQRIRVSQAWREDRR 

RGLRSAHHRATNRSQHGTGSQCLLDRLLQRQLDA 

GQIGRVFQRIGLNRRHGSRDDWRG 

TLVDLGVLDPHCRMVMIRRFGWRTVRDFLRPMA 

SAGQRGCRHRDTQPRGQATTTGLF 



complement (284120 . 
63) 

complement (284120 , 
63) 



285045. .285170 
285045. .285170 



285213 . .286121 
285213. -286121 



286108. .287595 
286108 . .287595 



GG" 

2851 /locus-tag="RB8519" 

,2851 /locus-tag="RB8519" 

/codon-start=1
/transl-table=11
/product="hypothetical
protein-transmembrane prediction"
/protein-id="CAD78685.1"
/db-xref="GI:32445954"
/translation="MHDPSRKTIAMLESTWDWWF
GDGPVRRRNTSRMFAEFSVKRFYE 
MVRWVRGIRPSRRTFAWVLSGCLASTVAVGTADE 
PMALTTPDAHVASLPSGFFAAAGF 
AMIDDDWADSAAASDVPSDTAPANADEARRQRL 
RDLLRTGAESDRAEAVNGKDRMSL 
RDEIIADAKEEEKDEDGLFSQPFISERLPNVTGG 
RIVLPSLRAVSLDTSTIGNGRLPD 
DFDSTEDVAAIALPEEPFARGMVAAGAVLPWAAP 
NTYSHPLYFEDRMLERHGHERWGC 
MQPIAAGARFFTTIPMLPYLATIQEPCDIVYSKG 
YYRAGSSAPCVMQRPPLERRAVW 
EAAAIAGGIIALP" 
/locus-tag="RB8520"
/locus-tag="RB8520"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78686.1"
/db-xref="GI:32445955"
/translation="MTENSANILDVLRRRTGPSP 
NHQSHVDSSIAIVFLDGSCKK" 
/locus-tag="RB8522"
/locus-tag="RB8522"
/EC-number="2.1.1.-"
/function="rRNA processing"
/note="best DB hits: BLAST:
embl:CAB76071.1; (AL157953)
putative rRNA methylase
[Streptomyces; E=1e-22
swissprot:Q10543; Y881-MYCTU
HYPOTHETICAL TRNARRNA; E=7e-18
pir:C82904; rRNA methylase UU327
[imported] - Ureaplasma
urealyticum; E=3e-14 COG: Rv0881;
COG0566 rRNA methylases; E=7e-19
PFAM: PF00588; SpoU rRNA Methylase
family; E=2.6e-26"
/codon-start=1
/transl-table=11

/product= "putative rRNA methylase" 

/protein-id="CAD78687.1"
/db-xref="GI:32445956"
/translation="MINNHSKTVDPPCAFVTPDR

HTSSTTRTNGPERLPETIRDLDDP 

RLSPYRNLRHRETDDGFIAEGSLWGRLLQSDLG 

VQSVLIHTGRESKYLPLIPDSVPV 

YLIDRELGKELAGYDFHRGVLAHGFSPAIEPLEK 

FRDSPQSSLALALVGLSDPENVGS 

LLRSAAALGVRDILLGPHTISPMTRRVIRVSMAS 

VFRHRFYRFDNPVGQLRDLANAGI 

ASIATLLDDTAIPLIDLSNAIQSKRRILMVGNEA 

NGLPREIAEAATHRSTLPMRNDTD
SLNVGVASAIFLYELTRLQDADHGNG"

/gene="lcfA"
/locus-tag="RB8524"
/gene="lcfA"
/locus-tag="RB8524"
/EC-number="6.2.1.3"
/function="breakdown of lipids,

fatty acids and isoprenoids" 

/note="PMID: 8969504 best DB hits: 

BLAST: pir:E71667; 

2-acylglycerophosphoethanolamine 



gene 
CDS 



complement (287811, 
04) 

complement (287811. 
04) 



gene 
CDS 



288097. 
288097. 



289089 
289089 



gb:AAG19674.1; (AE005055) Vng1339c
[Halobacterium sp. NRC-1]; E=5e-17
pir:A69831; probable acid--CoA
ligase (EC 6.2.1.-) yhfL
[similarity]; E=8e-17 COG:
RP620-3; COG0318 Acyl-CoA
synthetases (AMP-forming)/AMP-acid
ligases; E=5e-19 VNG1339C; COG0318
Acyl-CoA synthetases
(AMP-forming)/AMP-acid; E=5e-18
BS-yhfL; COG0318 Acyl-CoA
synthetases (AMP-forming)/AMP-acid
ligases; E=7e-18 PFAM: PF00501;
AMP-binding enzyme; E=1.3e-08
PF01781; Ribosomal L38e protein
family; E=0.56"
/codon-start=1
/transl-table=11
/product="2-acylglycerophosphoetha
nolamine acyltransferase (aas)"
/protein-id="CAD78688.1"
/db-xref="GI:32445957"
/translation="METADVHDLLSALIHHAKQR
PDEIALIDSQTGAAAFTWSELTCR 
VDATAIMLRSKFSEATTTKRLTYRCTNRPEDWL 
SLACVAAGFTEIPIDAFLPAPQQD 
VLIKRSKALHWAHEIHANSIHSSKVADAISNLET 
AARDVDIHSPSLVLWTSGTTSEPR 
GVMLSQHNLTTNAKAKLLAVPQHTSDLRLSLLSI
AHAYARTSDMGTWLLSGCRWSLGR 
GRSTLRSLPENLCPTLINAVPVLINDILNRIESG 
QSSLQSLRLLGCGGVAMSLGQFER 
CKQNGIGVIQGYGCTETSPVICSASPDNATPNRV
GPLVAGWESKVEHGRLFVRGPGVM
IGYLDEEAATQQKVSPGGWLDTGDLVEIHDDGQF
QILGRADDVIVMDNGFKVFPATIE
RQLLQLEGIEQAVLLHHQGQLCLLLSHSQPQVTA 
DAPTGRDPIESCLADSLPPGTSVK 
RLKLSEPLSIKSGELTAKGTPRRPIIRQRRLSSP 
PDSRPNHSRDP" 
.2881 /locus-tag="RB8526" 

.2881 /locus-tag="RB8526" 

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78689.1"
/db-xref="GI:32445958"
/translation="MGMRVLQLVSLNESRICYEQ

SLESGVHRFVQMLQDRANLDGLIL 

ISSLSQLESQINFEPRWFAFSAQESRVTGDRVP 

LLGLWELRSPADESGFFEA" 

/locus-tag="RB8527"
/locus-tag="RB8527"
/note="best DB hits: PFAM:
PF00114; Pilin (bacterial
filament); E=0.038"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-putative conserved
hypothetical protein"
/protein-id="CAD78690.1"
/db-xref="GI:32445959"
/translation="MPNYCRDVNKNRKAFTLVEL
LWIAIIGVLVGLLLPAVQSAREA
ARRMSCSNNMKQLGLAMHNYHDAFRSFPYGHLNT 
GLSVTHKRDSWYQNILPFVEQQAY 
AEKYRNAATIYSVYEAEWIHRMPLELAGVAVPTF 
MCPSEPSGPALGGGGSDNGFQGSY 
GVSAGGGTLPQDLLGGITPDEINIIKTDPGGMFG 
ETRTYKFRDCLDGTSNTLLASETI 
IRGASGGSWGGMGGYWGGSPHGSFGFSSAEPPNT
SVPDRVYSCKSTNFPKSPCENGNA 



289184 . .289294 
289184. .289294 



289288. .289716 
289288. .289716 



complement (289725 . 
86) 

complement (289725. 
86) 



complement (289911. 
25) 

complement (289911. 
25) 



ERQIWRDLGNRADGHVLGEF"
/locus-tag="RB8528"
/locus-tag="RB8528"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78691.1"
/db-xref="GI:32445960"
/translation="MPPIIPAKRFEIRLPDEQEL
RSQIVKPSHAVHIKFC" 
/locus-tag="RB8529"
/locus-tag="RB8529"
/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78692.1"
/db-xref="GI:32445961"
/translation="MLRFTSMYKNAVCSIGLFIT
PLILTGCPAPSGLELYPVTGTVTF 
EGKPIDTGRIQFRTASEERRSFSAAIENGNYEME
TLTGPMTVEVRASRLIEGKFDKSN 
PDELTPAGEMYIPQKYNSRTELTADVPAGGDTID 
FNLLGS" 
.2898 /locus-tag="RB8530" 

.2898 /locus-tag="RB8530" 

/note="best DB hits: BLAST: 
pir:D64585; cag pathogenicity 
island protein cag5 - 
Helicobacter; E=0.63 pir:F71927;
cag island protein, DNA transfer
protein - Helicobacter; E=0.63
gb:AAF80195.1; AF282852-5
(AF282852) Cag-beta [Helicobacter;
E=0.63"
/codon-start=1
/transl-table=11
/product="similar to cag
pathogenicity island protein cag5"
/protein-id="CAD78693.1"
/db-xref="GI:32445962"
/translation="MLSNTIPTGFTLSSPPNDKQ
RJSTFNIRKQVISRRNLLAFVGYGSA 
AVIGRAQLN" 
.2911 /gene="tatC" 

/locus-tag="RB8531"
.2911 /gene="tatC"
/locus-tag="RB8531"
/function="protein secretion
systems"

/note="PMID: 9649434 PMID: 9546395 
best DB hits: BLAST: 
swissprot:O05523; YDIJ-BACSU
HYPOTHETICAL 29.1 KDA PROTEIN IN;
E=4e-18 swissprot:Q9ZM59;
TATC-HELPJ SEC-INDEPENDENT PROTEIN
TRANSLOCASE; E=6e-18
swissprot:Q9ZCG6; TATC-RICPR
SEC-INDEPENDENT PROTEIN
TRANSLOCASE; E=7e-18 COG: BS-ydiJ;
COG0805 Sec-independent protein
secretion pathway component;
E=4e-19 VNG2267G; COG0805
Sec-independent protein secretion
pathway; E=9e-06 PFAM: PF00902;
MttB family UPF0032; E=4.5e-35"
/codon-start=1
/transl-table=11
/product="Sec-independent protein
translocase protein tatC homolog"



/db-xref="GI:32445963"

/translation= "MASSSSVQSSSAQWYPPSPG 

GLKLVEALTRPKDDLFDNSTMTFG 

EHLEELRGSLVKAIIWLLIGLAVGLMFANRWRF
IQEPLKQAIIEYNADRDLKEMGLP

DRKEDPQVSRFYEFLTSNSLVADWYTLPSGSVP 

VATGDMVEPPEATGTAVESEPDEK 

NSPDVEAEAGESAALDIPERITTAELMKSMGTIP

NPDELVPMIQLRRSERGLSSLKIE 

EPFMIWVKAGLIVGAVLASPMIFYHLWSFVAAGL

HSHERRYVYVYLPFSWLFVSGW 

LAFGLVLHYVLTFLLQFNGSMDVAVEPRLTYYVN 

FVLMLPLGFGVAFQLPLVMLFLQR 

IDLIQTQDYINSWRVAVLVIFVISMIVTPADVTS

MVALAVPLLFLYFLGIVMCAYMPR 

GRGLGSEAYDPA" 

291172..291762 /gene="pfpI"
/locus-tag="RB8533"
291172..291762 /gene="pfpI"

/locus-tag="RB8533" 
/EC-number="3.2.-.-"
/function="proteolysis"
/note="best DB hits: BLAST:
pir:H75163; intracellular
proteinase (pfpi) PAB0311 -
Pyrococcus; E=1e-30 pir:F72722;
hypothetical protein APE0319 -
Aeropyrum pernix (strain; E=4e-30
gb:AAB89965.1; (AE001016)
intracellular protease (pfpI);
E=2e-29 COG: PAB0311; COG0693
Putative intracellular
protease/amidase; E=1e-31 PFAM:
PF01965; ThiJ/PfpI family;
E=1.1e-39"
/codon-start=1
/transl-table=11
/product="intracellular proteinase
pfpI"
/protein-id="CAD78695.1"
/db-xref="GI:32445964"
/translation="MPFMTDPTSSSNPATAVSVD
SYSPQTLVGKRVLSFVGEIYEDLE

LWYPKLRLIEAGAEFFVAGPKAGEKYDGKLGYPC 

VSDLAIDACEADSFDGLLVPGGFM 

PDKLRRDPKVLQLVRDFDAAKKPIAAICHGGWIP 

ISAGVYRGVRVTGSPGIKDDLVNA

GAIFEDASVWDGHHVSSRRPDDLPDFCRHFIAL 

LA" 

291802..291930 /locus-tag="RB8534"
291802..291930 /locus-tag="RB8534"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78696.1"
/db-xref="GI:32445965"
/translation="MDQPSSTVCVRGGTCLKRSH
ERREPVRRWIAAWRLSAARRNN"

complement(291942..292163) /locus-tag="RB8535"
complement(291942..292163) /locus-tag="RB8535"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78697.1"
/db-xref="GI:32445966"
/translation="MGTKIHAGTPSNQIGKNLGI

IHRADPTNPRQSMSSQLRQGITTL 

QFHKMSQPPKTCSTAVPAVRCWGSVIAPE"

292098..293066 /locus-tag="RB8536"
292098..293066 /locus-tag="RB8536"
/codon-start=1
/transl-table=11
/product="hypothetical



/protein-id="CAD78698.1"
/db-xref="GI:32445967"
/translation="MDDSKVLSDLVGRRSRVYFR

SHERMTLSAPMRCWWATCALAIVM 

LGSGCSLFRNGVWSDGSSGELNSGSNHTQVSKLT 

HKSRKTISLQADFRHVGTQDLDES 

LWQHVDETAFPPKVREAWLANGFRIGLIATPDAL 

PETAEENVASGDDPINRLLSTAGV 

LGRSPSGIETIPLRPSQRHELPISTVLDGSHWL 

AQENGKLTGRSLDSPQMSLSLTPT 

LGPGPGQVTLEIRPEIQHGSVQQRFISSEAATRL 

ATGRSTWELPEMNLSWAANPHLTL 

LIAPVHQPDENEPTFGLARQMLRDADHMQDDQHV 

MLLLRINDLPSS" 

complement(293117..293302) /locus-tag="RB8538"
complement(293117..293302) /locus-tag="RB8538"

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78699.1"
/db-xref="GI:32445968"
/translation="MTVGAGSGEGVSAKVSVFHE
RSSTGTDAISSHRKKKFSKWWQTF
AGACVDRENLARVKDMS"

293349..293768 /gene="argA"
/locus-tag="RB8540"
293349..293768 /gene="argA"
/locus-tag="RB8540"
/EC-number="2.3.1.1"
/function="assimilation of
ammonia, biosynthesis of the
glutamate family"
/note="best DB hits: BLAST:
pir:A70418; conserved hypothetical
protein aq-1359 - Aquifex; E=2e-08
pir:H75488; conserved hypothetical
protein - Deinococcus radiodurans;
E=5e-08 pir:B81033;
N-acetylglutamate synthase NMB1876
[imported] E=9e-07 COG:
aq-1359; COG1246 N-acetylglutamate
synthase and related; E=2e-09
PFAM: PF00583; Acetyltransferase
(GNAT) family; E=6e-15"
/codon-start=1
/transl-table=11
/product="probable
N-acetylglutamate synthase"
/protein-id="CAD78700.1"
/db-xref="GI:32445969"
/translation="MTVRQSTPLDAQAIHALMRP
FVSQHLLLSRTEAEIIELTRHGFV 
AMVGPRCIGFSAIEVYSPKLAELQCLAVHPEAQR
LGLGRKLVGHCI ERARTLGVME XL 
AISSSEDFLKSCGFDYSLPDQKKALFCQLRPRNF 
DDH" 

293758..294831 /gene="celM"
/locus-tag="RB8541"
293758..294831 /gene="celM"
/locus-tag="RB8541"
/EC-number="3.2.1.4"
/function="degradation of
exogenous polysaccharides"
/note="best DB hits: BLAST:
pir:E75542; probable endoglucanase
- Deinococcus radiodurans (strain;
E=6e-48 gb:AAG19796.1; (AE005064)
endoglucanase; CelM [Halobacterium
sp.; E=2e-47 pir:A69157;
endo-1,4-beta-glucanase -
Methanobacterium; E=8e-47 COG:
DR0229; COG1363 Cellulase M and
related proteins; E=6e-49"



/transl-table=11
/product="probable endoglucanase"
/protein-id="CAD78701.1"
/db-xref="GI:32445970"
/translation="MTTEMQPLEFFKQSILTPSP

SGYEEPIQKLIGEYLKPHSDEVSI 

DVHGNLTARVGEAGGPKLMLAGHCDQIGMLISHI 

DDQGFLYAQTIGGWDPQQLIGQSM 

TVWTDDGPVSAVISRKPIHLLSQQERGEWKLEQ 

MWLDIGAKDGDEAKSKVRIGDCVT 

LNLAYRELLGDMVSGPGMDNKTGMWTVIETARRC

ASSDQALQCELHSVATVQEEIGLR 

GAKTAAGRINPDVAIAVDVTHASDCPTIDKQQQG 

DIKIGGGPVIFRGPNINAKVAKRL 

MQLADDNDIAYQPAALGRAAPNDSNVLQVSGSGV 

ATGLVAIPNRYMHSAVETISLGDI 

EAIAKLLTLFAQSLTPECDFIPG"

complement(295001..296038) /gene="pyrH"
/locus-tag="RB8542"
complement(295001..296038) /gene="pyrH"
/locus-tag="RB8542"
/EC-number="2.7.4.-"
/function="pyrimidine-ribonucleoti
de metabolism"

/note="PMID: 9576794 best DB hits: 
BLAST: swissprot:O04905;
UMPK-ARATH URIDYLATE KINASE (UK)
(URIDINE; E=1e-30 pir:T47842;
URIDYLATE KINASE-like protein -
Arabidopsis thaliana; E=9e-30
gb:AAB27365.1; UMP kinase
[Saccharomyces cerevisiae.
Peptide, 204; E=1e-29 COG:
YKL024C; COG0563 Adenylate kinase
and related kinases; E=1e-30 PFAM:
PF00406; Adenylate kinase;
E=2.6e-39"
/codon-start=1
/transl-table=11
/product="uridylate kinase"
/protein-id="CAD78702.1"
/db-xref="GI:32445971"
/translation="MTESNELTPNTFYIEVIGAG 
LPEVDGLFVPSTAPPAESESGTVS 
SPGYWNGKMAWDRADGKSARSPALSYSNTYRSWR
ICRLDGHLAYDITCEDELPPTDRR
WHVYKKGVAPSPKWIHHSDPRLPCPEPNWFVL 
GGPGAGKGTMCELAESQLGWVHLS 
TGDLLRAEREANGPHAAAIEEIIAAGNLVPSTIV
VKLLRDAMEKITRETGNRNFLLDG 
FPRSESNLEAWYDVFGKDTELPKMLFFECPYDVL 
EKRVLARAKYTGRQDDNLVSLKSR 
FDTFKKETLPTVEFFKSQNRCVELDTSQDRQAVY 
RLVCEQLSEHTDCELANKPLSERA 
EMLLGLRPFPN" 

complement(296105..298942) /gene="alaS"
/locus-tag="RB8543"
complement(296105..298942) /gene="alaS"
/locus-tag="RB8543"
/EC-number="6.1.1.7"
/function="tRNA-synthetases"
/note="PMID: 7005898 PMID: 7025207
PMID: 6256345 best DB hits: BLAST:
pir:H83533; alanyl-tRNA synthetase
PA0903 [imported] - Pseudomonas;
E=1e-132 swissprot:O67323;
SYA-AQUAE ALANYL-TRNA SYNTHETASE
(ALANINE--TRNA; E=1e-130
pir:F75289; alanyl-tRNA synthetase
- Deinococcus radiodurans (strain;
E=1e-129 COG: PA0903; COG0013



PFAM: PF01411; tRNA synthetases
class II (A); E=2.8e-255 PF02272;
DHHA1 domain; E=5.4e-20"
/codon-start=1
/transl-table=11
/product="alanyl-tRNA synthetase"
/protein-id="CAD78703.1"
/db-xref="GI:32445972"
/translation="MKTDELREKYLAFFETKGCV
RQPSDVLVPAWDPSVLFTPAGMNQ 
FKDHFLGKVKLDFTRATTCQKCLRTGDIDNVGRT 
AFHHTFFEMLGNFSFGDYFKEEAI 
HWAWEFLTDKKWLGI PGERLTVTVYKDDDEAFGI 
WHDKIGLPTQRISRMDEDENFWPA 
SAPSEGPDGVCGPCSEIYYQLEDGSDVEIWNLVF 
TQFNRVGTPPDNLHPLPSKNIDTG 
MGLERTASVLQGVPTNFHIDSLFPIVEAASEVCG 
VKYEYESDNGRRLRRITDHARASV 
FAVHENVYPGPKDARSVIRRLIRRAVLDGYQMNL 
REPFLYKLVEAVADASKAAYPELG 
QTTQRVSEAIESEEKAFFSTIDGGMKRIHRLFEE 
MNDEASVMVPGAEAADLLTTYGVP 
PELVQTLAAEQNFTFDWSGFREAMDKHADESDGG 
QRVLFQTGPLETLKEALRETPFVG 
YEQTEATAWKGIITGDGKGKGDDGQLLSHLDRP 
EDAVLRLVLDHSPFYGESGGQVGD 
IGVISNDNFEFEVIDTQRHASMIVHHGRLIRGKI 
NEGETCTAKVDVENRTALARAHSA 
THILHHALHTHVGRHAEQQGSKVEPDRLRFDFTN 
PKAIDDETLVKIEQDVLGMVGKGD 
EIRVroWSLADAREAGAMMLFGEKYPDPCRMVSM 
GTFSRELCGGTHLTNTKQVGSFEV
WEESVSTGTRRIEALTGERAKEHREQTQALLNE 
VAGKLNCDASVAAAATVALIEEVR 
RLKKELSSGKAADYPAEFVFDAKAAKAETTDISD
YNAVRAAVRGLTRRLNVAITDVLS 
RLDSLLADRSKLVEQLKQVTAGGKISADDLIADG 
TKVGDTLLIVAETPGANPNIMRGW 
IDQIRKKSDTPTAVLLASSMNDKVMLVGGLSRDL 
VDRGLKAGDWVGAAAKWGGSGGG 
RPDMAQAGGKDASKLPEALQQARETMTEKLG"
2991 /locus-tag="RB8546" 

2991 /locus-tag="RB8546" 

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78704.1"
/db-xref="GI:32445973"
/translation="MKSISSGEVELWSRRSVST
GQKSIAAAVWRPALWAEIRHLSPV
GHRLISCRRWNASPRGSIPDLFLVESTRTS"

/locus-tag="RB8547"
/locus-tag="RB8547"
/EC-number="3.4.11.9"
/function="proteolysis"
/note="best DB hits: BLAST:
pir:F65012; hypothetical protein
b2385 - Escherichia coli (strain;
E=1e-55 gb:AAG57511.1; AE005469-11
(AE005469) putative peptidase;
E=1e-53 pir:D75419; proline
dipeptidase - Deinococcus
radiodurans (strain; E=1e-49 COG:
ypdF; COG0006 Xaa-Pro
aminopeptidase; E=1e-56 PFAM:
PF00557; metallopeptidase family
M24; E=2.9e-50"
/codon-start=1
/transl-table=11
/product="putative peptidase"
/protein-id="CAD78705.1"
/db-xref="GI:32445974"
/translation="MTQRIERLVQSIADSDQPMD



VRPDGKATLLSDRRYETQIAEECPALESAIRPPS 

QTLVALLAEYLADSSLKTIGFEAD 

HVQVSTMHQWKEQIESVEWTQTSGLVETLRSIKD 

ADELATIRRAISIAERSFLSVTNK 

LTPRMTELQIAHELEATMRSLGASGVAFDVIAGA 

EPSGALPHYHPRNIALADCRTLLI 

DWGARVDGYCSDLTRTLHKADVRSATADRFEAAY 

QAVLESQEAAI S AIRDGVEAI EVD 

RAARQVLQNAGLGDAFKHGLGHS FGLEIHEDPRM 

GPMSTDVLREGMVLTVEPGVYFEG 

EFGIRIEDDILVTADGFERLSTLPKGLDDCRLW 

II 

gene 300336..300863 /gene="accB"
/locus-tag="RB8548"
CDS 300336..300863 /gene="accB"
/locus-tag="RB8548"
/function="fatty acid
biosynthesis"
/note="PMID: 7592499 best DB hits:
BLAST: pir:H69580; acetyl-CoA
carboxylase (biotin carboxyl
carrier subunit); E=1e-18
gb:AAB00182.1; (U36245) biotin
carboxyl carrier protein
[Bacillus; E=1e-17
swissprot:P49786; BCCP-BACSU
BIOTIN CARBOXYL CARRIER PROTEIN
OF; E=1e-17 COG: BS-accB; COG0511
Biotin carboxyl carrier protein of
acetyl-CoA; E=1e-19 PFAM: PF00364;
Biotin-requiring enzyme;
E=1.5e-25"
/codon-start=1
/transl-table=11
/product="acetyl-CoA carboxylase
(biotin carboxyl carrier subunit)
accB"
/protein-id="CAD78706.1"
/db-xref="GI:32445975"
/translation="MDFAGIGLRMKSETPDGGDV

FDIDRIRQIVELMEQHELNEVDLQ 

QGEDRIKLTRGGTAPAMI PAPVAAPPVAAAPAAP 

SAAPAASGGDAPADTAGTITINSP 

MVGTFYSKANPESPPFVQVGDWGEDTWCIVEA 

MKVFNEIPAECRGKIVEVLVGDQQ 

AVDFGKPLFRVQVDG " 

gene 300867..302219 /gene="accC"
/locus-tag="RB8550"
CDS 300867..302219 /gene="accC"
/locus-tag="RB8550"
/EC-number="6.3.4.14"
/function="lipid, fatty-acid and
isoprenoid biosynthesis"
/note="PMID: 8102363 best DB hits:
BLAST: gb:AAB88214.1; (U59234)
biotin carboxylase [Synechococcus
sp. PCC; E=1e-139
swissprot:Q06862; ACCC-ANASP
BIOTIN CARBOXYLASE (A SUBUNIT OF;
E=1e-138 pir:S74380; biotin
carboxylase accC - Synechocystis
sp. (strain PCC; E=1e-136 COG:
sll0053; COG0439 Biotin
carboxylase; E=1e-137 PFAM:
PF00289; Carbamoyl-phosphate
synthase L; E=5.1e-48 PF02786;
Carbamoyl-phosphate synthase L;
E=1.4e-100 PF02785; Biotin
carboxylase C-terminal; E=2.7e-58"
/codon-start=1
/transl-table=11
/product="biotin carboxylase"
/protein-id="CAD78707.1"
/db-xref="GI:32445976"
/translation="MFERILIANRGEIALRVIRA



gene 
CDS 



302289. .303425 
302289. .303425 



gene 
CDS 



complement (303021. 
09) 

complement (303021. 
09) 



gene 
CDS 



303556. .304191 
303556. .304191 



ADHAYCVGTNRSADS YLKIDRI I ATAEVAGVDAI 

HPGYGFLAENAQFNEQCRSSGFEF 

IGPS PTAMDKLGDKNTARSMAIAQNVPWPGSDG 

LIADFDRAIDTAKEIGFPVLIKAT 

AGGGGKGMRVAETEDVLKSQLEAARNEAIAAFGN 

GGVYLEKFIQQPRHIEVQVIADTH 

GNVCHLFERDCSVQRRHQKLVEEAPSPDLPADRR 

EAI CDAAVRM I AGAD YSGAGTVEF 

IVDKDYNFYFIEVNARIQVEHPVSEMVAGVDLIQ 

EQIRVAAGLPLSFRQDELTCTGHA 

lECRLNAEDPTKNFQPCPGKIESMFVPGGMGVRF 

DSHVKSGYWPPYYDSMIGKLIVH 

RPTREQAIATMLRALKELQIEGITTTVGFHNWIL 

QNEAFVNGTVDTKFVDRNYKGQTS " 

/gene="hpnA"
/locus-tag="RB8551"
/gene="hpnA"
/locus-tag="RB8551"
/note="best DB hits: BLAST:
embl:CAA04730.1; (AJ001401) HpnA
protein [Zymomonas mobilis];
E=3e-17 swissprot:P73212;
DFRA-SYNY3 PUTATIVE
DIHYDROFLAVONOL-4-REDUCTASE;
E=1e-13 pir:C83067; hypothetical
protein PA4631 [imported] -
Pseudomonas; E=3e-12 COG: slr1706;
COG0451
Nucleoside-diphosphate-sugar
epimerases; E=1e-14 PFAM: PF01370;
NAD dependent epimerase/dehydra;
E=0.51 PF01073; 3-beta
hydroxysteroid dehydroge;
E=1.6e-06"
/codon-start=1
/transl-table=11
/product="HpnA protein"
/protein-id="CAD78708.1"
/db-xref="GI:32445977"
/translation="MQIETLDEFRYPKRRFMWHR
LPACVSLHHNRLEAYVTLYSVGPQ 
GITMRVFVTGGTGLLGNTILRQLSDAGHDLIALV 
RGEPDPQVFEGINTQFAHGDLLDQ 
TAIENAMSDCDAVIHSAGMIHLGWKRLDDSMAVN 
RDGTQTIVDACLHHDCKLVHVGTV
NTLGIADRNGVANEDTPLDHAGGQVPCSYVLSKR 
AGNEAVMQGVKDGLKAVLVHPGFM 
LGPWDWKPSSGRMLLEVSKAWRPLSPSGGNSTCD 
SRDVAAATIRAAEKLLANEIPSGR
QYILAGHNMRYLELWKAMTEAMGARQPIMRAGPA 
QRWIAGVAGDIVGKLLPHEVDFNS 
AAVGMSSQYHYYDSSRAQSELDYKIRPFEESLTD 
AANWIRQHHR" 
,3035 /locus-tag="RB8552" 

.3035 /locus-tag="RB8552" 

/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78709.1"
/db-xref="GI:32445978"
/translation="MSEPNITPNFRRWRKSWCLQ

WRVCFCRASSVMLADPVGRVGEAF 

FEGADLWEFALGSRAVWWLARHADRCAVEVD 

FMGQQFADDVACNARDPSLSWSGT 

HDWLAGSHRLGHRLPKFEVPHWAGQNVLPAGRD 

FIGEQLFRGADGGGGHVSGIAGAV AS" 

/gene="sigW"
/locus-tag="RB8553"
/gene="sigW"
/locus-tag="RB8553"
/function="transcriptional

/note="best DB hits: BLAST:
embl:CAC17654.1; (AL450450)
putative ECF sigma factor; E=0.003
dbj:BAB03982.1; (AP001507) RNA
polymerase ECF-type sigma; E=0.005
swissprot:Q45585; SIGW-BACSU RNA
POLYMERASE SIGMA FACTOR SIGW;
E=0.005 COG: BH0263; COG1595
DNA-directed RNA polymerase
specialized sigma; E=4e-04 PFAM:
PF00776; Sigma-70 factor (ECF
subfamily); E=5.2e-10"
/codon-start=1
/transl-table=11
/product="probable ECF sigma
factor"
/protein-id="CAD78710.1"
/db-xref="GI:32445979"
/translation="MDFDSTAAEPTTRESLLLRL

RDSKDEFAWAEFSAIYEPVIYRLV 

KRRGVQDADAREIVQEILMSVAGAIERFDVTGQG 

SFRGWLSRITRNATIDRLRSVQSR 

RELTGGSSIVRIIDAAASNERGLDGFQQDAELSH 

EFDRDHRQQLFRWAASKVRSRTGE 

KNWI AFWRTAVEDQS lAAVASELGITEGSVYVAR 

CRILKRIRELVHQRLSD " 

/gene="pkn2"
/locus-tag="RB8555"
/gene="pkn2"
/locus-tag="RB8555"
/function="unspecified kinase or
ATP dependent regulatory protein;
unspecified signal transduction;
cellular communication/signal
transduction"
/note="PMID: 7774814 best DB hits:
BLAST: pir:B70936; probable
serinethreonine-specific protein
kinase (EC; E=3e-31
embl:CAA18685.1; (AL022602)
putative serinethreonine protein;
E=5e-31 dbj:BAB06223.1;
(AP001515) serinethreonine protein
kinase; E=6e-30 COG: Rv2176;
COG0515 Serine/threonine protein
kinases; E=3e-32 PFAM: PF00069;
Protein kinase domain; E=1.2e-49"
/codon-start=1
/transl-table=11
/product="probable
serine/threonine-specific protein
kinase 2"
/protein-id="CAD78711.1"
/db-xref="GI:32445980"
/translation="MIDRIMIAKQDPHFEMDKLR

ELLDERLPVPEADRIEYHLSNCER 

CRSQLRAIAGEPMWWDETVEVLSKSTFIDQSSRP 

TSQPPSDPAQPQRSFDASLDWIRP 

LLQPLRDQHSDAVDDTRAIGTLDQYSIHSVIGQG 

GMGWLRGMDPELNRPVAIKVLSP 

HLAGVGAARTRFMREAQAAAAIVHPAIVPIYSW 

PSARLPYLVMPCIDGGNLQQRLDR 

EGPLELTEWRIGLQIAEGLSAAHKNSVLHRDIK 

PANILIEEGNGRVLISDFGLARAL 

DDATLTNSGMIAGTPQYMSPEQARGESLDARSDL 

FSLGSLLYALASGRAPFRAESPLG 

ILRKITETRAKPIQEINERMPRWFDNLVARFMEP 

DLTLRIESADEAVSLLREVNAHVR 

NPTGNRLPNSLEERRPRRSLRWTLAAAGVLTAVA 

SAATWNHRKQENSHTERSLPTVIE 

QSRRLTTMAPSVPAPPAPTTDDWDAVHFNAELHS 

IQFMLEQLANQLEGTDPTVNPDLT DGDQK" 

/locus-tag="RB8557"
/locus-tag="RB8557"
/note="best DB hits: BLAST:

VCA1075 [imported] - Vibrio;
E=0.50"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAD78712.1"
/db-xref="GI:32445981"
/translation="MKDVAQMNSVPTPALPKRRS

RVWKSAFIATLTI PTAAVLVWSWT 

SLPSTWAQTRDPRS IDQPRRLNS YPQPNNLP YQE 

SRAQIASELSLSHQAYPNSDHQSQ 

PGFNNQSKTTLRIDKQAQPAQEVDVPVTRRTMDK 

DGNISITTEYIRQRIPLQLTGRLM 

TTEDSKHNERIKQLVNQLRGSTSSNEMDAKLAEF 

RSELEQEFKRMHEQQGSEIAAIEK 

RLQSLKEIHQQRAENQDKIIQRRIDQLLGRSDPL 

DWNYTPPNTPFAPLSPYSSNQYGS 

SPISDNFHPKSPYAYPESNSPRSTRSWQPSPNRP 

PSPSPQKPASRPIPPSYPGKLMEA 

YESALKASDAAVLEQSNKERLNEYTRLHEEGALT 

ASELRRATLDYASAQKQTALLKLQ 

LQS IQRDLARYLNAAELELAALNEQQKTNPSANS 

KSAQLERQRAEAAITDAKTTLEQF 

EAAMKLIPVDTDSAEPSDAPKNPNESADQPESEL 

DPTTNAPKASEDTTLNLETEEVET 

TPNEPVPSF" 

complement(307305..307877) /locus-tag="RB8559"

complement(307305..307877) /locus-tag="RB8559"

/codon-start=1
/transl-table=11
/product="hypothetical
protein-signal peptide and
transmembrane prediction"
/protein-id="CAD78713.1"
/db-xref="GI:32445982"
/translation="MLDRFFSCVRSRFAPLVLAA

IPVAMFSAIEAQEWAQQPVSAVP 

AGP I KD VS FQALWL I E SDD ANR I E YAGPAREALS 

TVGFGRLVPAGSAMASVTVGMDAR 

VDGGSRYGQLQCQVRFLNTTESGEVQAQVEFRTK 

PELGMT I ATTVRAPVGKWFLVGAA 

DSRTGLPTHADDGKRAVMIMKVTKDVITLE " 

complement(307962..309281) /locus-tag="RB8561"

complement(307962..309281) /locus-tag="RB8561"

/codon-start=1
/transl-table=11
/product="hypothetical
protein-putative transmembrane
protein"
/protein-id="CAD78714.1"
/db-xref="GI:32445983"
/translation="MSAAPREGWTVGNRTVAIIA

AMLLLAGLVATFSRIWDYQTPGP 

FDPENQGLCDFHNGIYFPTRAVLAGESPYSDAYA 

AKYPVARQIPFFSPVILLMHAPLA 

LLPLHVGEVLNVILQLFLLAAIAVLVAREAGYPK 

RYDAVLCIAMAMVYSRGGHITLFN 

GYFTFQLVLATFLSWWADRRPVRAAFALLWSA 

KPT Y I LPLGFLMIiARGNWRAL II G 

AVLSIAGAALLLAWIAHHEGDGDIVAGVSILQQQ 

IVDTQEVHRSMDDESPVYSWTRLD 

LFAI lAKWRGEDPGDLPHILAMAALLWPMWVLF 

VRARRGEDDGIAGVTGGVLLTALL 

VSLYHQSYDSLLMIAPVAGAWMARLDGWSNVSPK 

LRWLLCALMAFPALNYLSTRTFLL 

RFDLPEVWRVFTSLNGICLAISLVILCWIAFRK 

RADQENQVSGNAQ" 

complement(309281..310912) /gene="pgm"
/locus-tag="RB8562"

/locus-tag="RB8562"
/EC-number="5.4.2.1"
/function="glycolysis and
gluconeogenesis"
/note="PMID: 7896694 best DB hits:
BLAST: pir:G83004;
phosphoglycerate mutase PA5131
[imported] - Pseudomonas; E=1e-122
swissprot:P52832; PMGI-PSESM
2,3-BISPHOSPHOGLYCERATE-INDEPENDENT;
E=1e-119 swissprot:P74507;
PMGI-SYNY3
2,3-BISPHOSPHOGLYCERATE-INDEPENDENT;
E=1e-118 COG: PA5131; COG0696
Phosphoglyceromutase; E=1e-123
BH0592; COG1015
Phosphopentomutase/predicted
phosphoglycerate mutase; E=3e-06
PFAM: PF01676; Metalloenzyme
superfamily; E=5.7e-52 PF00884;
Sulfatase; E=0.0045"
/codon-start=1
/transl-table=11
/product="phosphoglycerate mutase"
/protein-id="CAD78715.1"
/db-xref="GI:32445984"
/translation="MVLTFMTATRRRPWLIVRD

GWGQNPDPKWNESNAVHLANTPVD 

DKLTQEYPSVLIHTSGEDVGLPAGVMGNSEVGHQ 

NIGAGRIVDQEVMRITRAIRDDSF 

FSNPWTGAIDHVKKSGGRLHLLGLMSDGRVHSD 

LQHGIAVIDLAKRNGLTGDQLAIH 

AITDGRDTSPRGGLKFVSQLTDHCEASGVGWAS 

VIGRFYAMDRDLRWERVEAAYNLM 

TQGTGQTFPSASAAIESYYANPTESSRDGDEFIT 

ASSIVPEGGSPITVQPGDAVLFMN 

YRGDRTRE I TKAFTFDDAAWKD I PGGGFDRGKKI 

DNLYFATMTGYETGLPVKVI FEKP 

AKMPNILGEYVASKGLHQFRCAETEKYPHVTFFF 

NDYRDNPFEEEEWGMAPSPRDVST 

YDQKPEMSAEDITEKVLKQIKSGKCDMIIVNYAN 

GDMVGHTGVLEAAIKAVEKVDACV 

GRWDATLAAGGSLWTADHGNCEQMIDPETGGP 

HTAHTTYQVPLIWDPEFVGKPLR 

EGGRLADIAPTVLALMGLEVPPEMTGRPLMETQT 

A" 

misc-feature 310485 3104 /note="cosmid pircos-alg06/ cosmid
pircos-d2d07 joining point"
gene complement(311043..311426) /gene="glbN"
/locus-tag="RB8563"
CDS complement(311043..311426) /gene="glbN"
/locus-tag="RB8563"
/note="PMID: 10500158 PMID:
10636862 best DB hits: BLAST:
swissprot:Q10784; GLBN-MYCTU
HEMOGLOBIN HBN pir:C70761;
E=2e-18 gb:AAA29446.2; (M57542)
hemoglobin major component
[Paramecium; E=3e-09
swissprot:P15160; GLB-PARCA
MYOGLOBIN (HEMOGLOBIN) pir:;
E=2e-08 COG: Rv1542c; COG2346
Hemoglobin-like proteins; E=2e-19
PFAM: PF01152;
Protozoan/cyanobacterial globin;
E=5.3e-25"
/codon-start=1
/transl-table=11
/product="hemoglobin-like protein
HbN"
/protein-id="CAD78716.1"
/db-xref="GI:32445985"



gene 
CDS 



WDEMYVRVLADPELTHFFDGVP I 
EKLARMQTEF I AS I TSGD IQYTGADLTKVHAGRG 
ITGVHFSKFCGHLTDALEANNVSS 
HAIDQVLGKLAMYSDKVTGSSNVDG " 

complement(311448..312365) /gene="iolI"

/locus-tag="RB8565"
,3123 /gene="iolI"



complement (311448 
65) 



gene 
CDS 



complement (3124 53 
97) 

complement (312453 
97) 



/locus-tag="RB8565"
/note="PMID: 7952181 best DB hits:
BLAST: swissprot:P42419;
IOLI-BACSU IOLI PROTEIN pir:
A69646; E=0.026 COG: BS-iolI;
COG1082 Predicted endonucleases;
E=0.002"
/codon-start=1
/transl-table=11
/product="similar to IolI protein"
/protein-id="CAD78717.1"
/db-xref="GI:32445986"
/translation="MFKNFCPSALGINGRQSELI
ELALTYAFRGMDVDMHDMLRRSQR 
TSFEDASKYLKASEIKIGSFQLAIDLDSDDDTFT 
AAVAQLHPLGEIAQQLGAERALIR 
VPAATNRLPFNEFFDVQRNRLGQIADVLGQRDIK 
LGIGFRAGAELLEGKEFTFVNNVE 
TFRALVDAVPNDNVGYVIDTWDWWGDGAMDQLS 
EISGEKIVALRLGSVPEEADLATV 
KSTDRLLPEMNGMTLDHVKWSHLNETNFSGPMS 
PTASTSQYKGQTREFLVQKSQEAI 
DGI CKEAGMTVAPLPMDL I ED I P YEPTPM " 
.3142 /locus-tag="RB8568" 

,3142 /locus-tag="RB8568" 

/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAD78718.1"
/db-xref="GI:32445987"
/translation="MHTFCRRLANAFRAIFDWLH

RFAGWRRIGNHMPDNWNILASLLG 

TPKPPDRPAAKVEPEKPAAEPEPSAESCESKEKE 

SDAAPKPKETSETAPAKASAFSSD 

DEGADDVLQALAAVKPPPKLPGFGVADDDDSDDV 

PSGFGVAENKISKRESQLKTGGDE 

SPASKRDRKHRGGPDLGEPPRKKGNGGGGGFASG 

LLGNDDFDDEDASDVEAFDDESDL 

lASDTVEITGWDADSVDEGDSAPAAKSDRDRNAG 

GRGRGGRSRRGRDADDSQDESGES 

KPAE S S S S RGGRGGRRGQRQKMSDSDVWEARSE 

SPRGDREESDSDSSETRGRRRRRG 

GRRGERSDSDRGTEERPKADRTDSDRSGSDRSKS 

GRGDRSRSGGREETRTKAWDDSDL 

ESSDDSFAADLFVDDGSVDEPVADRDSKRSQEDS 

DDGEPRAGRRRSRRGGRRRGGRGR 

EERDSSEVKARGGRGSESEKDDDSFDDELPRFAF 

ADDDDSVDTEREETDGEKETRARG 

RSRSRGGRNRDRDNDRERSGGSTRDGGSERSSSS 

ERSGGSARDGERSSSRGRRGERKD 

SRKKDPDVPTWDEIVNQIVDSNLKGRKTQPPRGG 

RGQSGGGNSRGRRR " 



SEQUENCE (SEQ) : 

1 agtagccggg ggtttgagcg acagcgaaca cccccggatt caaaacgcga gcatctattc
61 gaccccggag ggtgtcgcag atcgatcgct acgacgccct ccggggtcgg tgttcgcgag
121 cgaccgtcaa ccgtaggtgc gctatcgcga cctacggcta ccatctttca tcccgtcggg
181 atattcgttg tcatcaacca accgttcgcc ctcaaccaac ttggcagcgt gaaccggttc
241 gttcgcgctg ccaagatcga tcctcagcct gcatcaaagc cgactcactt tttgcggact
301 tcgatgatgt acttcaacaa gtcgttgaac tctcgacgat tcttcatctg ccgcatcagg
361 ttctcgggca tcaacgacac ctccgactcg atcacatcct cgacgtcgta ttgcgaaatc
421 gtgatgggtt ccggttgagc gagattgcgt aacacaatct cgtcgtcgtt ttcactcaca
481 cgaatgccag tgaagatctg accatccgcc gtcagcacgc tgacctgtgc gtaatccttg
541 tcaatcaact tggacgggcg gagcagggcg tcgaccaact cttccctcga gcgtttggtc



661 aagcaagccg ctgacgattc atggaacaac cgtttgccac gctgaaaatt ccccttcgcc
721 aaagcgacct gagccagctt ggctggtgaa gcggccaaca actccttttc caactcttcg
781 ttcgagaaca cctccgtctc cggtgtgacg ctctcatcca tcagctcctt ttccagcgga
841 tgcaactcaa gaatttcttc cggtttgtag aacgtcgtcc ggcctttggt ctcctcgcga
901 acacgcgaga cggcctctgg actgacaaca gacgcttcat tcccccaagt attccgaaca
961 aaggtcagaa ccgccgccat ttcgtcgtcc ttcaacaagg aacggaacgc ggtcatgggc
1021 ggcacacctc gggcggggtc atacgtcttc cctgccacgg tcattttgcc ccacacacca
1081 tgcagagcca tcttgatcag ccgatcttcg ctcccgttga cccagggact accgaccaac
1141 ggtgggtaga cgtttggtgt tcccttgccg tgagtcatgt ggcaggttgc acaatgcgat
1201 tcacgctgat agatcgttgc acccaactcg taagtcttcc gatcggcgcc gctgaggtgc
1261 ttgggaggat caaacttgac gaactcaagt tctgcgggcg cggcaaactc gcttccgccc
1321 ttcgtatcga cgttttccca aaagtggcga accgtcttgg ctgcaacaac cgcatgccgg
1381 acatccgatt ccaacaactg attcagcaaa gcctcattct tgacgccatt tctttgatgc
1441 aaccaaagtg cttccaacaa gtgatgagct tctttttcat cgttgggatc gaaactcgcc
1501 atccactttt gcgtcgccgc gatgacttgg tcggagtcat gcttggtgag ctcgactcga
1561 gtccgatgcc gaacgccgtt gacggggtgt ttgagatttt caagcaacgc ttcgatagac
1621 tggcccgcga ttttgacagg tttttgcagg ggccgattct tgaccgtcaa acgaaaaatc
1681 cgcccgtgag cgtggtcacg attggggtca cgaatgttgt gctgcatgtg gccgatgatc
1741 gcgttgtgcc aatcggaaac ataaagagca ccgtcttcgc cgacgaccgc atcggtgggc
1801 cggaaattgc ggtcgtcact gttgagcaac tccaatcctg gcgttcccca aacctcaccg
1861 aactcgcggt gttgctcttc ctgatcagcg gaaacgattt tgatgtccac agcagcgaca
1921 acctcatcgc cgtcaaagaa ctcgatcttt ccgttgttga aacgatcttc gaagcctttg
1981 cgattccaga ccttgaactc ggagatctgc ^c:gggcgacg ggaagtcacc tcgcatccaa
2041 gggttgttct gctgcgacgt gtgagcaaag tttcccttgt ctccatcgac gagccgttgc
2101 acaggaaagg ttccgttgtt gtattcgctc gactgcgcca gcttggccgt tttcgcaata
2161 ttccggcccc cgctgatgac ctccacttcc gaaaggttca tctgttgccg accattcaca
2221 ctgagtttga acccggtgat tttggcgtcc ttcagagcag ggtggtcaac agtgatcaac
2281 ccgccatttc gagtgatctt aaccggcccc gtctcctcct tcagagtctt ctccgcggca
2341 gcctcctctt caacgtcacc ttctcgatcg agcttgtatt gcttcacccc caagaacccg
2401 atcgtgttac agatcaggat gtcgttttga agttcttcag gaaaatgctc ggaagataga
2461 attgcattgg ctgccaccgg tcgaaattct ttctccagca gtttgtgcat tttaaagcca
2521 ttgccttcgg gccgaacttg atagcaacgc ccaccggtcc catcgctggc atagcaataa
2581 ccccagtagt cgaaactggt gccgtgcgga ttgggggagt tacccgcatg tggagtgatc
2641 gcgaacgtca gcgggtcaaa tcggtacatc cccgacgccc caatgttcag gttctgcttc
2701 cacggagtct cgtggttgtg aacgaggaaa atcccgcttt gccaatagat cccaccgtct
2761 ggtccgtaaa ccaaattgtt ggcggcgtgg tgagtgtcgg aggtaccaag cccttgcaaa
2821 atggggtatc gcacgtccgc cttgtcatcg ccatcggtgt ccttcaagaa caacaggtca
2881 ggtccggacg tgacaaccac accgccgttc caaaattcaa acccgagagg gttgtgaacg
2941 tgtgcgaaaa tcttgcggac atcagctttg ccatcgttgt cagtgtcctc caatatcatc
3001 aaggagtcat tcatctcctt gccaggttcc cactttgggt aagtgttcca actggccacc
3061 cacaatcgtc catgagcatc aacttgcatt tggacggggt tggctaaatc gggaaactga
3121 acttccgacg caaagacgtt gagctcgtaa ccttccggga cgttgatctt cgccagagat
3181 tcctctggac ttaagtaggc gacgctgccc tctttcatag cattggaact ggcactccca
3241 ccaccgatat ttgaggtcac tttgaccggc ggtggaacat tgctgtcgtc agcctgcaag
3301 gttcgcccat ctgcggctgc ccaaatcact ttgtcgcgat tggcggtcat cacgtcgagc
3361 atcaccaatt catgcttcag cacatctgcg ttgctttgcc catccacgaa cgtcaacgtg
3421 gaacgactgc cccaaatgtc attgccgtcg gtcgctcgat accggttgtg ccagtgccag
3481 tttttgtcct caatcgcttc gtacaaatcc tgaagctctg catcagacgc tggcttttcg
3541 cctagcaatg cctgtgaaat gataccagcg agttgcttgt aaccagcctc attgaggtga
3601 actccgttga gcgtgagtcg ctcggaactc gactgaaaca actggtaagt cggcgagaac
3661 aagtcaacaa acttcgcgcc cgtgatttcg gctgcctcac gagtcgcctc ggtgtaagcg
3721 gccaagttga cattcagttc cgccccatcg ggcagactcg cgtcgcctgt gttctcgtag
3781 gcgatcggac tgaacaggac gaaacgcaga tcctttccgt catctttttg cacctgcgtg
3841 taacgctgaa ccagcttgat caattcatcg cgatgctcgc cggctttttc cggtccggca
3901 aacgactcgt tgaaaccata aaaggtgaaa acaacatttg gcgcgacgtg cttcaaatat
3961 tcggtgtcat tggtgaaccc ctggtttctc gggcgttgat taacacgatc ccccgaaaaa
4021 ctcatgtttc gaaaactgac gtcgagccct ttcaactgat gctgaagaaa ggtttcgacc
4081 caaggatcat gctgcatccg atcggccagg ccgttgccat agatggcgac gacatcgttc
4141 gattggaact caaaagcgtc ctgtgcatca gctcgttgcg cgatgcccgg aatgacgccg
4201 gcacatagaa agaaaaggct ccattggagg acagtcgtga tcctttgcgg cattgttcgc
4261 tttctcagtt gtcgaatctt ggagtgcagt ttcagctttt gttctgcagg ccagcgaaac
4321 tggctgcaaa cgaagcatcc taacgtggag tgacggaacg tgcacacgtt ttccgctggg
4381 gtatcgcgga ctggtactgt aacttcgtaa agaacagaca acagcatcaa agaacgacgt
4441 aagctgttca cgaaagcgaa tcccctgtcg atatgcgaca gccaacgaac actttgcgac
4501 gttcactgaa ctggggcgtt aaacaccgta ccaccgtctg gacaagtcgc ggattaagct
4561 ttcagctcgg aacgctcccg cgagtcttca agccgtggct caatttatca tgcaacggat
4621 tggtcgcttt tcgtgtctct ggtttcctca gcattcccta tccggcggtc cctcataacc
4681 gttctaggtc gaacgcgtgg ccttcggcac aaggagtttc ttcgcttctc ccattgacgc
4741 tcgtgctccg tccgattgtt ggtgcaaaag tctcttcagg cagcgaatgg atggcggtcg
4801 tcggaccgtt tctttgaaaa ccagcgccga gcagtggcac aggagcggaa cgcatgaaga
4861 cgaccaacgt ggaccatttg aagaatgatt tggcactgaa ccagatcatc gaatcgtgca
4921 actcgattca gacaaccgtc cacaaggctc aaatgcttcc agtctcgcgt ggagaaaccg
4981 gacctctccg atggcgtgaa tcctgtgaca agaaaatcgc gatcgctgga gacatccttc
5041 agttccaact cacgttggag atgccacagg tcactgacga gtgcgacgtt gtgcaactag
5101 cactttcgcg aagctacggt cccaacttga tgttcgatca aattgtatcg acccacatcg



5221 atgcgatcca aagtggttcg accgttgtca ttcggtacca agtccgcgtc gaccgcaccg 
5281 cggatccatc aatcgtcgct ctcgacaaca tcgatgtccg ctgggcgttc cgctcacaaa 
5341 gcaaacgcga tgctgcccgc aaggcacaca cggcttcacc aggcgtctcc gccccacacg 
5401 tcaccggcct cgttccggct gcccgtgcaa aagcgtcggc tttgacttca agcgttgttt 
5461 gaaacgtctt ccgaacagaa acgaaagatc tggcaatcgg cgtgctccat gccagatgct
5521 cagcgtctcg gggggctttt atcgacgttg actgacgcga gagtgtacgc atcctctgaa 
5581 cgtccttgtt cacgaagtcg ggcggcaatg cgttctgccc gagccgcgcg tttagattcg 
5641 gctcgtgcga gtcgatacgc ctgaacatcc gcctcacttt gcatcaatgc tagccgctgc 
5701 tgctgagtgt tccgatacgc cgagctcgcg atttgacgct ggcttgccaa cagttgttgc 
5761 tgatactgcg acgcgatcag gttgttttgt ggatagctgc tgaccaatgc gttctgccca 
5821 tatggatttt gggcatactg cgagccgctc gtgaatgaag tgggactgcc tgatggcggt 
5881 cctccacctc gagaacgaca ctggccaaat gcttcaccgc caaagaacgc aacggataac 
5941 aaagtcgctg tagcgactaa aacgccacgt cggaaatctt tcgcaccaaa atcatttgta 
6001 ccgaaattca ttgtgtactc catcgcgtcg tcgaagaatc aaaggcaacc gacgccgcca 
6061 acgcgacgac tcgtcctcta cctaaccaag gacactttcg cgaaaaaacg tcacgtgaag 
6121 tcgatccaac gcgggcagag cttattgaac cgagccggaa atcgaatgag attgaaccgg 
6181 ggatgagcga cgtctcccct ttcggctgtt ccctcaccgg tggaaaatgg attcgatgtc 
6241 cctctcctaa tttgtcagtg acgctgaaga gctcgtgcca tgaattttaa gacaacgatc 
6301 cttgctgcgg ccgttgcgtt tgcgtttgtc gccccccgaa caatccaaag taaagagcct 
6361 gccatgaaga ccaaatgggc cattgtgatt catggtggag ccggcagctc accggctcaa 
6421 cttggcgacg catcgagtaa gcaacgcacc aaaggtctgc aacacgcgtt gcaaaccggt 
6481 cgcgatatgc tggcggacgg cgcgacggcc atggacacgg ttgaagccgt gattcgaacc
6541 ttggaagacg atccaatctt caacgctgga cgaggttcag ttgtgacgaa cgagggacgc 
6601 gtcgagatgg attcatcggt gatggatggc aagacactcg cgtgtggcgc agttgctggg 
6661 gtgacacggg tcaagaaccc gatctcactc gcacgccggg tgatgacaga gacgaaacac 
6721 gtgctgctgg ttggtcccgg tgccgacgag ttcgcggaaa ctcagcaagt gcctctcgtc 
6781 gatccgaaat actttctgtc ccaacgcgat ggcgatgacg catccagcat tgcttcggca 
6841 actcaggatg aagacgaatc gcatcttgga accgtcggct gcgtcgtgct ggattcccac 
6901 ggcaacctcg cggctggaac cagcaccgga ggcacagcaa acaaactgcc aggtcgagtc 
6961 ggtgattcac ccatcgttgg ggcgggcacc tacgcggcca atggattgtg cgccgtttca 
7021 ggaaccggcg ttggcgaaga gtacatccgc aacagtgtcg cgtatgacat cgccgcgcaa
7081 atgcggtacg cgaatcagtc gctcgagtca gcagtcaccg acatcatgct gaatcgactc
7141 gatccgggtg ttggcggatt gattgctgtc tctcagcaag gcgagatcgt gatgcagcac 
7201 aacacccctg gcatgagctg cgcagcagcc gacagcaccg gacgtttcga aacgcacttg
7261 attctcgaca acggtggagc accggcggaa gcgacaacca aatcaaaccc atccgcatcc 
7321 gacgaatcaa agatcaggga actgattcag caacaggcca gcgattggaa tgcgggcaac 
7381 attgacgctt tcatgaaggt ctattggaag agcgatcaac tcacgttctc ctccggtggt 
7441 gacgtgacac gcggattcga cgcgacattg cagcgataca aagaacgcta ccccacttcg 
7501 gaaaagatgg gaaagctcac tttcacagac ctggaattcc ttccgctggg taactccgcc 
7561 atgcaagtcc tcggcgtttg gaaacttgat cgaatggagc cgatggaggg caagttcacg 
7621 ctcgttttcc gacgcttccc cgaaggttgg aagatcgtcc acgaccacac ctcaaagtcc 
7681 cccgactcgc aataggagtg aacgaccgcc gatcttcaaa gcagaaaggg aactcgctga 
7741 ttcgtttcca cccggctcgg gtggtagact cttccgtcac ggatcagtgc cacttcaatt 
7801 ccccttcgat cgtgacgagc atggttttct gctttggaaa gggcttccca acgcgcgtcg 
7861 gactgctgac gctggcatgg gtgatgtcga tctcaaccaa cgcattcgct atcgatttca 
7921 atcgagacgt gcggcccatt ctctcagaga actgcttcca ttgtcacggg ccgaatgcgg
7981 aatcacgcga ggccgacctg cgactggaca ctcaaaccgg ctcccgccaa gagctctcgg
8041 gaagccccgc ggtcgtgcca aatgagccgg accaaagtgg actgatcgag cggatttttt 
8101 ccgacgaccc tgatagtttg atgccgccgc ccgagtcgaa acgcaagctg tcggctcatc 
8161 aaaagttcat tctcaaatca tggatcgccg aaggcggtga ttacgaacaa cactgggcct 
8221 tcatcgcacc gactcagcca agcgttcctg agattgatca agccgcgagt tcttctcatc 
8281 cggtggaccg cttcatttgg aaccgattgc atgaacgagg tcttgagcct tccaaaaccg 
8341 ctgatcgacg cacactgatt cgccgagtct cacttgacct gacgggactc ccgccgacgc 
8401 cggaacaagt caattccttc gtcaatgagt ctgctctgaa tgcctatgag cggttggtcg
8461 atcggctgct tgcttccgaa cagttcggcg aacgttgggc gagaccttgg ttggacttgg 
8521 ctcgctatgc ggactccaat gggtttcaag ctgaccaatt gcgagacagt tgggcctatc 
8581 gggattgggt catcaacgcg ttgaatgatg acatgccgtt tgatcagttc acgattgaac 
8641 aactcgccgg tgacttgttg ccgaatgcga cgatcgatca aaaatcgcaa caggcttcca 
8701 tcgcaccgtg ccgtgcaatg tcgaagccgg cgtacaccca gaagaaaacc gggtgaacca 
8761 agtcatcgac cgcgtcaatg cgacggcgac agcatggttg ggaatgacca tcgaatgcgc 
8821 gcagtgccac gatcacaaat acgacccggt cagccaaaag gaatattaca agctcttcgc 
8881 gttcttcaac aacacaccaa tggaagtcga aaacccaagc ggcaaaggcg tgtcgtttga 

8941 cttctggggt cccaaaatgg acctgccact acctccagat caagcgagca atcgaagccg
9001 cctgcaagcc gaaatcgctt cgctcgaagc caaaagaaac gcggtgttag aagcaggtga 
9061 tttcgagaac tggacacagc aactctcaat cgcgctcgac aaccaacctg aattcaaaac 
9121 gctttcaatc gcctcggtca gttcgaccgg tgacgaagag atgaagattc tcgatgacca 
9181 gtcggtcttg ttcacaggca agcttcccga caagactgtc tacgagatcc actgtgatga 
9241 gattcccgca gatgtcatcg gatggaaaat cgaagcgttg actcatccga gtcttcccgg 
9301 aacaggcccg ggccgcggcg atgccgtccg gtccaacttc atcttgagtg aattcgaggt
9361 agagattgtt tcgaaggaag gcagccccac ccaacgtctg gacctgcatt cagcatccgc
9421 cgactattcc cagcccgcat ggaacgtcgc caacgccatc gacggtgatc tcaaaacggg
9481 ttgggcgatt gcatcgaaat tccaacaatc gcactgggca cagtttctac ttcaaacgcc
9541 acttcgccaa agtgaagact caacgttgcg attccgcttg gaacaacgct acggaagagg 
9601 ccgaacgatt ggtcgcgtgc gtctttcagc gctgtgcggc aatccactcg caatcaacgt 
9661 ccccgaacaa gtcgcgaaag cggccaagaa acctgcgaaa gatcgatccc ccagtgactt 



9781 gattcaaaaa ctgacgaagc aactggaatc aacccagccc gccaccacgc ttgtcatggt 
9841 ggagatggaa gagccacgcg aaacctttgt gatgcaacgt ggcaactact tgtcacctgg 
9901 ggaccgagtt tcaccgggga cgccgatggc tcttcatcaa atgaacgagg accttccaaa 
9961 gaatcgcctg gggctagcac gttggttggt cgatcccgcg aatccgttga ccgcccgtgt 
10021 gacggtcaat cgctggtggg ctcaattgtt tggtcaggga atcgtagaca cgctggaaga 
10081 ctttggaacg cagtcatcgc caccgacgca ccccgatttg ctcgactggc tggcccttga 
10141 attggtcgaa tcaggttggt cgatgaaaca tgtcctgaaa acgatcgtga tgtcggaaac 
10201 ctatcaacag gactccaaag ttcgggcaga tctgatcgag gccgacccgg cgaataaatg
10261 gtacgcgagg ggaccgtgtt ttcgcatgtc agcggagatg attcgcgaca atgctttggc 
10321 ggtcagcgga ttgctctccg caaaaatgca cggtcctcca atcatgccgc atcagccgac 
10381 gggaatttgg cgtcaagtag ggcggaacga accgaagtgg atcgcggcaa cggatcaaga 
10441 ccgataccga cgcggcgttt acgtggtttg gcgacgcgcc gcaccttacc ccagctttgt 
10501 gaatttcgat ggtcccgatc gaagttcttg cgtggtgggt cggccgagaa ccaacacgcc 
10561 gctgcaggca ctgacattgc tcaacgatcc ggcttacgtc gaaatggctc tcgctctggc 
10621 aaatcgcatc ttatgtgaac accctgacgg gagcgactcc gatcgacttt cggcggcatt 
10681 cgaaatcgtt ttgtcgagac gaccagcccc ccaagaatcc gctcggttga tgacgttcct 
10741 tgatgagcga aagcagcacc ttcattccaa tcccgagctt gccaagtcgc tcgttcgaga 
10801 gcaatcgttt ccgaacaaca ctgtttatcc atccacagaa gaattgggcg catggttcta 
10861 cgtcgcgaac gtcctgttga atttggacga gacaatcacg aaggactgaa acctccccgc 
10921 aaatccctcg actcattgag atgatcgtcg aagatgaaaa cctatcgatc caaacaacga 
10981 atcaatgtgt ccgctgcggc agcaatgtca gcgaaccaga cgcgtcgaca actctttcag 
11041 cgtgccggca tgggcctcgg aggattcgct ctgacatcgc tgttccagtc tgaacaaaac 
11101 gcagcgttcg gacaggagaa aattggatcc gcggttggaa cacaccatcc agctcgcgcc 
11161 aaaagtgtga tctacatcca catggtgggc gccccgtcgc atctggatct gttcgattac 
11221 aaaccggttt tgcaggagcg ttcgggacaa ctatgcccgg atgagttctt caacggaaag 
11281 caattggcat tcattcgtga acaaccgaag ttgtttggca ccccaacgga ccaagctttc 
11341 caattccaac gttgtggaga atcaggagca caaatttcga acctgatgcc caacctacag 
11401 acggtcgccg acgaattgtg cttcatcaaa acgctgcata cggaccaatt caatcacgca
11461 cccgcccaaa tgtgtttgct cagtggtttc gagcgattcg gccggcccag catcggttcg
11521 tgggtcagtt atgggatagg aagtaaaaac aagaacctac cggactttgt cacgctgatc 
11581 acgggtcaag ttttgggagc gggcaacagt gcatggggaa gcggctttct gccaaccgtt 
11641 catcagggaa ttgaattccg cagccagggt gacccggtcc ttttcttatc taatcccgag 
11701 ggcatgcggc cgtcggaccg tcgaaaggtt gtcgatgccg tcaaggatct caatgctttg 
11761 cggttggatg atgttggtga ccctgaaatc tcgacccgca tcagccaata cgagatggcc 
11821 tatcggatgc aaacctccgt tcccgagttg atgaacatcg ataatgaaac ggccgacgtt 
11881 catcagatgt atggcaccga accgggaaaa acgagcttcg ccaacaattg cttgctcgcc
11941 cgccgtttgg tcgaacgtgg agtacgtttt gtacaactgt tcgaccaagg atgggaccac 
12001 catggaaaca tcgagaatcg cttggcggca aagacgaaag aagtggatca accgatcgct 
12061 gcgttgatca aagatctcaa acagcgaggc atgctggacg atacgctcgt ggtctgggcc 
12121 gctgagtttg gtcgcactcc gatggcacag gcgagcaacg gtggtggcac gccgacgaaa 
12181 gtcggacgag atcaccacaa agaagcgttc acgatctgga tggcgggcgg cggaaccaaa 
12241 cgtggtcaca cccatggtgc gaccgatgaa ctcggctacg ggattgtaga aaatggcgtg 
12301 cacatccatg acttgaatgc caccattctt catctgctag ggatcgatca ccaaagactc
12361 accttccgct accaaggacg ccagtttcga ctcaccgacg tacacggaaa cgtcgtcaac
12421 gacatcatca gttgatgcct ttgaaacggt cattccggag agtgcatcat tagggcaact
12481 gccgagtgat cgtaagtccg ttcaattcaa tcatggcacc ggcttggtcg gttcgctgca
12541 aagcgatcgt tgagacaccg gcggcaaggt caacgactcc gagtgaatgc tgttgaaatg 
12601 tcagcattcg ccgaggagtc gccttggaat ccaaatcagg acggacaacg ctcttcatcg 
12661 tgaatcgcgg cagcgttgtg gtggcctcct ggtcgccggc cttcagcaca atctcggccg 
12721 aatccgaatc gcaaacgtag tgaatcacga cttcgtatcg accactctct ttcacctcaa 
12781 taggccaagt gattcggtct cgcgtcgttg tccagcgatc agcccaactg tgtgcccacg 
12841 aaattccatc cgcaaacccg acgccgcctt ccagcttcgc atccacagcg ggcaagtaaa 
12901 cggaccgcat cgaatcaatg ggcaccggcc gaatcgctgt gatactgggt gtaatttcct 
12961 gaacatattt ttggatctgc gatcgcaact gcttcgtcag ttccggatga ctcgacgcga 
13021 tatcggtggt ttggccggcg tcggtcgtca tgtcgaacaa agacgccttc gacttctcaa
13081 tcgtcaatcg gtgagtattg gttcgcaccg ccgccttgcc aaacttctgc aattgcatcg
13141 gattggggcg gtaggtcagg atcgaacgat cggcaagtgt gggatcggcc ccatcgcgta 
13201 tcaattcgac caagctcctg ccgtccagcg gcacctttgt cggtaaagga atatcacacc 
13261 actgcatcag cgtcggcaac agatcgatgt gagccgcaac ttgcgaaatg ctttgcggct 
13321 gaatgtttcc tgtccaacgg atgaaacaag gcacgcggca acctccttcg tgaacgctgc 
13381 ctttggttcc tcgcatgcca ccgttgaaac gcttcccatt gggcccgttg tcggtcagaa
13441 acacgacgat ggtctcctca tccaacgaat ggtccgaaag acactttagc aatcgagaca 
13501 cattggtgtc gatgttctgc accatcgcgt agaccgctgc tgttttctcg tcgatcgaac 
13561 cgtcgttgta tcgatcgaat aaatctcgtc tcacctgaaa cggaccatgc ggcgcgttga
13621 acgggacgta gcaaaagaat ggacggtcgt gatggttttg aatgaactcc accgcggcat 
13681 cggtcaacac gtccgtgatg tatcccttgg tttgcactgg cgtgccgttt cgttccaaca 
13741 acgcatcgtc gtacagattg aaatggccgc cgcagaaacc aaagaactca ttaaaccctt 
13801 gtccgttcgg gtgcagcggc atttgggcac cattgtgcca cttcccaaag cagcccgtcg 
13861 catatccggc ggaacgatac aattcggcga gcgttgtctc ttccgctctc atcacttcac 
13921 gccttccggt gacaccggcg actcccgagc gttcgggata acgtcccgtc agcaaagccg 
13981 ctcgtgtggg agcgcaaaca ggagagacat aaaagcggtc caacctcgca ctttcattcg 
14041 ccagagcatc gagcgtcgga gttgagatct ttggattccg gtgcgccgcc aaatcacccc 
14101 aaccttggtc gtccgtcaaa atcaaaagca cattgggacg atcggaagcc tctgcttgga 
14161 tcgaagttcc tgcaccgatc acgaagagaa ccatcacgac ggtacgacaa atcttctttg 
14221 cacaaaaaat cgtctttgcc attctgattc tcagtgtgtc caggattggt ggggcggggt 



14341 gcgagcacag tgttagactt tcaaccactt 
14401 ggctctcgaa tttcccgagt tgatcgaatc
14461 gccatcggac gagttggtga agttcatgca
14521 cattgccgga aaaatgggcg tcagcctggg 
14581 aggcctgcaa aagaacgttc tcggggtggc 
14641 cctggaatct gctggcatca agacgatcca 
14701 caaccttcct tccgtgccca atgtcctgtt 
14761 cgacgaacct ttgacttggg ccatgaacac 
14821 tcgcgattcc aacatcgttg cgttttccac 
14881 tcaaccaccg aacgaagaca ctgcaccggg 
14941 ggggcgagaa cggatgtttg agtacggcag
15001 ccggcttaac tattcgattg acctgaagta 
15061 ctggaacgat caaccggtcg acaattctgt 
15121 tgccaaccag caagcgttga tgtgtttggg 
15181 tgtcactggt ccagaaacat tgctcaccga 
15241 ggacaagccc gttcgcttca caaccactcc 
15301 ccgagcaacc aagctgtttg gggctccatc
15361 agcccattgg gtcaagaccg gtggtcgctc
15421 cgacggagcg tattgaaccg atgcagatca 

15481 ttcgtcgggg agtcgttatc cctgcgcaac
15541 attcgcgata ccaaaccgcg ttgactcgtt 
15601 ccgttggtgt tcattccacc cagttcgcca 
15661 cggtgctgcg tctggcgtcg gacgtcatcg 
15721 tattcaaggt cgctggtgtt tgcggaaaaa 
15781 ccgtttctca gcgataccac gcgtgtctgt 
15841 tcgacgagtt aatttctcat tgccgagaag 
15901 acttgcaacc ggccgtcggt ggttgtgtgt 
15961 agattgagaa cgtcatcgca atcaaactgg 
16021 tggttcgtgc ggtttgcgac gccgaccgcg 
16081 atgacaacat catcgcggac ctgttgactc 
16141 aacgggtccg aattcgcggc ggtttgctcg 

16201 ttgaactgct cgatgaaatt catgccgtac
16261 tgctcagccg caacatccag gtcacggatt 
16321 gttttgctgg ttgcatccct ggaatccacg 

16381 gaacatggtg cctcgatcca aacgaagtcc
16441 gtgtgatcgc cgcctaccct cacctccatg 
16501 ggtggctgcg ttgaacatgc ccaccacaaa 
16561 tttcactgga acagcgagat gcaaccgatc 
16621 cttgtgatgc tgctggttgc caccgcggct 
16681 atcgtttcgg atgaccaggg gtacaacgac 
16741 cccgctctgg atcgacttgc caaggaaggc 
16801 cccgcctgca caccgtcacg agccagcttg 
168 61 atctacgaca tgatccgcaa cgaggccccc 
16921 tacgccgtga ccttcgaacg catcggcgga 
16981 gtgctgcggc cggccggtta caagagcggg 
17041 cagcgaatgc tgccaacatc gcgcggcttt 
17101 attgattact tcacgcacga gcgatacggc 
17161 acagaagccg acaagggcac ctactgcacg 
17221 ttggatgagc acgcgggcaa cgagccattc 
17281 aattcatctt ctttggttcc gaccattcgt 
17341 gcgatgtatc cgccggttga ggtggaaacc 
174 01 cccgcaacgg tcgcgacacc gcaagctcgt 
174 61 atggacgcgg caattggcga aatactggac 
17521 acgattgtcg ttttcttctc tgacaacggc 
17581 cgaggccaca aggctcaaac ctgggagggt 
17641 ccggcgggac aaatccctgc cggcgtggtg 
17701 ctgccgagct ttgcagcagc cgcaggtgtg 
17761 tttgattggt ggcccactct tcgcggcgaa 
17821 aaacgccgcg atcaaatcgg agcaagggtc 
178 81 caaggcggcc tgttcaatct cgaagaagac 
17941 aaaccagaga tcctcgcgat ggtgaaggaa 
18001 gccgctgacc ctcgtggccc ctttcgagat 
18061 gagcgaagga cgcggcactc ctggaccaac 
18121 ttggcggtgc ctgattggtt cgccaaccgc 
18181 ctcgcgaaac aattgcatcg caggagaact 
18241 caagcgattc ttggaagacg ttgtcttgtc 
18301 acttcagcat tggctggtgc catccattcg 
183 61 gaagatgcgc agaaacggtt ttacattggt 
18421 cgttcttgtt ggattgttac taccagcagt 
18481 gagctgcagc aacaacttca agcaaatcgg 
18541 caacaagctg ccggcacaga aaggtggcac 
18601 aaaccaattg cgtcttggtt ggttgcctac 
18661 gtgggaacaa atcagtcatc cgttggcaat 
18721 ttggccgggc atgggaccag caccatggcg 
18781 ggctgcgctt cgttgcccca gcgatccagg 



cctttggctc acacacaagc aaccaaccat 
cgaagcccaa cttgacgcac tgctcgcccg 
acaactcgat ggcgacctga tcatcttggg 
acaactggct gtcgctgcca ttgagaaagc 
tcgcttttct gatgcggacg ctcgcgctcg 
gtgcgatttg ctcaaccgag aagcggtggc 
catggccggg cgtaaatttg gcaccgaggg 
gatcgttccc gcgaacgtcg cccatcattt 
cgggtgtgtt tacccgctgg ctcgtttgga 
gccgattggt gagtacgctc aatcgtgcct 
cctcgagtgg ggaacacgcg tttgcctgta 
cggcgtgctg catgacatcg ccaccaaaat 
ccaagccttc aacgtgattt ggcagggtga 
gcagtgcact tcaccggcaa acgtcctcaa 
agatgttgcg atgcaactcg gggaactgtt 
aggcaatgcg tcttacttga gcgatagttc 
cgtcaacgcc gaacaactga ttcgttggca 
gctgaacaaa cccacccatt tcgaagtcag 
gtgacttatc acccaccatt cgcgaaaatg 
cgttggcact ggatgccgat cggcaattcg 
attacatcga cgccggtgcc ggtggcatcg 
ttcgcgatcc gtcgattgct ttgtatgaac 
atgaatacgc gggtgctcag ggccgagaac 
caaagcaggc aatttgcgag gcggagttcg 
tgagtctggc ggcattagcc gacgaatcaa 
ttgcccaggt catgccggtg attggcttct 
tgccgtatga attctggcgg gagttcgctg 
ctcccttcaa ccgctaccaa actttggatg 
acgatgcaat cactctttac accggcaacg 
cctatcacat cgtcacggac cacggtgaga 
ggcactggtg cgtgtggact cgaacggccg 
tggatcgcgg tggcgacatt ccgtcggaac 
gcaatgcggc tttcttcgat gcggccaacg 
aagtgcttcg tcgccaaggc ttgctggccg 
tttccagcgg tcaatccaaa gaaatcgatc 
acaacgcatt cgtacgcgaa aatctgagcc 
cagaacaaca cgaaacgcct gacacctagt 
cgatcctgga cgaaccacct tttgatgacg 
tcggcggccg atcgcccaaa catcttgctg 
ttgggtcaac tcggcaacgg aatcatcaca 
acacgactga caaatttcta tgtcgcttgg 
ctgacgggcc gctacccgca gcgaaacggg 
gactacggtc atcgctacac gcccgaagaa 
atggatgagc gagaagtcat cctcccggcc 
atttttggca aatgggactt gggtgctctt 
gatgatttct atgggttcgt gaacactggc 
gtgccctgca tggttcgtaa tctcgaaccg 
tacctgtttc aacgcgaagc cctgcggttt 
tttttgtatg tgccgttcaa cgcaccgcac 
tcgtcggtgc aggccccgga tcaattcaaa 
agagtcaccg accgctatcg ctacggctcg 
cgccgcgact accgggcagc ggtcacctgc 
cgcctggaag cgaaacaaat gctcgacgag 
ggcagtggcg gtgcggacaa ctcgccgtta 
ggcatccgtg tgccttgcct cgttcgttgg 
aacgacgaat. tcctgaccag cctggaattg 
gaaccaccgc ccggcgtcgt cctcgacgga 
gccgagtctc ctcgcaagga aatgttctgg 
ggccaatgga aatgggtcca aatgggtgaa 
atcgccgaaa agaacgatca gtctcaagcc 
cgctacgcca attggcttcg cgaaatggaa 
ttctgacagc cctcgatttg cttctcgcga 
cccgggtcgc tcacttggga ccgcccccgt 
tggtgaaaag ttgcactcat ttttgagatt 
taatcccctt agaagtggca caacgagcac 
taccactccg gcaccacagt cgaattctcg 
cgattcagct ctcggggaga gtcttatgaa 
ggaactattg gtggtcattg ccatcatcgg 
gcaagcagct cgggaagcgg ctcgacgaat 
cctcgcactt cacaactacc actcggctta 
gaacaacgcc aacgacggat cgctgggcgg 
gattttgccc ttcatcgaac aacaaggatt 
caacacggac ggaacactcc gtaccacgcc 
aacggactac acgccttgga tcactgaatt 
aaccggtctg cccgcactgg gccgaaccaa 



18901 caatccactc gtcatgaatt cgggatacgc ggaacaaatc cgcatctcgg gtcgcggcat
18961 gtttgtcgct cgtcagtaca ccggcttccg tgatgtcttg gacggattgt ccaacacgat
19021 cgccgccggc gaaatcgcga ccgatttggg tgaccgcgat atccgaaccg cacccatcga
19081 cggtcctgga tcggctgtcc tgagagacaa tccttcttgg gctcgcgaca acgacaagat
19141 tgatccggaa cgaccacagt tctggcgtgc cggcgaaaac ttgacgactg acaacgctgc
19201 cggagacggt gcccgtgcaa gctgggcacg tggttttcac tgggccgacg gcgaaatgca
19261 gcactccggt atgaacacca tcctatcccc caacagtgaa accgtctcac gcaccacggc
19321 cgacagcacc tggggcatgt acccgcccag ttcgcgtcac caaggtgggg tgcacgtcct
19381 gatgggtgac ggagcggtga agttcatcac cgattcaatc gaagcaggaa actcgcgtgc
19441 tcacacggtc tacaaagatc acaacccagg caacgcagga actcagagcc cctacggact
19501 gtggggtgct cttggcacca aagccaacaa agaaacggtc tcatttgact cgctttgata
19561 gccggaaagg catttgctcc ggcaaatgag aattagaagc gttgttgcaa tctcgcagca
19621 acgcttcctt cctttccgat ttcttcacgc ttctcttcat tcaatttgca accagatgac
19681 caataccaaa cactccaacc gattgatcac ctttctgtcc ctgatgtgcc tctcgttact
19741 ctgtgctgga tgcgacaaca acgaatcgac tgtcatccag caacccgctg atgcggcaca
19801 ggagatggaa gactacgaat cgatgatgaa ctcatcggcg aaagaagctt acggaaagta
19861 ggccattcgc tcacatcgcc atcaacgctg gatccgatca cgggtccagc gttttgcatt
19921 gagtgatcgc cctcggtcga tccaatcgct caagatcttc aacaccatcg atcacagaca
19981 agactgccta gcaggctgtt gatttattga gccgcaccgc gttagcggcg gttgattaga
20041 cgccaaccgg ggctaacgcc caacggctaa tgattgtcat tcggcatgcg actatatcaa
20101 cagcctgaac cgttcaatcc ttcgcaaccg cgttgggact ggctttcaca tccgattggg
20161 tcaaaccgac tggaccgtcg agtttgcctt tcaaacgtgt cgatgcaaac gtggcgttcg
20221 ccaacccgac attcagtttc agatccgcgt aggtgtactc ctcttttaac tcgggtgaat
20281 caccttccgc tgcgggccaa tcgtaaactt ccaaacgaat cggaacgtgc agttcatcgt
20341 cgatgtaggc gttggcacga tggaatgtga caccgccttc tttggatgga tgaacaatct
20401 gaatgtgagt ggctggccgt cccatcaccc gagcattccg aaaataggtc acttcggtgt
20461 tgtcgccggt gggatcgttt tccatgtcga attcaatcag tccgatcaat cgactcatga
20521 tcttgtcaaa tccaatgtcc gtgatcggat agttgctttg ctctttggcc tgacgcccgt
20581 gagggtcaat cgaaagcacc aaatagctca gcgccccgcc accttttcgg acacgggctt
20641 tgccttcgtt ctcaccttcg atgtagagaa ctcgtcgatc gcgtagagta cgtggggatt
20701 gatactgcaa gaagacggac agcggtgaaa catcgccttg ctcattggac tgttccactc
20761 tgacggcgag gtccatgatt tgaggcgttt gcaactcacc atcgattcgt tcccgtttga
20821 tcagcttgca actgtaatcg ctgacattct gccgaatgtg gtcggcatga gccttggcaa
20881 acccaagtgc ccaacccaaa gggtgatcat cgatcggtgg gtgcaatcga tgatcctccg
20941 cgttgcccgc cgtgcccttt gcatccatca acggagccaa agcgtctctt gtcgagaacg
21001 acttcaccgg cgatgtcgct gacacttgtt ggatgacgtt ggcctcgtcc aaacccagtt
21061 ccgcgattgc ctgcttggcc aactggtccg tctgatcttc taccgcgatc ttctgcaccg
21121 acactcgacc gatttctgat tcatccatct tgaccgcact ctgggccacc gatgcggttt
21181 cgtcggcgca aacgggcaat gagttggtaa gcgaccccaa tacgacgcac gacagtgtca
21241 aggtggatgc gacgttggcc gaagttcgaa ggcggcgggt catggatgag gtcggcatag
21301 aagatggctc caaatggcgg tcgaatttcc gttcgtttcc ggtggtgctc gttcagccca
21361 agcggcgaag tccgtttgtg cacgaaacgt gccgattgct tgcaaaaatc caagtttcag
21421 attagcgacg ccccatgggg aaacaactga aaaggagcaa tctgctctgg aaggaaagag
21481 tggacgtctg gaccaccgga aaaacccgca tagccttcct accgtttcgc cccacaaaac
21541 cggagccgag gtctgtttca cgacgctacc aacatcgctc gcagggcaat caaccactcc
21601 agtcagtgag aggctgcttt caatttgaca tcgccggatt gcattctgag acgcgatcgc
21661 gaaacgcaac gtgaaactcc cattcgagcg acgggcagga aaaacttcct ccgccgcttg
21721 taattgttac ctccgcgatg tcatcctttg gggtcgtgat caatccgtga aacgaacttc
21781 ctctttcctt cgtcaaaggc atccatgggc atcgtcatcc cactcgttct gatcttcctc
21841 acctgcctcg tcatctggcg agcctgcgat ggatttgaaa tcgcgagtga gtacatcgga
21901 cgcaatctct ccgaaggcgt tcgcggcggc acgatcaatg ccatctccag ttccatcccg
21961 gaactcttca caactctgat cgccctgttc gtgctttccg accgagatgg gttttcgatc
22021 ggcatcggca ccaccgccgg cagtgcgttg ttcaacggca tgattattcc cgcggtatgc
22081 attctcagcg tggttggctt tgtcgtgatg ggcgttcgtg tcacctcggt caacgtttcc
22141 acacgcgtcc ttcttcgcga tggaatctcg ttgattcttt gtgagttcat tctcatcctg
22201 ctgatcaatg gagaacaact tcattggtgg caaggtttga tcttgatgct gatgtacgcg
22261 acttacttgg tgtacatgct gacgacgatg aaatcatctg agttgtccgc caacgacgag
22321 caaccagaca atgatggcga agaagaggat gagtccccac gcggcccgct cgccagtttg
22381 ttttactgga tctccggagg cccgctgttg gacttggaac gatggttcgt caaagaccat
22441 catcgcgaac agatgaaaga agaaacctgg aacggctggg gattgctgtt gaccagcacg
22501 ggagtcatcg ccatggcctg ctggacattg gtattggcct gcgaatggct cggcagcggc
22561 ccggccaacg ttgagaatcc atcatacgaa ttgttcggac aaaccttcga aggcctgggc
22621 atgccagcca tgttcgtcgc ggtcatcttt gcctcgatgg caaccagcgt tcctgacacg
22681 gtcatgtcca ttcgcgacgc tcgtgatggc gactacgacg atgcggtggc caacgccctg
22741 ggcagcaaca tctttgacat ttgcttcgcg ctcgggttcc ctttgttcct gtacacgctc
22801 atcaacggcc ccatcgcaat ggagcccgag atcgcggctc agagcggaga actgcgtttg
22861 tttctcttca tcctgacgat catcggtttc ttgatttact acatcggcaa acgaggcgtt
22921 gcggctgatg gaaccaagta cgtcgaaatg aaacgaggca aagcgtttgc tttgctgacg
22981 atgtacggtc tgtttgtgat ctacatcctg gctcacgaaa acgatgtcga agtggtccac
23041 cagatctccg atcacctgca aaccttccta aaaatgctgc cagcgatcgg agccattgtt
23101 tagcagcctg ttgatttagt cgtataccga gtgacaacaa ttagccgatg ggcgttagcc
23161 ccggttggcg tctgcctaac cgccgctaac gcggtacggc tcattaaatc aaccgcccgt
23221 tagcccgctc ggactggacg atccccaaac ctgaatcatc aactcacttg ttgtagactg
23281 attggtcatg tttttcgccg cgtttgcttc cctggaatcg tggcgacgtg acgaaatgga
23341 tcacctacag aaagtcgttc gagttgacta cttggaagcc acgtccgttc ttcttcgcga



23461 ccgacgtgct cgcgtcgctg acaacgcact gcattgattg ccacggtccc gaatcagctg
23521 aagcaaatct gaggattgat caactgacca gcgatctcgc cgatcgcgaa aacgcactga
23581 aatggatcga gattcgcaac gcgatcaatc tcggcgagat gcctcccgac ggcgaaccgc
23641 cacttcccgt cgacatcatc tcacaaacgt ccgagtgggt ttcaggcgag ttgagagaag
23701 tggatcgatt gcgaactcga gcggacaaac aagttcgtct gcgtcgattg aaccgccatg
23761 agtacacgca cacggtttcc gacctgttgt cgatgaagtt cccaaccggc gaaagtccac
23821 tcgatacatt gccacccgat ggaaccgcag aaggtttcga caaagtcagc tcggcactga
23881 tgatcgaccc ttcgctgatg actcagtact accgagtcgc tcgccgaatc gcggaccgtg
23941 cgattgtgga cggtccaccg gaatacccga cggaaacaat gcggctggag tttgaggaaa
24001 tcgctgacag tcatgcgatc ggatatctcg tcacacgtcg gggcatgaat ccggtccccg
24061 gtggactgca attgatcgaa ggcagcacgc gatcttttgg gatgcttcgt tacccgggtc
24121 gcaaggacaa caacgttgca ccaatcaatg gtttctatcg attcaccatt cgtgcgggag
24181 gttccccggg tgccgacggt gaagtgccgc gaattcgttt gaaacacaac catcccgatg
24241 actccatgca gacgatcatg gagtacgacg tgacggcacc ttgggactct cccgccgaaa
24301 ccagcgtgac gattcctcga gacaccttgg gcggtgaact caacattgaa atattgaacg
24361 acacaaagct gtccatgggt cagcgtcccg gcgaaaactt catgagacga atcaatgaag
24421 tgggtgaagc cggcgacttc accgagtcgc ttcgcctggc cggtcgcaag gtcgccgagg
24481 gctggggcgg agatcgttca acacccgatc ccgagaaact cgacctcaca aaattcccgc
24541 gagtcttctt ggaccatttg gaagtcgaag gtcctctgta cgaccaatgg ccgccgaaga
24601 gtcacacgac ggtcttgttc cgcggcgaag caggaaccga tgtcctggct tacgctcgcg
24661 agatcctttt gcgacttttg ccgaccgcct ggcgacgccc ggtcgaagag agcgaaatcg
24721 aaccgatctt gggagtcatt caatcggaac tcgcaaacgg cgaaacattc cacgaagcca
24781 ttcgcgtcgg attggccgcc acattgacgt caccgaactt cctctacctt gcggaaggca
24841 ctgggaatgc cgccgacact tccgtgctga ccaaccacga aatcgcgtcg cgaatgtcgt
24901 acttcctttg gtcttccatg ccagaccaag agctctttcg cgctgcccga gacggcgaac
24961 tcaccgaccc caaccaccga aaacagcaag tcgagcggat gctggcggat cctaaaatgg
25021 atcgttttgt cggcagcttc gctcggcagt ggctgcagac cgacacattc ctgaccttca
25081 cgcccgaccg atccctgtac cgggaatacg acgagaaact gtccgaagcc atcgttcgtg
25141 aaccgctcga attctttcac gaaatcttac gcagcgatcg aagcgtgctg aattttcttg
25201 actcggactt tgtggtcatc aacgatcggc tcgctcgaca ctatggcatc gatggtgtca
25261 gcggcgagga gtttcgcgtg gttgacttgc catcgggttc cgtccgaggt gggttgcttg
25321 ccatggcggg tgtgcaccaa gccggttcgg atggcactcg aaccaaaccc gtttcgcgag
25381 cggtgtatgt tcgagaagtc ttgttcaaca acccgcccga tccaccgcca cccaatgccg
25441 gcgaagtcga acccaacatt caaggccaac gtctgaccgt ccgagaacgg ttgctgcaac
25501 atcaacaaat cgaagcctgc gcagcctgcc atcgctcgct cgatccctac ggtttggcat
25561 tggaaaactt caacgtcatc ggccaatggc gtgagcaaca agacggggaa aattttcgcg
25621 gtcgcaatct gccggagatc gacgccagcg gccgactccc caacggcaac gaattcacag
25681 gcttcgaaga tttccggggg ttgctgctgc agcagagcga tcgcttccgc cgagccttgg
25741 cggaaaaggt gtttgtctac gcatttggcc gccccgttgg accggaggac gacgcccccc
25801 tgactcgcgt cgtcaccgac atgaaagcgg aaggcgacac cttccgtgca ctgattcaat
25861 ccattgtcac cagccaggca tttgtgtccc gatgagtaaa ccgtcctaca aaccacgact
25921 caagacgctc aaccgccgga tgcttctgcg tggtgccagc acctgtttgg cgctgccttg
25981 gttggaagcg atgttgccac gatccgccgc cgctgcagaa tcgaccaaag tcgtcccacg
26041 gatgggcatg ttctactttg gcaccggtat gaacatgcgt gagttctacc cagaaggcta
26101 tggcaaagac gcgaagatgt ctcgcatctt gaagccactg gaagagcaac gcgatcagtt
26161 caccgtgttg tcgggaacct cgctcagcca cggcggtggt cacgatggtg cctacccgtt
26221 tgcgacatcg attgctcgcg gcgaaaaaca aaccatttcg cctgaccaat tggctgccga
26281 agtgcaggga aagcacactc gattcccgtc gctgcagttg tcggtcaagc gaggcactgg
26341 ctttggctcg caagcattgg caaccatctc gtggaatcgc cagggcatcc cattggcagc
26401 ggagaatgat ccgcagacca tcttcaacca tctcttccgt ccgcccaact cacaacaact
26461 gctccaacaa agcgacgaat ttcggcaacg cggcagtgtg ctcgacgcag tgctctcgga
26521 tgcgaaacga ctgcaaaccc gactcagcca atccgatcgt caacagctcg accaatactt
26581 ccattccatt cgcgaagttg aaacgacgtt gggccgcgaa atggattggt ctgatcgccc
26641 caaacccaaa cctaacctcg atgggtttgg tgattacgaa aaagccatcg cgcccgaggg
26701 caatgggaag tttgtctacg acacgtacgc gaaattgatg tacgacttga tcgcccttgc
26761 gtttcaaacc gattccacgc gcgtggtcag ctacgtcgtt cgaaccgaac tggccggcgg
26821 tgtgtaccca gaatttggtg tctccaaggg ttatcacgaa ctgacgcatc acggcaacga
26881 tccaaaaaac ctggaggagc tcgcccgggt tgacacgatt tacatgcagc actggagtca
26941 cttcctgaac cggttggcat ccattcgcga Q-gggg^SSgc actttgctgg acaacacgct
27001 gctggggttc tccagcggca tggggatcgg ccacagcaaa gacgttttgc cgacaatgat
27061 ttccggtggc agtggactcg gcatccgcca tcaaacgcat cttcaactga aagacaacac
27121 gccgctttct tctctatggc aaaccatggt ggaacggatg ggagtgaacg tggacggtga
27181 gttccaagac agcaccggaa cgatcgggga actcctcgca tgattcgcct gtgtatcaaa
27241 cgactgcacg ccgcactcgt gttggcgttg acactgacag tgcccatcca agcacaggaa
27301 cctgcacggc aatcgaaccc aaattcgatt ggcatgcagt tcgtcgcgat cgcccccggt
27361 gaatatcgcc gaggttttga cggttcaggg caacgtgatc gccaattcgc actggctcac
27421 accttcagca accctcaagc attccgattt gattcaccat cgcacttggt ggtgatcacc
27481 aaaccattcg agctgcaaac aacggaagtc accgtcgcgc agtttcgcga atttgtggaa
27541 gcgaccggct accaaaccga agcggaaagc aacggtggag ccttgggatg ctttccaaaa
27601 gaaaaagact acgtcgaccg cttccgaaag tcgcccgatg tgacctggag gactcccggg
27661 ttcgaacaat ccgatgatca tccggtcgtt gccgtcagtt ggaatgacgc ggtgaagttt
27721 tgcgagtggt tgtcggaaca ggaaacgacg acgtatcgat tgccaaccga agccgaatgg
27781 gaatacgcct gccgcgccgg tcaatcaaca tggtattcat ggggaagcga tccagacgaa
27841 gcctacgttc atgcgaacgt cgccgatggg gctctcgagg ccgctcaacc aaacacaacc
27901 cgctatcagc gagcggtcaa actagaagct gacgacggcg atggcgtggt cttcaccgcc



28021 gaatggtgcc aagatcgctg gtcggcggac ctctatcgac gctacctcga cgacgtcccc 
28081 tggccaaagc gatcccagtt caccatgacc gatccggtga tgttggaaga gacaccgcaa 
28141 catgcgtacg gagactggcg gtgcatacga ggcggggctt ggacgtgcgc cccggcttcc 
28201 gttcgctgtt cgatccgcac gtttgcagaa gcgtctgatg caacgatcta caccgggttt 
28261 cgagtcgtcc gagaatagca aacggatgcc accaacgatc gaaagtgata tcaacgtggt 
28321 gttgtcacgt ctgcctcgcg atgcaacagt tgtcttccat gtcggaacct caaaatgtca 
28381 gccactggat cgaccaagtc aaagacggcg attccatcgc cgccaaccag atttggcagc 
28441 attactatga ccgattggtg agatcagtcc gaaacaagct tttcggtcaa aaccgagcgg 
28501 tctccgacga agaagacatc gtcctgagcg tttttgacag tttctattcg gccgcgcaaa 
28561 agggccgatt tccggatttg tccgaccgag acgatttgtg gcgtttgcta ctgaagatgt 
28621 ccgctcgaaa agtggttgac aaaaaacgtc gtgaccatcg ccaacgacgc ggtggtgacg 
28681 ttcaactgca ttcgttggat aaacgcgaag acaaccagac tttcatcgaa gccatcggcg 
28741 acgagccatc acccgagatg attttgatga tggaagaatc ggtggagcaa tttttttcgc 
28801 atttgggtgt cgggcagttg cgagatttgg cgggtgccaa gttagaaggg tattccaacg 
28861 cggaacttgc cgcacgattt ggatgctccg aacggacgat tgaacgtcgt ctgcatctca 
28921 tccgtgaaaa atgtcaccag gaactgttcg atgagcattc gtcagaagaa actaccgatc 
28981 gcgaccttgg aacgaatcga tgacctgtgc gcggagttcg aacgaaaatg gcaatccgac
29041 caatccccct ccatcgaatc ggcgctgacc gaggacattt cacccgacga acgggacgtt 
29101 ttgctggctg aattgattgt cttggatgtc gactatcggc ggcgacgatc cgagagcccc 
29161 aacgagcaag actacctgga tcgctttccc gacaacgtct cagtcatccg cgacgctttc 
29221 cgcgggaacg accaacccaa gcgagccttc ctgccgcctt cggttgaaca actgagtgag 
29281 caattcccct cgttgcagat caccgaacta ctcggcgcag gcggaatggg tgcggtctac 
29341 aaagcacgcc aggaagggct cgatcgcgtc gttgctctca aaatccttcc cgaagaattt 
29401 ggtcacgacg tcaagtttgc gttgcggttc actcgtgaag ctcgcacgct cgcgaaattg 
29461 aaccatccca acatcgtctc ggtctatgag ttcggacacg tcgatgacac gtactatttc 
29521 ttgatggaat acgtggacgg gtcgacgctg cgggatgtgg tcgcggccgg tcagctcgct 
29581 cccgctcatg cactggcgat cgtgccgcat ctttgcgacg cacttcagta cgctcatgac 
29641 aacggcgtta tccaccgaga catcaaaccg gaaaacattc tgatggccgt cgacggttcg 
29701 gtaaaaattg ccgacttcgg actttcgcga atccttggcg atcaagacca accttcggcg 
29761 ctcactggaa cccaccagat catgggcacc ccgcggtaca tggcaccgga acaactcgaa 
29821 ggtgcccgtg gagtcgacca tcgcgccgac atttattcgc tcggcgtggt cttctacgag 
29881 atgctgactg gtgaacttcc aatcggtcgt ttcgcggttc catcgaagaa ggtccagatc 
29941 gatgttcgat tggatgatgt tgttttgaga acgctcgaga aggaaccgca acgtcggtat 
30001 cagcgagcca gccaaatcaa atcggacgtg caatccattg cctcgggtga ccgtgccgca 
30061 ttggcaccga ctcaagtcat gccatcggat ccccagcgtg atcatgccca ctcgtcctcg 
30121 ggcgcccacg accaggaact cgcgggccga ttgttgttga cccgccgcca actcatggag 
30181 cgggtcgaat catcgctgcg acctctgttc gccggacagg tgcttcagat tttcgtcggc
30241 gtggcactga tcgccctggg agcctactgc tgggcgccca acacgcatgt gccacatcga
30301 ttgttcagcg gcgtgatcgt gcacgtctat ggtttgctgc tgatcattgt ggctgccaac 
30361 gttttggttc gaatcaaacg cgtcgacacc tcccaaccga tcaacgaaat ccgtgaaagg 
30421 ctgaatgcag tcagaagctt ctcacttcgg gtcggtccga tcatcggaat tgcatggtgg 
30481 ttgatttgga tccccgtcgt cgtcgccgca ggattcgatt ccatcctcca tcccaactca
30541 ctttatccat cgctgatcct gggtgtgatc ggactcgttg gctcgtgcgg gttgtacgcg
30601 atggcgatga aatcagaacc ctggcgtcaa aaactcggcg ggcgaagtgt caccaatgcc
30661 gagcgagagc tcgccgaaat cgagcaaacc aacatccgct gatttctagc ttctagcttc 
30721 tagcttctag cttctagctt ctagctactg gctactggct actggctact ggctactggc 
30781 tactggctac tggctactgg ctactggcta ctggctactg gctactggct actggctact 
30841 ggctactgtg gctctgtggc tctgtggctc tgtggctctt tccactctca aacaacccca 
30901 ccatttgccc gcagcgtttg accattcacc caagacccat ccggaccggc caaaaaggac 
30961 accacggatg cgatgtcatc gggttgtccc agacgttcca gcggcgacat gttcgttaac 
31021 cgatcgatca actcatctga cttgtcgtcc aggaaaagct tggtcgcggt tgggccagga 
31081 gcaacagagt tgaccgtgat ctgtcgtcct ctcaattcat tcgccagcac agatgacatg 
31141 gcctccacag ccgcctttgt tgcggagtaa acaccgtaat tcggcattcg caatccgatg 
31201 acgctggacg acagattgat cacccgacca ccatcgcgaa gtcggcaggc cgcttctcgc 
31261 atcgtgtgga aacacccctt caggttgaca tcgaccagcc gagcaaaatc ctcgtccgag 
31321 gtctcagcaa gcggttgcat cttcagaacg ccagcgttgt tgaccaacac gtccaccccg 
31381 ccaaacgact ccgtcgcgaa atcaaagagt ttctcgaccg cgtccgagtc actcacatcg
31441 gcttgaaatg actgtgccga accacctgct tctgtgatgc gttgggtcag ttcatcggcc 
31501 gccttggaac tgtttgcgta gttgatcaca accgaaaatc cgtcgctcgc caatcgttcg 
31561 gcaattgccg caccgatccc gcgtgaccca ccggtgacaa tggcaacccg gttgttgttg 
31621 ttcatttgaa aatgctttct tcgctagtga aatgttgaaa tgatcctcaa gtaatgacct 
31681 tgcacccgaa gagtacgccg gccctttaca ctatttcaac aaacccaccc ccgttttcat 
31741 ggcacgcgtt tctcctggtc aatctcttca tgtcccgagt cttacctctc ctttctacat 
31801 tgctactcac ttattttttg cctttcgcgt catccaatgc cactgagccc tccggcgtca 
31861 cgactcaccc aaacatcttg atgatctacg ccgacgacct cggctatgaa gcactgcagt 
31921 gttatgacgg cctcgatttc gccacgccaa aactggatgc gatggcagcg gaaggtgtgc 
31981 ggttcgatcg ggcctacgcg agccctgtct gcacaccctc acgggtcagc ctactgacag 
32041 ggctgtatcc gtttcgacat tcgcatcttg gagtcctacc ggtacacaag ggaacacgcc
32101 aaaaagtgga ctttggaaaa atgccgacct tcccgcaatt aatgcgggaa ggtgggtaca 
32161 cgaccagcgt caccggcaaa tggcaactgg cgactcttga agtctggcca aaccatattc 
32221 gcaacgcagg ctttgattcc tggtgcgtct ggcaaatttg gcgagacggc gaaaagacgt 
32281 cacgtcattg gaacccgacg ttcaacgaag acggccatat ccgggaagac atccagcaac 
32341 gatttggtcc cgatgttttg gtcgaccatg tgattgatca gatggccgag gcgcaagcgt 
32401 cgcaacaacc tttcttgatc atccacaatg aattgcttcc gcacgatcca atcgttgaaa 
32461 cccccgagga tcgaaggctg aagcggcaac ctcgtcttgc gcacatgatc gaatacatgg 



32581 acgtggtatt catgggtgac aatggcactc acgaggttga ctttgacaat ccgaaatttg 

32641 gacaaccggg ggaacaaaaa cacactcgtc atacgcgtgc cggaaatgtc aacggcggca 

32701 agttcgacct gaatgatgcg ggcactcatg ttccgctgat cgtctgggga ccgccatcgg 

32761 tacccgccgg ccaggtttgt gacgacttgg tggatgtcgt ggatctgttc cccacctatt 

32821 gcgaattgac cggtacgaca atccctgaat cactgagggt cgatggccgc agcatcgcgc 

32881 cgcagattca cggccaaacc gggaagcatc gtccgtgggt gcaccacgcg attagcggaa 

32941 aaggagacag tctgttcgac ggttcatggc gactatccag tcagaaggac caattgtggg 

33001 acgcccgcga actgccggcc gaatcaccag cggcagcgaa tgatgaacac gccgaatcag

33061 ctcgcgtccg actcgagaaa gtttttcgaa cgcttcatca aaacgcctcc cagtgattga

33121 gaaacaactt ccgggaccga gatgaatcga cagctcggtc gcggattacc actcgattcc 

33181 acctacttgc tgaacagcac gcagatcggt gtggggactt gttgggtctg gcgtgtaaaa 

33241 gtcagctcgc cggtgtccgc gtcgatttca aacaccgtcg cggtatggct gtctctgcca 

33301 gcggcaatca accagcgacc gctgggatca aggttgaagt tgcgaggcca accaccgcgg 

33361 atggcttcga cctcgaccag ttccatttgc cccgcttcat caatgcgaaa aacggagatg 

33421 ctgtcgtggc cgcgattggc ggagtacaca aacttgcccg acgggtgaac gcgaatttca 

33481 gacgcactgt tgaaacgctc tttcgctttg atttcttccg atagggcagg gacggtttgc 

33541 ccggccgtca tcgtccccgc cttcgcttcg taatcaaaca cggtgaccga aagagccaac 

33601 tcgttgagca gataaatgcg tttaccgtct gggctgaact tcatgtggcg tggcccgcca 

33661 ccggggatcc cttcgccaaa accatgatgg gtgagtgatg gaccgtttgg atccaatttc 

33721 caaatcacga ctttatccat gcccaaatca ggcgtgaaag cgaatcggtt atcgggcgat 

33781 gtcccgaccc agtgagcgtg cggtgcgtcc tggcggtttc caaccacgcc tgatcccgct 

33841 ggtgacaaca gggcatcgtg ttcatagacc tgcacacggc gtttgaccga accgtcttcg 

33901 gccagttcat acaacgtcgt cgagccaccg ccgtattgtg ctgagagcag cacgcttccc 

33961 gtgcgatcgg ttgccaagtg tgcgactcca ccatcaccgc tctcgaccga ccccatttca
34021 accaattttg cgcccgtgga atcggcttgc acacgataag ccgtgacgcc gttcccgctg 
34081 gaggcgtata gcacgggctt gtttggatgc atcgccaaga acccgggttg ctgtgccttg 
34141 gccgccaagg tcgccggact cagtttgccc gattcggtgt cgaaagtggc gtggtagatc 
34201 ccttcgctgg caccgcctcg gggcgttgtg gttccaaacc aaacgttcag tgtttccgcc 
34261 atcaccggac tcccttgcaa aacggccaaa acaaccatgg ccatcaggcc aaatttcatc 
34321 ccaaagtacg ctcgcgttgt tgtttgctgg tcgccgctca attcgctgct catctgcgtc 
34381 tctccgatcatggttgcgat tcttggctgt ctggtgagaa aaactcgttg aagggggcat 
34441 cataccccag taggcgtgga gggagagaga ggaaagtggc tcagggccac agggccacag 
34501 gagcacgagc taggagcttc tcctcccgtg acattgccga tcggcggtgg tttgatctag 
34561 gatgctcgtt cttcgaacct ttcctggaat ccgctgtgct ttcaatttcg cccacctatt 
34621 tgctgtacta cctgccattg atcattgcga tctcgttggt atttggtgca acacggcacg 
34681 aggatttgtc gctgatcctt cggcacgcct tccacacggc ccgctggatc accggcttca 
34741 tggcggtcgt tttcgccctg gttctcttct tggactggat ggtctgagcc gatgaggatt 

34801 ttgaccctcg gtggcggcac ggttggacgc tggatcgctg acatgctgtg cagacgccgg
34861 cacagtgtca cgttgattga cagcgaccct gagatcgttc gatccatcaa tagcgaattg 
34921 gatgtccggg caatcgaggg caacgcctcg cagagcaccg tgttgttctc ggccgacgtt 
34981 ctgagtgccg acttgtgcct cgccgtcacc ggagacgacg aagtcaacat tgtcgccgcg
35041 agcatggcca aagcactggg ggctcgccgc gcgatcgctc gcgtttacgc ccccgcgttt 
35101 cgtgacctgt cgacgtttga ctaccaacgt cacttccaaa tcgacagttt gctgtcactc 
35161 gaacaactct ccgcttcgga gctcgctcga gccatccgca accccgatgc gattccgctg 
35221 gaacacttcg ctcgcggaca actgcaggtc tatgaaatgg atgtcgcggc gggctccgcg 
35281 gcggcaggca agaagttgat ggaattgcaa cttccgccga gcgttcgtgt tggatcgctc 
35341 gcacgcgaag gacgcatgtg gatcgccagc ggtgccgacg aaacacgagc cggtgaccga
35401 gtcagcctgg ttgggatgcc ggatgccgtc agcgttgctc gccaaatatt tggtggtgag 
35461 aaacgccgtc gcaaacaatc caaggtcatg atcgccggcg gcggcgaaac gggttatcac 
35521 ttggctggac tgctcggtcg ggaagacttt cgcaccgtgc tgttggaaca agacgcgaaa 
35581 cgttgcgagc aattggccaa gttgctgcct gacgtcaccg tggtgcatgc caacgccaac 
35641 cggcgcagcg ttttggaaga cgaaggtggc ggcacggtcg actacttcgt tggttgcacc 
35701 ggcaacgatg aaaacaacat catggcgggc gttgaggctc gcgagttggg cgcatcgcgt 
35761 gtgatgtgcg tggtcggacg tcccgattac gccaacgtgg tcggaaagct cggcatcgat 
35821 ttggcggtca gtgaacgaga cgtggtcgct cgccaaatcc ttgggttcct taacgaagga 
35881 gccatcatca gccaaagcaa attgccaaac ggttccatcg gtgtctacga attggaaatc 
35941 ctggaaggat ccgccgtcac acaggcgtcg ctggccaacc tgcctttggc ggggcgatgc 
36001 ctgatcgccg cgatacaacg cgacggattt gttcgcgtcc caaccgccag cgacgtgctg 
36061 gaagtcaacg acatcattgt cgccttgatc gatttatccg atgccgacga agtcctttca 
36121 ctctttcaca gcgattgatt gtgcctgtca ccacgaacgc tcctaagcca acccccaacg 
36181 ctgttgtgaa cgattccatt ttgccctcca ctcttgggaa ttcacctatc gcgccgagtc 
36241 cgccgacaat cgtcctgctg cgttttaccg gcgacgattg cccgggtttg accgcgtcca
36301 tttcggaacg gttgcatcgg ttcaattgcc gagtcatcga cgtcaaccaa gcggtgatcc
36361 atcgatcgct gctgttagga atgcttgtcc agattccggg cagcgaagac cccgaacgcc
36421 tgattcgaaa aatgcggcga aagagtcgca aactcggtct gaaatgcaaa gccaagatcg
36481 tttgcgatcc ggactatgac gcttgggttg atcgacaagg aaaatcgcga ttcatcctga
36541 cgctgttgtc gcgttcggtc actgccgaac aatttgctgc ggtcagtcgg ttggtttcgg 
36601 accaaggact gaacatcgat gtgatcacgc gattgtccgg acgccccaaa cgcgaaccgg
36661 gtgacgaact gacgcgtgcg tgcgtggagt tctccttgcg aggagatccg acggatgtca
36721 acgctctcaa agccagcctg ttggaattat ccaatcgctt gaatttggat ctggcatggc 
36781 aacgtgatga cgcattccga cgcaaccgac gcatcgtcgc gctcgatatg gattcgactt 
36841 tgctgcaagc cgaggtgatc gacgaattgg caaaagaagc cggtgctgga gagaaagtca
36901 gtgcgatcac cgaggcagca atgcgaggcg agatcgactt tgacgagtcg cttcgtcagc
36961 gggtgcaatc cttggaaggg ctacccgaga gcgtccttcc caaagtcgcc gagcgtttgc
37021 agttgaccga aggtgccgaa cgattgcttt cgaacctgcg tcgttttggc tacacgaccg



37141 atcatgtgca cgccaatcaa ctggagatcg ttgacggaaa gctaactgga cgagtcctcg 
37201 gccccatcgt caatgccgaa cgcaaggcgt tgctgctcga acaacttgcc gccaacgagg 
37261 gcgtcgatcg caaacagatg atcgcgatcg gtgatggtgc gaacgatctg cccatgctct 
37321 cccgcgccgg tctcgggatt gcctttcacg ccaaaccgat cgtgcgtgaa tccgctgaac 
37381 atcagatgtc cacattgggg ctggacgccg ttctctatct gctcggcgtc agagaccgcg
37441 atctggttga gtgactccat ggacgctgaa cagattgaag tttgggttga tgtcggcggc
37501 accttcaccg attgtctcgt tcaacagacc gattcaaatg gacatcgcca acgcgaggcc
37561 atcaaagttc tgagcagcgg gttggtgtcg gttcgagtgc tggaccacag cgacgacttc
37621 acgcaggtcg aagtcgaact cccacccgcc tttgctatcg acgcgttttg ggaaaccgcc 
37681 aaacttcgcg agcaagctga accggcgaac gcacctcaat cgttcactgt attgaattgc 
37741 cagcgaaaaa gttccggcaa ggttcaactg cagctgaacg aaccagcgac ccagcttcgc 
37801 cgcgatcaac actgggtgct ggatgccgag attgaatcgc ccattttggc gacgcgaata 
37861 ttgctttcgt gcccgttgcc acattcactg ccaccgcttt ccgttcgaat gggcaccacg 
37921 cgaggcacca acgcgttgct cacacgcaac ggtgcgccga ctgcattgtt gatcaccgaa
37981 gggtttgcgg acctgctttc cattgggact caagaccgac cgaacctttt tgcactcgac
38041 atcgtgaagc cccaaccgtt gacgagccaa gtgttggagg ttcgcggccg actcaacgcg
38101 agcggcgaag agctcattcc cgttgatgaa caacatttga gatcctgttt gctccaactc 
38161 agagccaaca gctccgaaca agcggtgcta gcgatttgct tactgcatgc ccatcgcaat
38221 gacgcccatg agcaaattgt ccaacggatt gctcgcgagg tcggattcaa cgatgtcatt 
38281 tgttcgaccg aggtggcatc gcttcctcgg atcgtcccgc gggccgagac gacgacgctg 
38341 gatgcctact tgctgcccat acttgaatcc tatgttgcga gagtctggga acaatttggt 
38401 ggggaaacga cgtgccatct gcgatggatg accagcggtg gcaatcttgt tgcctcggcc 
38461 ggtttccgtg gcaaggacag cgtattgtcg ggccccgccg gtggtgtggt tgcacttggt 
38521 gaagtcgcca aacaatgtgg cgtttctggc gcggtgggct tggacatggg aggcaccagc
38581 accgatgtca gtcgctttga aggtgcagtc ggacgccgac aagaaagcat gatcggcggc
38641 atccgcgtgc tcactccaat gatggacatc cacacaatcg cctctggcgg cggttcgatc
38701 tgtgccatcc gcgacggacg cttgacggtt ggtcccgaca gtgcgggagc ctttccaggc 
38761 ccggcttgct acggccatgg cggaccattg acgatcaccg acgtcaatgt cctgctgggg 
38821 cgtctgccga tcgatcgctt tcccttcccg ttggatttgg aagcggctca aacggtgctg 
38881 cgacaaacac atcaacaact gccaaccgac agcactcttt caccggaaca actcgccgac 
38941 ggttttctgc aattggcgat cacgcaaatg gccgaagcag tccgagtggt cacaactgcc 
39001 gccggaagtg atgcgagaca aatggcattg gtcggtttcg gaggcgccgc gggaggtcac
39061 ctttgccaaa tcgcggactc gctcgaaatg actcatgtga ttgatcaccc tgattcggga
39121 ttgctatctg cggtcggtat gggtgccgcg tcgatcggcc gtattgctac cgaatcagtg
39181 caaatcaccg tcgccccggc gattcctgaa actgccgata caacggcatc acgaacgttc
39241 ctttccgggc agatcccggc attgcaatcg gccgcgaagc ggacactcgc ccaatgcgaa
39301 caacagcttc gacaagaaga aaacattcca cgggacactg ttttgcacgt cgtgcatcag
39361 tgtgacgcca gaccattggg gacgcagtcg acgttgccga tcgatctgtt tcccctagag
39421 acgcttccca atcgttttga cgcgaagcat cagtccacgt ttggttatgc ccggacatcc 
39481 atgcccattg aaatcgtgac ggtccgctgc gaagccacga ttgagcggac gtcaaacgca 
39541 aacgagttgg cacgcgataa gtcggcaagt tcaaccaaga cgattcgcac acccgcgaag
39601 gcggacgtga tttcgatgtg ggttcgagga gcttttcgaa aagttgacct atgggaccgc
39661 gattcgatcc tcgcagggca ggtcatttcc ggaccatcgg ttgtggcggg tccctattcc
39721 gttttggttg tcgaaccagg ttggatggcg acgatgtcga gcgacggaat tttggagatc
39781 cgcaaggagc ctgcttgttc agccaacaca aactcatcgg cactcagcac ggagcaactg
39841 gacgacacca tcgcgatgga gatcgcggcg agaagagtgc agggcatcgc ggaagcaatg
39901 ggcgaagtca ttcgccgcac cagcgtcagc gtcaacgtga aagagcgtcg cgactacagc

39961 tgtgccgttt tcctcggcga cggttccttg gttgccaacg cgccacacgt gcccgtgcac
40021 ttgggtgcga tgagccacac ggttcgtagt ttgatcgcag actacccgaa gatgtcgccg 
40081 ggcgattcct acatcagcaa cgatccgtat gccggtggtt cgcatctgcc tgacatcacc 

40141 gtcgtgaccc cggtcttttg tgatgatgac acacccctct cttcacgccc aaatcattgg
40201 ccttgcgact tctttgtcgc ttcgcgatgc caccatgctg aaatcggcgg catggtccct 
40261 ggatccatgg cacccgcggc aacttgtttg gcagaagaag gcgttgtgct gcacaacgag
40321 tgtttggttc gcaacgggca atctcaccac gcgcacattc gcaaactgct cagcagtgca
40381 acgtatcctt cgcgtaatgt cgatgagaac atggctgaca ttgcggcggc ggaagcggca
40441 gggcaggctg gtgcgaatcg aattcgaacc cttgccgact cgatgacgaa gactcgttta 
40501 acggaactgc taaaacagct actgcatgtc gctggtgaag cgaccgctgc ctggatcgaa 
40561 accctgggag ccgcaccgcg aaactttgag gatgcgctcg acgacggaac cccgatccaa 
40621 gtcgaactgc aaccggacca aaccaccggt cgcttgcgaa ttgattttac gggcaccggc 
40681 ccagttcatg acaacggctt caatgccaca ccgtcaattg tgaccgcggc cgtattgtac 
40741 gtgctccgct gtgtcgtgcc gggcgaattg cctctgtgcg acggagtgct caagcgaatc 
40801 gagttgatca tcccaaacgg cttgctctca cctctcgctg gttcaacgcc ggagaccagc 
40861 cccgcggtgg tcgccggaaa tgtcgagacc agcaatcgag tggtcgatgt gctgctaggc 
40921 gcactgggcg tcgcagcggc atctcaagga acaatgaaca acttgttgct gggtgactca
40981 acgttcggat actacgaaac cattggtggc ggggcgggtg caacggcata cgcgtcggga
41041 gcggacgctg tccatacaca catgaccaac acgcgaatca ccgaccccga aatcttggag 
41101 tcgcggttgc cggttcgcct ttggcgtttc gcgatccgcc gaggcagcgg aggcctgggt 
41161 ctccatcgtg gcggcgacgg aatgattcgc gaaatggaat ttcttcgccc gttaacgcta 
41221 tcactcatca cctcaagacg aacgactcgc gcgtacggga tgaacggtgg agaaccaggc 
41281 cagccagggc gtcaaacact ggtgcatcag tctcagcgga cggaacttcc cttcgcgacc
41341 agccaagaag tcgatgcagg cgatcgcttg atcatggaga ctcccggcgg cggaggttac 
41401 ggcaaagaac tatcgacagc aacgtcggaa ctcccttgag gccgatcact cggacacatc 
41461 ttctgcttgt tcgtccgact cgtcgccttc atccgttttg acatccgctt cagaatcccc 
41521 gttgtcatcg gatccttctt ctttgaacgt gtaagtcaaa agctccaatt gaatctcggc 
41581 caacattgtg ttgcgatgcg ggatcaattt cacacgactt tcgatgacgt tcgtgttgct 



41701 cacatccatt tgtcgacgcc gcacgccgtt gcccggaacg gtcaggtgag cccgcacgag 
41761 aagtagcatc ggcgtgaccc aaggttgctc cacggtgcag ccccaggcat catcgaatga 
41821 acgactccaa cgaattgatc catccacggt cgagatcgca tgcatcgcgg tgattggctg 
41881 aaccgtcgct ccacccgaga cagccaccga acgcaatgac gcaaggtcgc gactcggctt 
41941 gcgatgcctt ggcagcacca ccatttgatc ctggatagac atcactgaca cgccatacaa 
42001 gccctttcga gctggcacca tcacatcagc aatttcacgt cccgacaaca aatcccaaat 
42061 cgttgtccgt ccattgtgat ccagcaacgt aaagtaacgt ccatcagtca tctccgcata 
42121 ggctgaggcc tcgttgacgt tttggctgtc ggcaacccga tgcagcacgg tttctccgga 
42181 actcatgtcg accaactcaa ccaattgagg gtcatgcatc tccttggcag cctccaggct 
42241 cgctctcaac tcttcggccg catcagacgg caccttggct tctcgattca cctgcgtcac 
42301 cgtcagcacg tgcgatccaa gcgacgagat cacggttcca atcgagcgtt tttggattcc 
42361 caacaatcca ccgtcgtgga tgtcatagct ctccacttga ttttgcgact tactgaccac 
42421 caacacccga ccttcggcgt gaacgaccgc gtttccaacg gcttcggttg gtgttcgcca 
42481 acgttcttcg cccgagacgg catccagaca aatcagttcg gttccttgaa gcaagaacaa 
42541 tcgatcgccc aagacgggac cgagaatcaa ctgcgcttcg ccattgtctg cggtcgctga 
42601 gaccatcatg tttcgataga tctggttgcc gaacggggtc gcggcacttc ggcgtttagc
42661 gaccggttgg ccgtcggtcc cgagtgttcg cctccatcga attgcgtctt caccgttcgc 
42721 cccgagtgtt cggaacaggt cgattgcgat cagttccgac tgggttaata acagcaacaa 
42781 cccaccgctg tacgtcaact cgttgccgag cgatcgcatt cccgaaaaac cttcgagggg 
42841 gatcgagcga tgaatgccat acgggttaat caacgccaac gagttcatgt tgtcactcat 
42901 cgcattccaa ccgcgaagtt gacgtcccat caagtgagtc gactttgcgt agcgtcggtc
42961 attcatcatc atgccgctgg accgaggcat tctcatcgac tgccattcca aatcgacatc
43021 gcgaggccac tgttcgccca tgtcatattc acgtcctcgg ataggattgg cttgcagtgc
43081 gtcttcgatg aaagtgtctg ctccgaccgg ttcgctatcg ctcaaacgtt ggcccaaaag
43141 ctccccgacg gccaatcgat ctaagttcca cgcttctttg ctgtacagtt tcgcaagcat 
43201 caacaaacgc tcattgctca atgattgcaa tgcgtcgtct gttgtcatct tatcggccca 
43261 aatcaaacgt tcggccaaca gaccatcttc catgagcaac gcacgggcgg ccatgtcccg 
43321 gcgcaaaccg ttggtcgctg ccacgacagg tgaaaagtgt tgcagcgtcc gcatcagcgc 
43381 cgctccatcc atgtttcccc gtgatgaaac tagtgactcg acctttccgg tcatctcaac 
43441 caactcatct tccgaggcag tcgacaccaa ttgtgccacg cgtgcactga tccaatcgtc 
43501 aacggtgaac tgtcgtgccg cattgggaaa gaccgcgtcc atgtcgcgat tgagcgtcgg 
43561 atgtcgcagc accaattcgg acaaatccag cagcaactcc aacgccttgg gatgatcaat 
43621 ggaatcagcc gattgatcga cggatccttt gatttgcaaa accaacaact cggctcgctg
43681 ttcgggttga tcaatcagct tcatcaaagc ttccaaatcg acactggagt cgctcgggtt
43741 ttgacggaga ctgcccagca tcgcttccac cgacaacatg atcacttcat cgttgtcggg 
43801 atccaaatcg cgagcctttc gaagcaactc gagtgctcga ggtcgctgtt cctgctcgat 
43861 caacaattcc gcttttcgga cgagagcaaa gaagttatcg ggatcttgct gcaacagagc 
43921 atccacgcgt ggttgcaaag aacgctcgcc ataggccacc gcaaccatcg tcgcggattg
43981 cgagatcaat tcgccgtcga tcgcgatcaa gttgcccaaa gggaaacgca cggttcgtcg
44041 ctgcaaaacc tcgccatttt gcaatgagat ctcgatcagc tcatccgtgg tcgtgggcag 
44101 atagtaacga tccgttccaa atacgccgcg gcccgagatc cgttgcccag gacgaaccaa 
44161 agtcggtggc gtggaccaaa cttctgatcc atcgtttaac gaacacgctg acacttcatc 
44221 agcaccaacg acgaaaaaca agtcatcttt gacaccggca acgtaggcca atcggatgcg 
44281 gtttttcggt gtgaaacgat tgcgtccagt caagacatcg agaacttgca atcgatcgca 
44341 ttcaatcgga gttaggacca ccgaagtgcc ttttgcaatc gccaacgaat gtgtccagcg 
44401 acgactgagt tgattcgccg acacgcgagt gttgggacga tagatgctgt gaacaaactc
44461 ttgacttcgc gggatttggt ttccccaacg aatcgttcga tcaaccaaat tgattgcgac 
44521 cgtggcgccg gctccggtcg gacaaatcaa caacccgtca tgataagttg gcatggctcc 
44581 ggcgacgcgg cgaaccggat cagaaccaat cccgccggac tccaccgcga ccaggacttg
44641 ttgccattga acggagccgt cggcgggatc caagcacaat aaaacgactt caccggcaat
44701 ctcggcaatc acataaaggt cacctcgaaa cggaagcggc ggccccagga aaaaggcttc
44761 ggacaattcc gacggagtgg attctgattt accgattcgc caaagaattt tgccctctgt 
44821 ttttacatcc aacgccacca atgtgttcgc caatgaagac atgccacggc tcgcaaaagg 
44881 gctgaaccgc tcgatctcaa ttttgctcaa gtcgtcgatc agaaagactc ggcgtccatc
44941 gctcgaaatt tgaccgaacg gcacgtcgtt ccagacacgc tgaacgagga tatcaaccgg
45001 gtcttcttcc tcctccatcg ggtccgtcag ttcgggcgac aaagcggttt cggcggtagc 
45061 aaaccatggg tattgccaga cgcgttttcc ggttcggaaa tcaacaccca acaaacgctg 
45121 cgttgttcgc atcaacaatt gatcgcccac ctgcaacgga gtccaagagg gcggcggcaa 
45181 ttgaccggta gctttcagct gctgtgtctt ctcctgaatc atccgttctt ggcgagggct 
45241 ggtggacgac tccaccatcc atcgcgggtt gctcagtggc atttgaccgg agcctgaact 
45301 gttgcgatcc gcgctgccac ccaacaaggg ataattctcc gccaacgctt cagcggtgac 
45361 taagtcggat tcgctctgcg ctcggatgta ttctttccaa gcgacttcat cggcttcgcc 
45421 gttcgggaac atcgacgtga acgattgagg atacttcaac ggtgatggaa tcatccggtc 
45481 cgcccgagca cacgcgatgg catgcaagat caccacggac tcaccaactt gttcgacgat 
45541 acgttcctga ctgaccagat catccagcag cagcgaacaa gccagcggat tcccaagcgt 
45601 ccattcacgc accgccaaca aaaccgaagc cttgtatccc gcttcggtat gaaacgtttc 
45661 gcgtcgcacg cgctcgactg ttttccaatc tcgttttggc ccggcttctt ccagcagttt 
45721 ttccgccagg ggaccgtagc gaagctggta gatttccaga ccttggttcg gaagctcacc 
45781 aatcagatca cgaacatgtt ccagcaagct agctggcaac tgacttcgca attttgccaa 
45841 agcctccgct tgatccgtct cgtcctcaat caattggcca tcgcggtcca acatgcctcg 
45901 ttggtcactg agaggtcgat cggtctgatc atcatcatca gccctggcaa tcggataata 
45961 atagtcctgc atcatttcag aggacgggtt gtcaccggat cgaagcaaca ggtttcccaa 
46021 gtgcacgacc gcgtcgctgt aacgtttctc atcgatcgct tcctccgcgt cgcgaatgac 
46081 ttgccgcatg cttcgtggtg ggggaagaaa ctccgcatcc cccatctgca tccgcgaatc 
46141 aaatagccct tgagcttgag cgtttttcga ttggagcgac atcgcgaaaa acattgcggc 



46261 attggcgtgg aaattcacta ctcagaccaa ttaaaaacaa gctatcgaac aaacgcactt 
46321 cgcccgcgag ggcacacgaa atcgcgatct cattgtatcg ccccgcgttc ctcacaaacc 
46381 agtgtgatct tccgccgaac gggtggccga aggaatcttt cgccgcatta gaaaaggctg 
46441 gccggatccg cccgaaccag tttccgcagg gccagcaatc cgctgatcag gcacatgagc 
46501 gtcgtcagaa ccagcacgga cccggcgcga gagaccgtca atgccatcgg caaaccggtc 
46561 agcgattcga gcattcgaaa caacaacaaa gcgatcaaga cggcgggaat gaatcccaga 
46621 accgacagat acaccgattg tcgcaccacc aacccgacga agaatcgatt ggggtatccc
46681 atcgccttca gcgtggcgta ctccgacatc gcgtcttgca aattggtgaa aaggatttgg 
46741 tagcaaatga tgattccaac cgcgaatccc atcagggtgc caatcaagaa gatcattcca
46801 actggcgttt gctcgctcca aaacgtctgc tctcgatctc gaatttgttg tcgagtcaag 
46861 acttgccact gaccggcggc catttccgtt agtcgttcag cagcttcttc cggggatgcg 
46921 ccgtcttgca atcgaatcaa accaaggtca accgtctcca atggcttgcc gtagccacgt 
46981 cggggaaaat agtttgcaaa cgaggactcg gcgatcagca acgatccctc attggcaaag 
47041 tcggtcccca atgtgaacga acccacaagt tcgatcgatt ggttcgacaa ctcgattcgc 
47101 tggtccgctg gatttcctga atggtcgaac tgaaaaccat attcggatcg agttttgcga 
47161 tcgaccagtg cggtatctcg ttgccgcagg gcatctcggt tttcgtccag gccggcgata 
47221 ttcaaccagg gccgtggtga atcgcctgcc cccgtggaat aaagacttgc gttggggacc 
47281 gaaatcacgc ggatcggtcg cgatggactg ccttcgactc gcacccgtgc catggttcgc
47341 tcgatcaaca acgcttgcac gtccgtcacc gccggatccg ctttgactcg gcggagcaaa 
47401 gaacgcggga acctttcgtc cactggcaac atgtaacgag cagggctgat cgcgaccaac 
47461 tcgcaatcga cgacctcgat caatcgcacg ggactatcga gcaacgcgtt gcgaaaaccg 
47521 ttttgagtga acatcagcac ggcggcaaac cccactcccg ccgcggccac gcacaatctc 
47581 gcgggattgc cgatcagatt tttccaggca agtggggtgc gtggactcat cagcgaccct 
47641 ctgtttcagc gtcacgctga aggttggaaa cgatctgcag aaaatccttc gcgtcgtcaa 
47701 agtcacggta aacgctggca aaacgaatgt aggcgacttc gtccagcgtc gcgagtttcc 
47761 gcaacacgac ttcgccgatc gttgcggtgg ggatttccat gtcaaattcg gcgtaaatcg 
47821 cttcttctat tttctggacc gtcttttcga tctcgtcgct ggtgacggca cgtttcgaac 
47881 acgctcgctc gattccgcgt ttgatttttt cgcgatcaaa cggctcgcgg gtttcgtcac 
47941 ttttgaccac ccgaacattc agtgcatcga tttgttccag cgtggtaaaa cgacgctgac 
48001 acgtttggca cagccgcttt cgccgaacca cgtacccagc ctcggccgtc cgcgaatctt
48061 ggactttgtc gttatcggaa tgacagaacg ggcaacgcat ttgcagtcaa attcaatgta 
48121 gaaaattgac ccgaaattca aacgctccgc cccggtggac tgggcaagca tgattcacga
48181 gtctttcgaa ggtgtgctat gatcgtccag aacgtcttcg gatgcgaggt gtctggtccg 
48241 acaatcacag accgagccat cacagtgtag cgccccagaa cgcggtgatt tcggccaagt 
48301 aggacaacgg tttcggctca gaacaattgt ttggttctgc gacgacttgc caacattggc
48361 atgatttttg gattgaaacc gctcgtgaac caaccctcga attgaatcaa caggaacgac
48421 tcaacgatga cagacccggc caacccaaca gacgaaaacg caacctcgaa aactcccgcc 
48481 caaacggatt cggtccccga tccgctttcg tcggccgctt cgtccacccc cgtttccgat 
48541 tctggtgccc cagcccaacc aagagctgac acgcctcggg aacgcttcat ggaacagatt 
48601 gccaagaagc aatccctcga cgatcacgat ccggaggaat ctctttggga aggtggctac 
48661 agccctaagg cgatgatcgg ttcatggatt ggattgtccc tactgagcat cgtgttgctg 
48721 gtcgccgctg gattcatcga acaattcacg ttcggcatcg cgttgatcgt gatcgtggtg 
48781 ctttggatcc tggtgggcct gaactatgcc gcaaaacgat tgggcgtgca gtacgaactc 
48841 actagccaac gattcattca ccggaccggc atcctgactc gccgcactga tcgcattgaa
48901 gtgatcgaca tcagcgatgt aagctacgaa cagggcccga tccagcgaat gttcaaggtc
48961 ggttcaatct ccatcatcag caccgacaag agcgatccaa agctggagtt ggttggaatc 
49021 aacaacgtcg gtgaagtggc cggtttgatc gatgacattc gacgtgccga acgacgacgg 
49081 cgcagcattc atatcgagca aaattaagca tgtccgtttg gtttgttcag aaaggcgatg 
49141 aacaactcgg tccgtacaag cccgccgaat tattggcttt ggtgcgagac ggatcggtcg 
49201 tcgcggaaac caggattcgc aaggacgatt ccaactggtt cacagcctcc gaggtcggag 
49261 gattgtttga agccgcgcga cgaccatcga ttgaacacta ctgcccacag tgcaacgttc
49321 gcgtgagcca accgcccacg atctgccccg agtgcggcaa agacttggac cggaccaaga
49381 ctcgcatcat cgaacacagc attggacaat ccaaacgctc atccgacaca ccttccgccg
49441 ggcccagtga ctcagcaaag cgttggcttc aaaagaagat ccgacgggac aaaaaataat
49501 gccaccgatg agatccgatt gcggtctcat cgatggcatt gaattgtctt tgcttgcggc 
49561 tcagtcatga ccgctctctt tcaatagcga tcatggatgt cgcgtgacag gatcaatact
49621 tcagttcggc accgaaggtc aaaccgatga agaacacatc gtcatcggca atcagagagc 
49681 gaccgaaggt tggcgtgacg gtggccgagc tgatttgctc ttcggatgtg gccactccgg 
49741 tcaggtacca agcttcaaag cctccgcgga cagccatcag ttcgccaact tgatagcgga
49801 caccggttcc gaattcgaac attccgctca gctcggtgga ttcttccgag ccgtttgcga
49861 ggatgtcggt ttcgttgcgg accaacacac ttgattcagc gatattgagg aatccgcctg 
49921 ctcgagctcg aacatagctc gacgagaatc ggcccattgg attgtagata tccgctccga 
49981 cctgcaaacc aatcattcgg ttggcagcgt tgttccgaag caggccagtg ttggcaccgt
50041 tgacggtcga gtagttgaag tcttcttcga aatcaatata gcgtccacca atcagcaact 
50101 tagccacgtc ccacgccatc atcgttcggt tcatttcaac cgaccagaac cgggattcat 
50161 agatttggcg ttgtgcgtcg gcttcgtctg cagcgtcgaa accaaaggtc ccaagtgtgc 
50221 ctggttgcag tggatcttcg atcaacagtg ttgacagacc atcgaccgct gcgaccgaca 
50281 catccgattc ccaatccagg acgccggtga agcccacttc gaaaccattg acgcagtcag 
50341 gaacgacgcc aaccgtgatc cttggagccc attcgaaatc cagatcatcg agagcaaagt 
50401 cacgagcaac ggtgaagttg tcgatccctt cgcgacgcat gaacaacgct tcgacgctgg 
50461 cgtagcgata agggtcgcac gtgggacatg ccattccacc gccgtatccg gccgaaccac 
50521 acgccccgcc gcactgcgtt ccgcaactgc cacatgcttg agcttggccg aggccatgat 
50581 gcccaacctg agcgacttcg ttgccgaagg aggacgaaga agcattcatg gctgattgca 
50641 tcgcaccgag atcccagcca ccggctgcag gatcagaggc ttgcacgaca cgaaacgctt 
50701 ggccatcgtg catcatcgtg ttgctggatg atgggccgcc gaattgctga gcatgcgtgg 



50821 cggcgaatcg tggaaaccag ttgttgttca taattgaacc tcaggacgaa caaagcggca 
50881 cggtgaccga cggtggctgg gtagtggtag gttgacgaac gtgacaggga ctaacgaagg 
50941 atcagttgat cttcaactgg caccaggtag gtctttcgaa tcaaacgaac gatttctttt 
51001 gcgtcgcggg aatcatcaca ttcaccttga acgatgatgc gatcaccctg acgcgagatt 
51061 tgcaccgacg cggtgcgata cttttgttcg atcagctcca tcaacgattg ctcttgctgg 
51121 ctctcgccac cacgaacgtt gctgggtgat gtgatatgca attcgtaaat ctcacggatc 
51181 gattctgcat cgcccgccaa tttttgctgc accaacactc gagtcttgcc ggggcgaacc 
51241 ccgatcaatc gcaagtgacg cgaggtgctg ccgatgactc gagcgattga atcgtcttcg 
51301 acgcggaccg cttcaatgac gccatccacg gtcagcgaag tgacatcgtt cgcgtcgcac 
51361 tgaatggtgt cttcgattcg caacgaccgt tcggcctcgg ttgccggaac cgaacgatga 
51421 aactcaccgc gatgacttgc caatgctgca ggaacgatcc gtgcggcact gactcgggat 
51481 tcggaagaga ccgaagcggt tcgcttggcc accgcaacca ttggtggtgc ttcgacggcg 
51541 accgccgcgc gatgtcgaac gggagcattc tgcatgcgtt cggaatcagc tcgttccaac 
51601 tgcgttggtg agtcaatctt caccggcaga tgacgctgtt gctggaccat catttccgac 
51661 ggagtcttga tttgcaccgg ctttggcagc cgcggcaaag tcgattcgtc agtgggttgt 
51721 tccacgggcg gagcgatctt cgtcggcacc gcaattttca ttggctcagg catcgagacc 
51781 aattcatcag cgaccaagtt ctcgcccggt tcatctgaat cagagaaact cagcgaaatc 
51841 gaatcgtctg catcactcaa cgaatccgtc gcgtcattga gcgaaaactc aatcggttct 
51901 gacacttcag catttgcaac aaccggcgtt tcgattggtg caacatcaac ggcttcaact 
51961 tcgacttcgg ctggctgcgg tgagactggt tcctgcacag cgggtgctag aacgatgtct 
52021 tcttcagcaa caacattttg ttcgacgacg ggtgctggct cgctcgcaat cttgggttgc 
52081 tgccattgca cttcagtttg gtcgacttcg aaatccatcg aaatggattg cttcgcttcg 
52141 acgttggtcg agacttgagc aggcgtcagc tcaacgatgc cgtttgattc agctcgatcg 
52201 taatctggtt gagcgtgcgg attgattctc ggcaaaggag tccgggctgg ctccacgacc 
52261 aacggtgtct gctgcgtcgc gggagcacca atcggaagca aaccgacagc ggtgccaatc 
52321 gatttcagct tgatctccga catcgtttgg tcggatgttt ggcggattgg atacaacacc 
52381 gttgttccgc ccgacacagc atcgccctga gtttgaatca ggttcacgtt gctcgagaca 
52441 gtcttaccgg cgacggtcga tgctacgcca gtcggtgcag caggtcgctc cacaaacggg 
52501 ttgctatgaa tcgtccctga caccaagcga atcggtggaa gtttcgcggc taccgcgtcg 
52561 tccgccaaca ccgtgtttcc ggcgacttgg ctcgcggcaa tgcaaccacc ccaaacaaaa 
52621 gcttgggtca accaacgagc cagcctttcg cgtttgcggc agctgctcat tcgattcgag 
52681 cccgtcgggt tcattgtctt gccaatcagc atggcggatc cttacgctca cgatctttta
52741 ggatcatgta cccccccggc acattttccg cgtcgttggc cgagtgaaat tatccaccgg
52801 acctccgaac accctctgta tcgtcgtcgg gtggttccgg cattaacgat tttgccagat 
52861 cgaacgtttg gttctgctgg atcggtcttg acacagatac ggcttggacc agccacgccc
52921 actccgttga gcggaaacgc tcagccacca gcgaggcgat acgtgttcac aactcggctg
52981 tttcagccgc gacggtgttt cacgaacagc atcgcgagac gcagcactgc gagacggcaa
53041 tgtcacaaag caaactgaat cacttgctcg tttgattcgt cagccgttac ttcgctgccg
53101 aaccgcttcg aaagccacaa tcgcagcact aacggaaaca ttcaaactgt cgacgtgacc 
53161 ggccatcggc aacgccaccg gatcgatgga gtcacccgcc cagcgatccg ccaacccatc 
53221 cgcttcgctg cctaacacca ccgccacctt tcctcgcaaa tctgaatcga aaaggggccg
53281 cgagccttcc actcgcattg ctgccacgcg gtagccatgt gttttcagga atgcctccat 
53341 ctgcgattcg ctccccgacg ccgacggaac ggtgaacacg gctcccaatg atccacgaat 
53401 cgcgttggga ttgaatcgat ctgagggaca atccgacaac aggaccgctg atacgcctgc 
53461 cgcatcggcc gaacgaaaca ctgcccccaa attgccaggt ttctcaactc gatccagcac 
53521 caagatcaac ccgtcattcc atgaagaacg ctccaaaagc tgctccagcg aatcgtccgg 
53581 agccacaaac tctgctacac agcgatccgt cgaagccgtg taagcgattt tgcgaaacac 
53641 ttctgacgtg acccagcgat gcactccgga atcaatcgcg tgctgccttg cgacggcctg 
53701 ctcgggcata tccggcggcc ccgccgattc ggatagtttc ggttcgatct cgtagaacgc 
53761 cgtcagtttc aaccccgcct cgatcgctcg cagggactct cgtgggccat cgaccaaaac 
53821 gactcctgcg gcgcgtcgtt ttcgggaacg gcgcagcgag acaattcggc ggacggcggc
53881 attttgcaga cttcgtatgg tttccatggg gcaacacttt tgcacattcg cttcgatggg
53941 agaaccaacc cgagtctgct atcttctaac gtttgccaaa cagcttctct gctaccaacc
54001 cctgaacgct cgaatgtccg aaaatgtgtt gtgccaaccc gtttccggac gccaagacca
54061 aaaggcattt ctcgatctcg aaaagcgtct ctacagcgac gatccgaact gggtgccgcc
54121 gctatggagc gaacgggtca aattggtcgg attcaagcac cacccatttt acgataacgc
54181 cgaaggccaa acctttctgg ttcgtcgtgg cgatcgtgtc gtcggccggg ttttggcggt 
54241 cgtcaatcac gcccacaacc gctaccacga cgagactcgc gggttcttcg gcttcttcga 
54301 atgcgaggac gatgaagagg ccgccatcga acttctcgac accgccggcg actggttgaa 
54361 gcaacgcggg atgacctgtg tacgcggccc ggttcacccc agcctgaact acgaagttgg 
54421 actgttggtt gatggcttcg acacgccgcc gactttcttg atcccctaca accatccgta 
54481 ctacgaacgc ctgattcaag ctgctggatt tgagaagtcg caggatctct acagttacga 
54541 agcttcgatc gacatcctcg aaacgctcga cccgaaacta ctgtttgtca tcgaagaatc
54601 gacgcgacgt ttcaatgctg tgtgccgttc gatcgacccg aagaacttta acgcggacgt 
54661 ccgagtgttc ctggacatct acaaccaatc actgcagcga acttggggct acgtcccgat
54721 gagcgaagcg gaagtcgatg accaaagcaa cggtctgaaa aacctgctgc ttcccaaact 
54781 caccagcatc gccgaaatcg atggcaaacc cgtcggtgcc ggttttggtc tgctggacta 
54841 caaccctttg atcaaaaaga tcaacgggaa gctgcttcca ttcggttggt taaaactttt
54901 gatgggacgc aagaaactca agcgtcttcg tttggtcagc gccaatgtgc tacccgagta
54961 ccaaaaatgg ggtctgggtt tggtcacgct ctacaaaatt ttgccggaag cgatcaaatt
55021 tggaatcgaa atgggtgagt tctcttgggt gctcgaaagc aatcaacttt cacgaggtac 
55081 gatcgagcgc ggtggtgcaa cccgaagcaa aacgcaccgc atttatgatc gatcgctgac 
55141 accggaatca gaacaggcat ccacgtgagc gaaccactct cgatcgattc gatattgggc 
55201 cccggcggca gcatcagccg acgactacca cgctacgaac ctcgcgaaca gcagctcgag 
55261 atggctcgtt cggtctcttc ggcattgacc gaccgcgagc atttggtcgt cgaagcaggc 



55381 caaacagaat cgggttcgca aaaacccaag cgttcgtcgg acgatgtcga tgatcgtatt
55441 ggccccgctg gtgatcggga caaagaatcc gatcggccaa aacgcgtcct gatctcaaca 
55501 cacacgatca gcctgcaaga acaactgatt ggcaaagaca tcccgctgct caatagcgtc 
55561 atcccacgag agttttctgc ggtgctggtg aagggccgca acaactactt gtcgctacga 
55621 aggatgggac gagcggtcga gaaatcagtg tcattgatgg ccaatgattt ccaaatgcag 
55681 cagcttcgcg aaattcgcaa gtggtccgac aacactcctg acggctcgct ctcaacattg 
55741 ccgatcaaac ccgacggtca ggtttgggac gaggtccgca gtgacacggg caattgcctt 
55801 cgcaataaat gcccgaactt caaggactgc ttttacttcc aagcccgacg ccgtgcccag 
55861 aacgcacaac tgttgatcgt caatcacgcg atgctgttca ccgacatcgc aatgcgacgg 
55921 cagggtgttt cactgctgcc tgactacgac gcgatcatcc tcgacgaatg ccacaccatt 
55981 gaatccgtcg ccagtgacca cctcggcatt cgattgacca gcggccaatt tgattacctg 
56041 ttcgaccgtc tctacaacga ccgccaacaa aagggtttgc tcgtcgcgca caacctggat 
56101 gcgttgcaga agatggtcga ccgatgccgg tttgcggcca gcgaaatgtt tgccggtgtg 
56161 ctggactgga tgcaagaatc acgctctcgc aacggccggg tgcatcatcc cgaagtggtg 
56221 cccaacccgc tcagcgaacc gatggagatt ctcgctcggc ggctgcgtgc tcacgccgac 
56281 gcacaagaca acgactccga tcggcaagac ttccaatccg ctcacgatcg acttctcgcc 
56341 ttggccggtg gccttcgcga atggctcgac caatcgctca aacaagaatc ggtctattgg 
56401 gttgaaacct ccggcagccg tcgcggaatg gaccgagtct ctttgtccgc ctccccaatc
56461 gatattggcc aaacgcttcg cgaagaagtg tttcagaatg aacaaattgg ctccgtcatc 
56521 atgaccagtg ccacgttggc gaccggcgaa caagacaagt tcaaattctt ccgcagtcgc 
56581 gttggcctga cgactggccg ctcactccaa gtcggcagcc cgtttgatta tgagaagcaa 
56641 gccaagctga tcatcgttcg tggactccct gatccatcag cgaaacgaga cgaattcgaa 
56701 gcggcgcttc cccaacaaat caagcgattc gttggacaca ccgatggcca cgcgtttgtg 
56761 ctgttcacga gttactcgtt gcttcgcaaa tgtgctgagg cgatcacacc gtggtgcatc 
56821 gaacgtgacc tgcacttgta cagccaagcg ggcgatcaaa accgaacgca actgctcgat 
56881 tcattccgca aagatccacg tggcgtgctg cttggcaccg acagcttctg gcagggagtc 
56941 gacgttcctg gcgacgctct caccaacgtc gtgattacga agctgccgtt ctctgttccc 
57001 gaccatccgc tactggaagc acggctagaa accatccgcg cccgcggcgg ccacccattc 
57061 cctgactacc aattgcccga agccgtgatc aaattccggc aagggtttgg acgcctgatc 
57121 cgcacccgcg atgactccgg aatggtcgtg gttctggatc cacgaatccg atccaaaccc 
57181 tacggccgac ttttcctcag cgcacttccg cctttgccat gtcacgatgt cggcaccgtc 
57241 ccacgacaaa agacaaagaa gcaatgagca agcgcagttc gcttattgga tgtgtgcagg 
57301 aaccgttgcg aatgcgatgc agtcaaagcg ttgatccgcc aagggtgtaa gttttcgcaa
57361 gctttatccc gaagggattg cagacggtag ccggtggttt gaacgcagtg aacaccaccg
57421 gattcatgcc ccttgggcaa tttccgaccc tgaaggggtc gtagagactt cgcacctttc
57481 agggtcagat cgagtttttc gtttccgggg gtgcgctgct acgcggcacc ctccgcccac
57541 aatctgtgat cccttccgag atgcagaaaa gctgaagtcg gaatcaaaaa aaagtcagcc 
57601 atacagaagt cgtgtcccac gagcgaacgc cacaatgatc caaatcatgt ttcgcgatgt 
57661 ggattggttt cgaacaggtc catcgatcgc tgcaccttca tcgagtcaag ccgctgcatt
57721 cgaccaaggt catccgccat cataccggca tcgatggaca caggtgtcgt cggaaatatt 
57781 ggccgcttgg cgttagccac ggtttccgcg cctaaccggc tcacatcgtt atgcccgatc 
57841 atttctgacc ctatttagcg ggcgacatca atttcccacc accaaatttc gatttggtat 
57901 ccaatggggt cctccatcgc tcttcttgtc ccgtttggtt tttcctaatg cgtctcatcg 
57961 ctctaccgct gatcgtgctt ggtctcgctg ctccctgtct atctgccgaa cagaatcgct 
58021 tgacgattca gtggttcgaa cttgactcga cggagcagcc cgaactgacg aagaagctgg 
58081 cacgttcgag tcaggtcaag aactccgaca cggttctcac cgaacttcgc gaatgcgaat 
58141 ccgcacgctt gatggtgcag catgaattga gtgtcacttc tggtagtcgg gtgaatcaat 
58201 cgaccaaaat ggacgacacc acattcaaat tgaacttgga ggtccaacct gcagatcagg 
58261 gaagctacac gttccatgtc acctcctcgc tagaaactct cgcaccggga aaggacatcg 
58321 cccgacgaac cgcgataaga aaggtcgact cggcattcaa actgtctcca ggcacgagct 
58381 tcgtactgac aggcctggag cgaagcggac gtgaaagagc aatcacacaa gtgctggtcg 
58441 ttcgcttgca aggttttgcc gaagaacaaa ctgtagaaac ggttcaaaat gggctggaac 
58501 cacactcgag tcctgaagca gcactttcga caaacgattg acatggctcg tgggatcacg 
58561 ggcactaatg gaaacctctt ttattggcac tgctgttttg ctctttcgtg atggcttgat 
58621 ctcgttctct tcagcgatcc cgtttcttct cgtcagctcc aagactgatt tgtcgctcaa 
58681 tcgcgacgac tgcgattacc cccaagcccc aaagtgaaga aacaataaac gtttagcgtg 
58741 agattcatga accggcgtac caaagccttc ccgcgaagca gagcatcact aagctcaggc 
58801 gattgacttc tccagtgcct gacttggatc ctgaacgatg tccgtcactt cttctgcttc 
58861 cacatcattg gcgtgtggcg cgtgcaaaca caacaacgct cccggcgctc aattctgtgg 
58921 cggatgcggg catttcctgc atgagaaatg tgttcagtgt ggcgactcgg taagcctgac 
58981 tcagaagttc tgcgttggct gcggacaaga cctgaacgcg tggctggaaa aacgaatcgc
59041 ggaacaagag accaaactct ccgatgcggt ggccgccgtg aaatgccacg agtacgaacg 
59101 cgcgctaggt ctgctcaacc ttttagcgaa aagcggtgac taccgattcc aatccgttcg 
59161 cgaacaagct gctgctgcga aagacaaggt cgagagcctg cagaagaagg tccacgcaca 
59221 agccagccaa cgcatcaccg cagcaaaaga agctcacgcg caaaacgatc tttcaactgc 
59281 agtcaaattg ctggctcaag ttcctgaaaa tttgctcgac gacgaatccc gctccattcg 
59341 tcaaagcagt caggtgcacc tggatcaact caaaacactg catggcgaac tgcagcaagc 
59401 attggccgaa aagagctatt cacaagtcgc tggacttctg cagcaactgt tggaactgca 
59461 acccgacaac caaaagtatc aacaactttc gcaacaagtt ggagacaagc tgctgcgtcg 
59521 tgctgagaaa ctgtgtgctc ggcaggaata ccagacggcc cgtaacgcac tgaattcgct 
59581 tccgacaatt tgccacaacg ctcaattcgc cgacctttca cgtcgcagcg aattggcttg 
59641 ttggttgtcg aaacaattcg atgtggaacc gtacgcaacc aatgctcttg gccggctggc 
59701 catgcgttat gcgaaggaat ttccttctga cgggaaggcg tctgattgtg ttaaacaact 
59761 ttccaaagcg gtgaaatcga agcgcaccac tgcaagagat ggcttggcac cttggcgaac 
59821 caaagctgaa agctggattg gtggacgagt tggcattctc gccaatccgc aatcgttgaa 



59941 ggcattgcac gcactcggac tcagccgcat cagtggcaat ctgctgccga aaaaaggcgt 
60001 gatgtcaaaa ctcggcttaa gtaaaagcaa agcggtttgg ggcatcgacg ttggtgcatc 
60061 cggcatcaac gcgatcaaga tgcgagtcga aaaaggctct gaccagccga tcgtggaagc 
60121 cgcccatcgt gtagagctga agaatccaac ttgccgcggt ggatccaaat cggcaagcga 
60181 attaatcccc gaagccatca cgcgactgat ggaagaaatc gatgtatccg actcaaaggt 
60241 ctacgcgaat ttgcctgctt gcgagggcat cgctcggttc tgcgaactgc ccccagtcaa 
60301 agacaaagac gcggaacgat tgattgaaac agaggtcaaa acccgaattc cgatttcaac 
60361 cgaagatctg gctttgatca cctgggttgc acctttgcaa aaggggagca ccgtcggtcg
60421 tcccgtcgtg atggccgccg caacaaagct caccgttagc cgacgtgtcg acctacttgg 
60481 catcggtgga ttgaaactgg atggcttggt tccatctccc atcgcgctgg ccaacttcgc 
60541 ggctcacgag ttctctgagc tactcgcacc gccggccgac aaatcggcaa agaagaaatc 
60601 gaagaccgtc gaagaaacat cggacgagtc gagcgaagac gaatcattca gcctgacctc 
60661 ctcgagcaaa caaccgacgc tcgcactgat cgacgcgggt gcatccaaaa cgacgatgct 
60721 tttgatctct ccgatatcca tctggttttg gtctcacgag agtggcggcg aagacatcac 
60781 cgccgtggtc gctcgacgaa ccaaaaccac cgcggaggat gcagagcaat ccaagcgaaa 
60841 cctggcatca atcaaggagc ctcatgaagt cgacgacgac attctcgaga aacaggaaat 
60901 cactcgtgcc cgacttcgaa aactctacga agaagccgac aaaacattcc gccatttcga 
60961 tatccaagca acttggtgcg tcggctccgc tcatcaacaa cacggttttc tacgtcgtgt 
61021 gatgatgaaa tagcagaatc aacctctctc cgattccgcc aaaatctcct ctctgttgtc
61081 tccggccggg ccaatggact aggatgacaa ccaatctttg catcacacac actgtgatgg 
61141 agacaggttg atttcattta caagcggaaa gatcgatgag cgttgatggc ggacggcgtg 
61201 tgcaactgat ggcgacactg ttcttgacgt gtcttttcgg cctcccgcct gcgcccaaag 
61261 cggatgctca ggaatcctcg cccgccgaga aagaaccatt ctttcacaat ccaatcgcga 
61321 agggagccga cccttgggtc attcgagacc cgcatcaccc tcgttatctg tggtgcaaat 
61381 cggacaacga tcacggcatt ttgatcttcg aaagcgacga cctcaccaaa ctcggcaagc 
61441 gacaccttgt ctggaaagca cctgacgaag gaccttattc aaaagaggtc tgggcacccg 
61501 aactgcacgc gatcggagat cggtactacg tctactttgc cgcctcggat ggcgacaatg 
61561 cgaaccatct cacttacgtc ttaaaatcca agacgtccga cccacttggc gagtacaccc 
61621 tgcatggacc gatggccacc ggcgatggcg aagacggcca atcgcccaat gtttgggcga 
61681 tcgacatgac catccttcaa cacaacgaca atcttttcgc gatttggtct gggtgggacc 
61741 aaccgggcag cgacaaccaa ttcctgtaca tcgccccgat gagttcccca acgaaaattt 
61801 cgggcccgcg tgttcttgtc tgcgacaacg atgaccatct ttgggaacgc atcgaaccgg
61861 acgcatccca acgtgggttg aatgagggcc ccgagatctt ccaagcgaag ggaaagacat
61921 cgctgctcta ttcctgcggc gcttcgtggc tgccaaccta caaacttggt cggctcgaac 
61981 tcgtcaagga cgatccgttg gcggatggtg catggcaaaa gatgccggaa ccagccttcc 
62041 agagcaccga atcagtctac ggtgttgggc actcttgctt tgtccaatca ctcgacggca 
62101 aacagtggtg gcatatcttt cacgcaaagg tcggccccga acctggatgg aaccgagccc 
62161 tccatttgca acccatgaac gtcgccgacg atggaacgcc tcttctgggt tcgccgctgg 
62221 accgcacgac tccgattcaa cgtccgagcg gccaagcact cggtgaagag aggtagggcg 
62281 gaccggttca cgtggttcga aacgattcct tgtcttgttc actgaagtgg gacatgactt 
62341 ggagtgctct ttttcggccc tgcaagatag aatcaaccgg atcactcgtc ggacaaatca 
62401 acaaccggcg agtgattctt ttttgattcg actccatttc gaccaccatc tgaatgccca 
62461 cgaccaccac cacagccgaa gacggatttc cagcaaccgg tcgtttggcg tcggtcgact
62521 acggaaccgt tcgcatcggc gttgcgattt gcgatccaga ttggatcctg gccagtcccc 
62581 ttgaggttca tccggtcagc acgccggaga aggacgcaca gtatttcatc gacctggcaa 
62641 aatcggagcg aattgccgcc tgggtcgtcg gtttgccaat ccactgcgat ggcggcgaga 
62701 gcgataaaag taaagaaagc cggaaatttg ccgcttggtt gaaagaaacc accggactgc 
62761 cgacgcggct ttttgacgag cgtttcacaa ctgtcgcggc caacgccaaa attcggcagg 
62821 gaaaaacaac acgcaaaaag accaaacaac gcgttgatgc cgtcgccgca caagtcttgt 
62881 tggaatcgtt tttggaagcc tgccgatacc gcggtgaatt ggcgggccat tgcctggaag 
62941 attcaacttc cgatgagtcg ctcgatgatg catgatggaa cagaacacgt ggcgtgatga 
63001 tgccccgtgt gacgcttgcc tgttatgaat aagcagccga tttgaatgag ccaaccattc
63061 gtttggcaaa atcagaaacc tctgatgaga tggatcgcaa tgaccagtcg ctcgatttta
63121 ttccactcgt gcacgtttgc tctgctggga ctttgctccg ctttcatcca tccatccgca 
63181 acagatgccg cggacgtggt cgtcgttgcc tcaccggagt tcgacgaaac ccttgaacct 
63241 tggatccaaa tgcgatccga agaggggcta cgagtggcca tcgctcgtcc tgccgacaat 
63301 gcttcggaca ctcaccagca aatctggaat gtgggcggcg cagacactcg ttacatcgtc
63361 ttggtcggcg acacgccgga ttacgacacg cctcgaaacc aaccgcatca aattcctact 
63421 tggatgatcc cggccccggt cacctcgcgg ttcggatcca catcaacgct gcccaccgat 
63481 cttccatacg gcgaccgaga tggcgatcgg gtcagcgacg ctgccattgg tcgtctacct 
63541 gtgcgatcag ccgaacaact tgcgagcgtg atcgggcgaa tcgaggccta tgaaaactcc 
63601 gacgacttcg gactctggcg acgatcgttt cagttgaccg gtggagtggg tggctttggt
63661 gcgatggttg acaccgccat cgaatcggtc acccgaggcg tgatcaccac ggtgctgccg
63721 gccgatgcaa aaccacagat tgcctacgcc agcccgaacc atccgttttg tcctcccggt 
63781 gaatccttca ccgatgcggt cctgaatcgc tatcgcacgg gcgctcgctt ttgggtgtat
63841 gcgggtcacg gccaaatcga tcgcctcgaa ttgctgcaaa ccactgcggc ggatggaaaa 
63901 ccggcagccc gcgaacattg gtcggtcgag tccttgctaa acaatcaaaa cgcaacgcag
63961 ttggaacgtg ctcccgaagg ggcgacgatc gctgtgctgt tggcttgctt cgcgggtgcc
64021 tacgacgccc ccggagactg cttggccgag cggatggtgc tcgccgaagg tggcccgatc 
64081 gccgtgatcg cttcgtctcg cttgtcaatg ccttatggca acgcgtgcat gggcttgggc 
64141 ttgctgcaat cggtgtactc gagtggccct caaaacaccg gctgcgaccg tatcggtgac 
64201 gcgatgttgc atgccgcccg cagtttgcaa tccaaacaac aaggtaaaca atcgaccatg
64261 cgagtcatgg tcgacacgct cgcctcgatg atctctccag ccggaacgga tctcaagcaa 
64321 gaacgactcg aacacgctgc gctctatcaa ctacttggcg acccaacctt gcgtttgcat 
64381 ccacctcaac cactggacct ttcgatcgag ccatcgaatc aaaccgatca acaaaccgat 



64501 gtcgccgttg aacgaccgct aaccgcgatc acaaagacgc caactgattc aaccagtgaa
64561 cacgacgcgc atggcaccac catcaccgaa catcaaatcg aggtttccgc tggcaaacgc
64621 atcctccaaa cactcacgct tccgctggga gaatcaggtc cgttgatcat tcggggtttc
64681 gttcatggta aaacagggtg ggccagcgcg gcccaaagaa ccttcctccc tgattgaccc
64741 accttcccca tgcgacttcc ccacctcctg gctttgtgcc tgcccgcctt tttcacagct
64801 ctcccagcct ccgccaccgc ccctgaagac atcgcgggat cacgccccaa cattgtcgtg
64861 atttacatgg acgatatggc ttatgccgac attggccctt tcggagccaa aggctactcc
64921 acgccgaacc tggatcgaat ggccaacgaa ggccgcaagt tcactgactt cagcgtttcg
64981 tccgccgtct gctccgcgtc gagatccgct ctcctgaccg gttgctacca ccgacgtgtc
65041 ggattgtcag gagccctcgg cccccaagcc aagatcggac tcgccccggc ggaaaccacg
65101 tttgcagaag tttgcaaatc agccggctat cgaaccgcat gtcacggaaa atggcatctg
65161 ggccaccacc ccaaattcct gccaaccaac caaggtttcg accagttcta cggcattccc
65221 tacagcaacg acatgtggcc gttgcacccg gacaccatcc gccgtcaaca aaaagatcca
65281 aacgaccccg gcaattggcc gccgctgccg atcatcgaat ccatcgcggg ccagccacct
65341 cgaatcgtca acgacaacgt tcaaccggcg gatcaagaac aaatgacggt ggaactcact
65401 cgccgcagcg tcgagttcat caagaatcaa tccagcgaca aacccttctt gctgtacctg
65461 cctcatccga tggttcacgt cccgctgtat gtgtccgagc gttttcgcgg caagagcggg
65521 gcagggctgt ttggcgatgt gatgatggaa gtcgattggt cggtcggcga gatcctttcc
65581 gcaattgaat ccatcgatca gcagaagaac acactcgtga tcttcaccag cgacaacgga
65641 ccttggcttt cctacggcaa ccacgccggc agtgccgcgc cacttcgcga aggcaaaggc
65701 acccagtggg aaggcggcgt tcgcgaacca accctcatgt ggtggcctga aaccattccg
65761 gctggcacca cctgcgagac attctgttcc accatcgatg tgctgccgac gatcgtggag
65821 ttgaccggcg gcgaggctcc cgaacgaaag atcgatggcc actccatcgt tgatttgatg
65881 ctagacgttc ccggtgcgaa atcgcctcac gaatccttcg tcggttacta cggcggcggg
65941 caactccaaa cgattcgcaa cgagcgtttc aaattggttt tcccgcatgc ctatcgaacg
66001 ttgggagatc gcgaaccagg aaaagacggc atgcccgacg gttacgccat gaccaagtcg
66061 ggtttggaac tgtacgactt ggacgccgac gtttcggaaa ccaccaacgt gatcgaagcc
66121 cacccggaag tcgtcaagca actgcaggcg gccgcggaag tctatcgtca acaactcggc
66181 gacaaacttc aaaaggtcaa gggttcagaa atccgtggcc ccggaaagct caacgaaaac
66241 gagcctcaac tgagttggtg atgtttgtgc tgaaattgat gcattgctcg gtgggggacc
66301 accgagctac aggttgaaca tcacgtaggc cgtaccgaag caacgcgaag ttacggcaag
66361 cacgtccagc agggcagggc gatctcaaac atcgccagcc aatcgcacga cctcacccag
66421 ttccatcaca ccgtcataga gtgaccgacc gacaatcgct gccggcatct tcatctcgac
66481 cagttgcttg atgtcgtcat aagtcgtgac gccaccgctg gcgaccaagg ggatgtcgct
66541 cgcagcagcc atctctgcca acccttcaaa gttgggacct tgcatcatgc catctttcgc
66601 gatgtcggtg taaacgatcg cagcgatgtt ttctgttcgc gatcgcagct tcgtcgcaaa
66661 ctcaaacgcc gaaacatcgc tcgtttcgag ccatccctgc gttgcgactt tgccatcgcg
66721 agcgtcgatg cccgcgacca acttgccggg atacttgtcg cacatttcga caaaccaatc
66781 cggatctttc agtgcccgcg aaccaacgac caatcgcgtg acgccgacat tcagcaaaga
66841 ttcgatcgtg atttcgtcgc gcactccgcc gcccatttgg cagggcaacc cagtcgcctc
66901 gatgatccgc tggacggcgt ctcggttggc atcgggactg tcgcctcgcg cggcgtccaa
66961 atcgaccaaa tgcaatcgtt tggcgccgga ttcctgccaa cgtttggcga attcgaccgg
67021 gtcatcgccg aaagtggttt gctggtcgta gtctccttgg cgaaggcgaa ccggttttcc
67081 gtgacgcaaa tcgatggcgg gccaaatttc catgcgtggt tcgcttccag gtgtctgcga
67141 ggacaaaaat tgcgggtttg gggcagggca gagcagggca gggttgcgtc agcgctaccg
67201 tgaatacgcg tgaggataat tcgagaccgc cgtttactgg acccgtcatg acaaaaattc
67261 gccatctcgt cctcgtgttg ggtgatcaac tcaaccacga ttccgaggcc ttcaccgatt
67321 tcgatcccca atccgatcga gtttggatgg cggaaaacga cgaagaagcc acccatgttt
67381 ggtgccatca aactcgcctg gttggctttt tcagccccat gcggcacttt cgcgaggaat
67441 tgatccagcg tggattcgat gtgatctacc acgaactgac cggtgatcgg cgaaaggctc
67501 gcgaaagttc ttttgccagc gttctgcgga aaacgctctc ggctcactcg gtcgaaaaaa
67561 tcttggtcgt tgctccaggt gactaccgag tccgtgaaca actcaaggcc accgccgaac
67621 aggaagacgt gccgctcgag tttctcaccg acaatcactt ctactgttcg cccgaccaat
67681 ttgacgattg ggcatcgggt cgcaaatcga tggtgatgga acagttttat cgaacgatgc
67741 gaaaggagca cagcatcctg ctcgacgaag aaggtgcccc cgaaggcggg cagtggaatt
67801 tcgaccaaga caatcgcaag acgttcggca naggcggacc caaagaagta ccgccgactc
67861 catcgttcaa acctgactcg atcactcagg aggtgatcgc gatggtgcga gatcgattcg
67921 aagaccatcc cggcaaagtc gaccagttcg atctgcctgt ttgccgcgac gatgcgttga
67981 actcgctgga cgatttcatt gagcatcgat tgccgctctt tggcacctac caagatgcga
68041 tgtgggaggg cgaaaccttc ctctatcatt cgcgactctc acactcgatc aacttgcacc
68101 tgctgtcgcc gaaagaggtc gtcgatgcgg cggtgaaagc ttataaagat ggcaatgcac
68161 cgctgaactg cgtcgagggt tttgttcgac aaatccttgg atggcgagaa tacgtccgcg
68221 gcgtctattg gaaccgaatg ccggactacg aagaacgcaa cgcactccac tgcgactcgg
68281 aacaagatgt tccgccattc ttttgggacg gcaacaccga catggcctgc gtttccgatg
68341 cgatgcgttt actggtcgac acggcctacg cgcatcacat ccaacgattg atggtgcttg
68401 gattgttcgc ccaactcttt ggcactcacc cgttgcgttt tcatcattgg cacatggcga
68461 tgtacgccga tgcgatcgac tgggtgagcc tgcccaacgc acttgggatg agccaatacg
68521 gtgacggtgg cctgatggca accaaaccct actgcgccac cggcaaatac atcaatcgga
68581 tgagcaacca ctgcaagaac tgccgctacg atccagcaaa atcaaccggt gaggatgctt
68641 gcccgttcac gactctgtac tgggacttcc tcgaccgtca caaacaacag tttcaaaaca
68701 acaatcgcat gaccctgcaa ctgaagaaca tcgaacgaaa agactcctct gaatggacgg
68761 ccatccgttc gcgtgcaagg aagatccgca gtggaacgat caaattgcag cagtagggtc
68821 aaagcgtcac cggaatgcga agtgaaccat gtaagcccgc attagcttca ttcattcttc
68881 accgcacact cacaccggat tgacgcgata aaacgaggca gcctctatcc tggggcgaaa
68941 actccacctt ctctccgaaa caactttgtg gggtgaactg aaagacgcag cgagacggcg



69061 tcgcccgatt gtgcgaaaga accgaggcca acgccaaacg ccgcacatcc ccaatgacgc 
69121 ctgcgtggct tcttacgatc aacatgaaac gcatctttct tcaaagctca atcctgctcg 
69181 cccttggact ggcaggatgc acgcccgctg cctcggagcg aaccttcgac gttcccttgg 
69241 aagaacccac tcgagcggac cgccccagcg aaatgattgt tgccgactac caacaatcaa 
69301 aaaggcgtga cgatctccaa acacagatgg actggccggt tctatttggc ccacatcgaa
69361 ccggtgcgac gcaagcgtcc atccctgcgg tttggcccga acagggtccc gcccttcgat 
69421 gggacgtgga agttggcacc ggatatggat ctcctgtgac cgccgaaaac cgcgtcgtct 
69481 tttcccatcg cgtcgatgac cttgagtgga ttcagtgtca tgatgcccag gatggagcgg 
69541 tccggtggaa acactcgctg gagactgacg ccgaatgcga ctttgaatac agcgatggcc 
69601 cctacagcac accgttgatc gaccaggagc gtcgtgccgt ctatcacttc agcggttctg 
69661 ggacgctggt ctctttggac ttcgacacag gcgaagaagt ctggcgccgc gatcttcacg 
69721 gcgaatttca ggtggaaccg gaattgttcc catccggagc gacaccgctg ctgcacgggg 
69781 aacacttgat cctttccttg ggcggagcga acaacgacgc gggaatcatt gccatccatg
69841 tccgagacgg taaaaccatt tggtcatgca cagatcacaa agccgcctac accacgcccg 
69901 tggtggctca acttttcggg caagaattcg ccttcgtcat gacagctgaa ggcttggttt 
69961 gcctcgatcc cgccaacgga gaagtcgact gggagatccc ccatcgaagc cgtgccccca 
70021 tgagctacaa ctcggtctcg ccgttggttt ggaacgacca tgttctgatg gtcaccggcc 
70081 cgggaccggg cgcagtgtgt ctccaaatcc tgccggatcg ttcctacgaa gaacgctgga 
70141 aaaaccgacg actcctcgat agccaataca ccaaccttct gctctccgat ggccatgtct 
70201 ttgggttcac gtccgcgggg cagggcggtg ccgaactgcg ctgcatcgaa ttcgtcacgg 
70261 gtgaattgaa atggaagtac cacagcgttc tgcgtcgcag tcagggactg atcgcaggcg 
70321 atgcaatgtt ctttctcgga gaacgcggcc acttagcctc gctgcaacgc agcatcgaag 
70381 aaccgcgagt gctctcgttc acccaacaac ctctcatgaa ggcagactgc tactgctcgc 
70441 cagccatcct aaagaacggc atcgtgctta aagacgaaga ccgaatagcg gtctttgact 
70501 taccaccttc tccataatca aaccacttgg agacgtttac caatccatcg tcgaccgcga 
70561 gtcacaaatc ccatcgtcgc cagagcgatc attgcgagaa acgtcccagg ctccggcaca 
70621 gcggtgaccg tgctgaatgt gatgttgtcg aactgatgag ctctcgcagt tccaccctga 
70681 ctccatgtca gactggtcaa accgtgaaat tcatcaccga acgtaacgac caatggagtg 
70741 agttcgtctg cttcgaactc accagacaac aatgtcaccg aactcatcgt tccgttgaca 
70801 atttttgtac cggtgaaatt cactcggacg gtctgctcaa tcgttgttga tagctcgctg 
70861 taaccaagta atgtgatgga ctccatatca aatgtctgtc ctggggcagg agcattaaga 
70921 gccagcgtga attgaattgg cccagtcccc gccgcctggg acccaagtga agtggattgg 
70981 tctgactggc catcccacca ctcatcactg ctttgttggg gatacaaaaa cttgcccgtg 
71041 gaaatcccga catacattcc ttcaataaca tactgtgacg ccccaggctt atcgtcagac 
71101 gaccctacca cagtaaaacc atcagaatta ttgaactgca ttgtcctcgt gacgacagcg 
71161 ccatgaactg tttcagaaac caaaacgttc gcaagacaaa cgaagcaaca aacaaataaa 
71221 acaaacggtc gccaaacttg gctcttagac ttctcaaccg tcttcatagt catcacctgg 
71281 tgcattcact tgagtacgct ttcgagccac aacctgctcg agatcagcgt tttccgctaa 
71341 acctagttgt cgattgatcg ttcggatcgt tgcgagcgga ctgccgggat cctgcgactg 
71401 ccattccttc acacgggcct gcacatgcaa gcgatccaag gattccggca tcacttccaa
71461 ggactgctcg taccagtagc ccgctcggtc acgggtccct tgacgataga ttcccgctgg 
71521 caatccttcc gacagttccc accacatgtc gcctattgcc aagaagtcct cagcatcgtt 
71581 ggcaccttcc agatctcgct tggcaacccg ggaaagtttt tcacttgttc cgttgatgac 
71641 caacggcaac cccattgccc aatcgccttt cagaaagcaa taaaaccggc cgagagctga 
71701 tgccaattcc gcatcgtccg gattctttcg gtactcggcg accttctcag cggcttgctc 
71761 atattgcttg ttcgcagaag acaactgtgt ctgaaggagc cggacaagct tttcagcgga 
71821 cgcgtcacgt ttgccacccc ggacattggc tcgccgaaac cctcgacgct cttcctccac 
71881 tcgtggcaaa gcggaagcaa tccggccaac tccttcaaaa tcgtttcgtt ctaaaccttc 
71941 ttgaattgcg actagcgctc ggcggacaaa tgccatttga ctcgcagtgt cctcgtccac 
72001 atcatttcga cgcgaaaact ccaatagccc atctcgattg acctgataaa gatcgacttc 

72061 aaaggtggac gcgtgttgat ccaagaccgc ttgaaaaagc atgaagtcgc cgctgtcgat
72121 cgccagatca atcgcagcgt tctgcaaggc gtaggcacct gcggcgtcat ctgacatcac 
72181 caatgatttc tttagaaaat cgttgctcag ctccgcctgt gtctgattct gtttcgctcg 
72241 aactaactgc tccccgaaca actgctcgat tcgttgattg gaggtggcta agtcaggaga 
72301 tggtgggacc ggcaatcgat tttcacccca actctctgcc tcaacctccg tattctcgac 
72361 gatcttggat gcttcaactc ggccagaaac cttcgcttcc accttcgtct cgctgcggga 
72421 tggtttatgt gcagcaagca atgcatcggg ctctttgtgg atggcccccg cccgctgccc 
72481 atcacctgac tcatacgtga acgcgttggt ccattgtttc gttgacaccg aattggcaac 
72541 atacgaaggc cagccatttg cccaagcaac caacgtacct cccgatcgca attcaccgag 
72601 cgatcctttc aaagcaccta gatccgccgg ttcgaagtcg accggacgag tccattcgac 
72661 tgctagatca tcattgacct ccacgagcat cagcgtgttc tcaatcccat cctcaatatt 
72721 gcgatcgcgc ggataacggt tgtccccaaa catgtagttg cgatgagccg gcaacatcca 
72781 attcgttttg ctgtcaaatc gctctggaga aaccaactca tcaggaatga actcaatcag 
72841 ctctttgttt ggggaacgat tccacggcat gttgagatcg aacttctcaa atagatcttg 
72901 atgcccgaga taaggcaaaa tcaaaaccct ccaagacaat gtcttgatcc cagccttcgc 
72961 ttgcgccgac ggcgggtaat gtcctttgtc ttccttgtat gccatcaggg cttcggtcaa 
73021 tcgattgaga ttctcgactg cctttgcatg acgctctggc tcacccaaag gatttgccgg 

73081 tttccgatca gagactggct tccagtctcc gattgtttct tccggactgt ttgcatcacg
73141 gtctttggct gccttctgca ccgccgatgt tgtagcggcg ttcgccgtct cctttacccc 
73201 tgctttggcg tcgaccacct ctgatgcgat tgccgtgctg actggctgag tttgtgcctt 
73261 cgcggactcc tctttttcgt ccgaactttt cggacgagca cgctgcgcag cgcgcatcat 
73321 caactcttct ttgctgggac cacagccccc gcagcacaac aattggcatc ccaaaacgag 
73381 cgacaccatc gtcccgacga cgctgcgatg accaactggt cgcaagacga ttccacttcc 
73441 tgaagtgtta tcactcatcg tttgcaacca ccgcaccgat taaaggaaac caattcgaaa 
73501 agtcttcgtt ttcgggatca ccgatttgga cgtaacctcg tttctgatcc aaattgaaga 



73621 taggttcaat ggatacatcg agtgattttg gactgacatc cagaacgctg agagcacgct 
73681 cttcgtcgaa gtgctttcga acaatcagat gaaactcgtg ctcttggtcg tttccaagtg 
73741 tgcccagatc aaggccatct cgcgaatgga gcatgggtga gtgaaaacta atgggagcgt 
73801 gcacgcgtcc agttagctcg acactttttt cgatcacatc tgactcacca ttgaccagca 
73861 agtgcagctt cacttcgcct tgaaaacggc ctggcttctg agctcgacat cggagtttga 
73921 cttggttgat gacggtcggc tgctgttctg acaaagacgg ctgtgagtcg gaaggaacga
73981 catttgattc ccaatcgaat ccgaccaaat cggactcggc accgacgact gcaatatcgt
74041 cgtgctgttg gctgtagatc aaaagagtcg actcaacgac ctcacccgca tcgccggagt
74101 gaaaaacacc cttggcaggc acaaagagtt cggtcctgac cacgccggaa accgtgagat 
74161 cgatcacctc tcgcgttggg tcgttggtaa tgacccgcgc ggcttgttcg tattgttcgg 
74221 atttctgacc tgtgttccac gtcagtgtga cactcgtttt ttcgcccggt gcgagtgttc 
74281 ctttctgtgc acttccaaca gtgcatttgc aggttgtctc tgcaacgtca atcgtcagtg 
74341 gtcccgcccc atgattccaa atctcgaatg cgtgcgaagc ggttgtgtgc gggtcgacca 
74401 gcccgaagtc gtaccgggta gatggcaaaa ccgcttgcgg caatcccatc gactgggcct 
74461 cttttcgcag ggcctcttcg cgtttgatca acgaaacctt cttctcgtat tcagcgcgtt 
74521 cggtcttcga gactccatac ggagcgtaag tcacgatggt cgccaatccc gttgaaaggc 
74581 cgccaaggac agacgccagc aggattccaa caagcaaaaa ctttcctgtg atgctcatca 
74641 cactctattc ttcaaagaaa aggagaacca acgaaaatgg aagccaaaag ctcgccatct
74701 ttgtccgcaa acattcctga actcgcgctc attgggtggc atgctttgcg taaaacggcc 
74761 acggcatcca ttgaaagatg ccgtggcttg aatggacgga actcgcaatg attccacctt 
74821 acgaaacgga ttgttcaaac ccgaaatgga tcactcaaag ttgcctttga agtagccatc 
74881 gttgatgaag ataccggcga agaatttgtc gcgaggcatt tccaacttgt cgtcggcgta
74941 gccaacacca gagatctcag cttcggtcaa accggtgacg gggaaaccag gattcaagaa
75001 tccgtcggcg ttgctgtcgg cgaagacttc aacgtgacca tcgcccatca ggatattgca 
75061 gctacctgcg tgaacggcgt accagtcacg agtgtcttgc atgtagatcc cgttgccgcc 
75121 tgggccggtt ggagcagcac atccagctgt ggtgggttcg ccacgctcac aaccgatttg 
75181 cgcgagcaga gtttcgttgc taccaatcaa gtccaagttt cctgcagtcg cattccagta 
75241 ggcaggtccg tcgttgaacg cttcggtcaa caacgatccg ctggcgatgt actcgacagc 
75301 ttcggtttga ttcagggcag ttccccagac gccggttcca tcgaaaccga gagtcgcacc 
75361 gagaatcgct tcgtcgacat cgccaggagc tgcacagccg atgaagccaa cgttgctgct 
75421 gctgatacgg cttcgctcca agtcggcgac gctcattgga ccagtcgatc cgcccaaacc 
75481 cttgaatcca ccgccggacc aagtggtcaa ttcgccggtg ctcggatcag cttcggtttt 
75541 gaccattcca cggaccaagt gccaagaagc ggcgtagttc gtgttgtaac cgttggtcaa 
75601 aaagtagcga gcgatcaact cggcacgaac agcggtttct ggagccgttc cgccaaaggt 
75661 ggtggagctt ccatcgcccg aaacgccttg ccaatctgct ttgccacact ggccagcact 
75721 caggcgtgcc aacggtgcag cgtccttggc gtcggtggtg tccttcccgt acaagtcatt 
75781 gagtttttcc gagcccttca atgggttcga agggtccaac gactcgttca tgttgccgtc 
75841 gccaacgttg acgaggtcag caacccaacc gtattcgtcc atgcaaccat cgcgtcggaa 
75901 gtccgacgct ccactgcaga acgttcccga aggagcacga ttgctgtatt tgaacaggcc 
75961 gatgccgatg ttcttgaggt tggcttggca ctgagctgca cgggcagctt cgcgagcctt 
76021 ggtgatggct ggcaacaaca aagctgccaa cagagcgatg atcgcgatca cgaccaacaa 
76081 ctcgatcagg gtgaaacctg attttcgaaa ttgcgacttc atagtctttc ctgagatttt 
76141 agttgccgaa agccgaacgg cttcgacctt gtgttaaaat gagatgtggt gagtttggtg 
76201 gcgttggact cacaagagct gagacttgtt ccagcaagct tcagcttgtc gagcgtcacc 
76261 gactccttct tcttcggaac gaaacctggt ggagagactc tctcccaagc ttcaattcca 
76321 ttgcgtgatt gcaatcacgc caatcgtctt ctttcagttg ccgtgacgtg tgccacaggc 
76381 taaggcaaac ctcctttcgg tagagaagct tctggttgct gtcagcctcc ttgaacagat 
76441 tcggtggctg acaagacatt catttgggtg gtgttttaca agaagccgat agcaccaaca 
76501 ggcttcacaa cttgcattca caaacctttc acaatcccta ctcattcgct tcgcattcac 
76561 acagttgcga aggcagcttt cttgatgaat tctcccgcga taagactcac aaaactcatc 
76621 gaatcaaaac cgagggcccg cgagtcttca ccaagacttc acacacattt ctgagactga 
76681 ttcatgcaaa actgctctgt cgattcagtc gatagcaaga ttgcaaagcg ggcatttcgc 
76741 tggtcatcgc ccgaacgtgc cagtgcgtta gtgatagaga ccaagcaatt gccggacgca 
76801 tggcaaagga gactgccgag caagcaaaaa gcggactgtc gggcgcaaca ttcgatctat 
76861 ttcgcggatc gatagcgtcc gactttagag ttcttcaccc ttgctattga gcgagtagta 
76921 gctcgctagt tccaagtcgc cggcttcgac ttcggcatcg gtccagaagt tggcgtccat 
76981 tgggaatcca ttgttaataa aaccgtcacc attggcgtca tacagtcccc gaacacttcc 
77041 gtctgccatc aggacgtgac aaacaccacc atggtgactc gacatcccgc ggtaatcttg 
77101 tcgcgtgtcg taattccaaa ttcgcagcca tcctgagacg ccttcgcgtg gcgtgctggc 
77161 ggcaatcacc ggttccttca gataatttgg atcggtttct ttgtacagaa ttggctctcc 
77221 gatgatggga gtcacatacg gcgttccaga tgtcaattca ccgacgttgc cggaaagctg 
77281 gccaatcggc gacgcatcac acaaaagcgg aactgtgctg ctgggtgctt tggcggtgtc 
77341 caaaagacgc gtcgtcaatg gaccacgggt cacgttgagc gacttcagat catttgtgca 
77401 ggcagaatct ttgggctggg gctgaccgct ttcgtccaaa atcacttcac tacggaccag 
77461 gaaccacgac gcggcgtagt tcgtgttgta accgtcttcc agcattttgc gttcgatata 
77521 ttcgactcga tccgcgggac tcgcctcgat caaccgagcc acattcttga: taacggtgcc 
77581 catttcgttc gtgtattcct cggcgcccaa ctgatcgacg cagtcgttgg tcgcccagtc 
77641 ggcctcgttg tcactgagca ggtgttcgat ggcgctactc gcctgagccg atgtgctggt 
77701 gcaccgcatt tcgctgacca acacgccgcg tcgtaccaga tccgccaccc atcccacttc 
77761 agtgggcaca ccatcacgac gaaagtcgaa tgcacctgaa caaaactcac cattgggcgc 
77821 gttcactgtc cgtccagtca ggccgacacc aaattgcttg aggttgttct gacactgagc 
77881 ggctcgagct gcttcacgtg ctttcgaaac ggccgggagt aacaacgctg ccaacacacc 
77941 gatgatcgaa atcaccacca acatctcaat caacgtgaat gcggcgttgt ttcgcggatc 
78001 caatgggtgc ttgagagatg attcgaaatg gcggcggttg atggacatca caatgaagac 
78061 cgtggaggag gatgagacga gagcccaaag agggattctc gtgcagctcc acttttcgca 



78181 cggctcgtct tcatcgcaaa ttcaccgaca ccaaaacaca acaccatcgt ctgcgaagta 
78241 ttccactggc agcgaacctc agcatttggt atcggcgaac acagttggtc ccttcccacc 
78301 gataaaactt tgactggagc ccaccacgat gttgctgcga atccctccca cgtcccgtcg 
78361 cctcgctttg atcggactgg cctgcctggg aacacagagc gttctggtac ctcaatcgat 
78421 cggaaacgcc aaagaacagt cggttttaga acaagcagac gacgcatccg gtgaaaccta 
78481 ctcgctgaaa cactcgcttc agcctggtca aacacttcgc tatgaagtca cccacgttgc 
78541 caaaaccaag actcgcatca atgggtccga agaaattgcg aatgttcaca ccaagagcaa
78601 tcgagcctgg accgttgcgg aagccgacaa ttccgaaatg acgttcgatc actcgatcga
78661 atccgtcgcc atgacgcagc aaagcggcga cgcggatgaa gttcagtggg atagcaccac
78721 gggcgaagaa ccgccaaaaa tcttcagcgt tgtcgcttcg cagatcggaa ctccgttggc 
78781 aacggtgacg atcaacaagc agggccaaga agtccgccgg gaagaccatg cgggttccaa 
78841 gtcctcactg ggaatgggga cacttgcgtt ggcactgcct gacaaacccg tcaagattgg 
78901 cgaatcatgg gccgttcctt ccgaaatcca agcccgaacc gaagacggct tcgtcaagca 
78961 aatcaaaatt cgtcagcttt acacgctcaa gaaggtcaaa gccggtgtag caacgctgag 
79021 cgtgaagagt gaaacgctga ctccgatcga agaagaatct ttgcgggcac aagtcattca 
79081 gcaactcagc aacggaacgc tgcgtttcga tgtcgacaac ggctatttgc tcagcaaaga 
79141 actgatctgg gacgagtccg tcgttggctt ccaggggcct ggcagcatga tggaataccg 
79201 ggccaagatg acagaggaac tactacccga agacagctcc acaacggtcg cgtcgaagaa 
79261 gaaaagtcaa cgctgattgt cgctgccatt ctcggcagac tcatcttccc cgtaggccag 
79321 gtcctacctg gcggggtgat tggacggtac tcgttcgaac tcgagaatca aattgtgact 
79381 cacctcgctt tgtgacaaag catcggctcg agtgccaggt gaaacctggc ctacgttgct 
79441 ctgctgcgtt tcacgcaagc ggtctgccat ctgacgaagc tcatcgatta gcaaatcttg 
79501 ctcgcgatag aagacgggag ctcgaggtgg tgatgcgact tcctggtgac cgacaccgga 
79561 acgacggtcc ttcagcagtt tggtccatag cagcctttgc ttagtcagct cgggatcatc 
79621 ccactcgggc aagccatctt ccagctgatc caaagtgttg cgaaccgcga catcgagtat 
79681 ccgttcaaac ctcgtttgct cgaattccaa ccggtagaac ggttccgaca acggagcgtc 
79741 taagaacgac ttctcgtttc ccagctcatc atcgaaccag ccgctccact ctcccgtctg 
79801 aaaacgttgc cattcggcga tcagggatcg ctggcgactc tctcgccgaa gatctttgtt 
79861 gaccaattga tcgacaacac gaatgaatgc ttcgcgatcc aattgatcct cttcggactg 
79921 cggaagagct tcgtcgctga acaggttttg ctcgtaagca atctgtctca aaccattgat 
79981 cacgatccat tcgttggact cggcattgtt gagcaattcg cccaacgcac cgaacacaaa 
80041 cgactgctcg cgagagagct ttgaccattt tgccgcaacc tccagagctt tgcggcgaaa 
80101 gggatgccac tgaaaatgtt gatacggact cacgccgtaa ggcagcccat tccaattcgg 
80161 aacactggca tatcccttag cagcgatcgg cgtgtcgacg cacaagccat acaactccga 
80221 ttgcgaaacg tgttccccca tcacttcgct ggacttcagt tctgcatcca atcgcagctc 
80281 cacatcacga tatcttgctg cttcattctc gaaggcatag ttcattcgct gaccagccca 
80341 agatcgactc ggatgtaaac ggacagtgta ttgaaaagct gaagcttcca ccaaattgac 
80401 cgttggaaat tcagacaaca gcaagctttt ccgggcatcc atttcataat cttccaggga 
80461 atacacatcc gcattcagat ggatattccg agttcctggt aacaccccaa cggttgactc 
80521 caaagtctca ggcgtgatgg gctcctgaac gatcgccaac atctgctttc gccactgcgg 
80581 catttccgac ggcgtcgtcc accaacgatg aatgaagaga cataggcaca tcacccccaa 
80641 agccagcagg caatacgcca gccctcgtcc ccacacagca gccgaagaaa caccttccat 
80701 cacaaaaggc gtcaaacgcc gacttgcgaa cagcaacacc ccggccgcca gccacagcaa 
80761 cccggtgtag cctccgtaga acgcgagcag gccaaacgcg ggtagcatcc agagcagagc 
80821 caccacaggt gcgccaaaga acgccagcga tggccggcgg ccccactgcc cggttagaac 
80881 gccgatcaag aacaagaggg tctggacgac cacgctgaac ccgaaagcct gccacaaggt 
80941 tccagttccg taatgcatcc caagatacaa atcgaggcca gaaacaaacg gaaacaccag 
81001 tatcagaaac aatgatgccg aggcggttgg gatcagtcgc gtcacccata tcaggttggg 
81061 tggccatcca cggtcggcga agaacccttg ccgcatcccc cgacgatcgc cgtgaaaagc 
81121 cgcgactcct agcatcagat gagcggccgg ccatccgagg acaagcaaca gatcatgaaa 
81181 accatgattt ccatccgtcc aaatcgataa tagcgcgaac agggcaaacg agaacgcggc 
81241 accaatgacc agacgagtct gccgaacctg ttgccaaagc aatgcacgaa acggacgcgg 
81301 tttggcatgc caagccactt gcaccacgct tggttgccaa gccgaagtcg aggccaccgc
81361 tggagcctgc caacggtggt cactcgcgcc agctaaacga tgccgagcca atccggccac 
81421 cacgcccccc agtgcagcga ggtaaacagt tgccaaacaa agcatcgaaa ctcgctgagc 
81481 agcggtcgcc tccaagaacg tgacgccata gtggaaatgc gtgctgacca acccagcgac 
81541 cgagtaattc cacaacaagg tcaggcccac gatcaacggg atcaccaaca tcaccgaacc 
81601 gaccggattg cgaacgatca acgccgcgat ccagccacag atcagcaagc ccactccgta 
81661 aaccatcagc tcgaagatcg attccgaacc ggagatccgt ccgatcgcag accaatccca 
81721 gcctccccgc aacaccaacc acgacagcgc aaaacaaacc gacgcaacca agaccgccag 
81781 cacccaggca gcgaacccga tcaacagctt ggacaacaac acttgccacc aagtcaccgg 
81841 caacgtccgc aaccaatcca gggttcgctc ttcgttctcc gtgcccagcg agatcggtgg 
81901 aattccgagt gccaccaagt tgggtagcag cacccacaac acggaaagtg tcgtcaggct 
81961 cggatccgag gccggatcga tcgcggcacg aaggaccgcc agcaagaaca acgccgccgt 
82021 gccgatgaca cccgccaaca atagcggttg cagcatccga aagtctttcc aaaccaaatg
82081 ctgaaatgct gatcgtggac gcgaaacggc cgggtaagca aaagcatcaa gcgaagtggc 
82141 catcgttcac ctcctgacga cgatcgaatg acgatggctc atcccgaatg ggttgaccac 
82201 aaaccgcaat gaagacttct tcgagcgtcg gcacgctgac atggaattgg cgcacaccat 
82261 cattaggtcc ccagtgcaga tccccggcgt cgttccaatc cgtgatcacc caacgtcttt 
82321 gacgaccgtt gctcgtttcg ctcaacaccc gcccaggcaa atcgggtgat tcgacatgag 
82381 cgtcatccat cgtcgctgtc acgattcgag tgtgctgctt caagtcgttc atcgactgaa 
82441 ccaatttgac ttggccgcca tgaatcatcg ccaccatgtc cgcgactcgt tcgacttcat 
82501 tgatgtggtg gctcgatagc aacaccgttc gccccgcagc ggcacggtcg accatcgatt 
82561 ccaagaactg acggcggacc atcggatcca acccactggt cggttcatcc aagatcaaca 
82621 gttcagggtc gtgagccgtc gcgagcgcca aagccacttt ggctcgttgg cctttgctca 



82741 cgatgaagcc atcgtcgtag aacgcactaa cgaaccatcc gatttcaggt ggagtcatcc
82801 actcatacag ggccggcgag tcagacacat agccgatctt tcgcctgatc tcctcggagt 
82861 tctgaccaca gtcatgtccc aagatcgatg ccattcccgc gtctggcttt tgaaagccgg 
82921 tcaagatccg aatcaacgtt gtcttgcccg caccgttctc accgagcaac gcgaatacag 
82981 tccccgccgg aacatgcaaa ctgacgttgt ccaaagcgac cgtgtttcga aaccgcttgc 

83041 tcactcgatc gacttgaatg atgcctccag gttcaacatt cggccgagct tggtgcgaat
83101 catacgcagg tcgattcgag gcaatgtcag aagcttcccg tgtcgttggc tcacgcccga 
83161 tggaatccaa ttcattcatg gctcggctcg ttgttggagg aagaagcaga ttcaaggttt 
83221 gattcgtctg agttggactt cgtggcggct gcactcaatc gcgccagctc cgactcaaaa 
83281 agtgaccgca actgatccgg cgacatgcca ccggcaatcg catcggccag agcgcggcga 
83341 accccgtccc cgaccaagtc atcacgggcg gatgtgcagg cttcgatcgc atcgcggcgg 
83401 accaccatcc cgcgaccacg cagtggcttg agaatttctt gggcctgtag ctcgcgatag 
83461 gcccgcgcga tcgtgttggg attcagcgcg atgtcgcgag aaagctgacg cacgctgggg 
83521 agcatttggc ctcccaccag cgtgccatcg gcaacggcga gcttgacctg acgaacgatc
83581 tgctcgtaga tcggcacgtc gctggccgcg tcaatggaaa agaacatggg cgtctcctgg
83641 gaaggagttc gagacaccag acgatgcacc gaacctgtgt gctatcacga tagcacacct
83701 cagtgccctg tcaacgtttt gcagcgaaaa atccgcacga ccgttccgaa cggaccttcg 

83761 tcccacatga tttctcgatc ttttcctgcg aacgacaaag tgaacaaggt gacaccgatc
83821 agcgagccaa cctttgcaaa cgccgccact caacacgtgg catcccgacg cgaattcgtc 
83881 cgcgtcgccc gcaatgatcc aaccgattct cgaagccaat tgccgatgtc gaccgccgac 
83941 tcagccaccc aatcgaccac accccgttca gcctttcgcg tcgattcgct cgcgatcggc 
84001 atgatcgtga tgttggcaat gacggtgctc gggcgtggca tcggattcat tcgcggcatg 

84061 gcgttttgtc gcttgatgga cgataccgat gtcgggcgct ggtcgatggc gtttggattc
84121 atcacgttga tcactccggt gatgctgctg gggatccccg gggtgctgcc tcgattcacg 
84181 gaacactttc gcctcaaacg ttcactcaca ccattcgtac gccggatcgc gatcgggaca 
84241 ctgtcctgca cagtgatctt cgtcacaacc atgttgtggc ttcccgattg gtttggatgg 
84301 atcgttttcc tgcaaccgca agacaaccga ctgatctatg gcgtcgcggg cgcagtcgtc 
84361 gggatgattg tttacaactt catcagtgac ctgaacggtt cacttcggca agtccgaatg 
84421 gtctcatgca tgcaattcat gcagggcgtt ggattcacgc tgctcagtgt tgcatggttg 
84481 ctgactggtg gaacattcac cggcgttgtc tggatgtttg ccgcatcatg cttggtcgca 
84541 tcgctaccag ggctgtggtc actcgttcgc tcttgggatt cagcgcagat tcttcaagac 
84601 tcggaagaag aatccaactc acccactgca gacaaaacac tgccgtccga tgacgtccca 
84661 tttggcctgt ggaacatgat tcgccgactg gcaccctacg cgaccgctct ttggctgatg 
84721 aatttgattg ggaacctttt tgagctttcc gatcgataca tgattttgca tttcattccc 
84781 gccaccgaaa cactctccgc tgaaatcgct gggcaagccg cggttggcca atatcattcc
84841 ggccgcatca ttcccatgct gttgctgagc ctggggacaa tgatcggtgg ggtgatgctg
84901 ccctatttgt ctgcggactg ggaagcgaaa cgtttcgccg cggtgcaaac tcgattgcgt
84961 gatgcgttgc tggcggtgtc gatcgtcttc acatgcggaa gcgcgatggc gattctgctt
85021 ggaccttgga tcttcaatgt gctattgcaa ggccgctaca ccgacgggat gacattgatg 
85081 ccgatggctc tttgcttttg tacttgggca gctctcgtga ccgtcggtca gaactatctg 
85141 tggacggttg aaaaaggaaa atgggttcca gtcgcgatgg cagccggatt gctcagcaac 
85201 cttgcactga acgcttggct gctaccgatg tatggcttgc aaggtgccgt gatcgcgacg 
85261 ctttgctctc acggcgttgt gatgagcgga gtttggttgg cgatgatcgg ctgtggatac 
85321 aaagtcgaca tgacgctggt cgtgctctca ctgttaccgg tcagcctgtt ggtctccccg 
85381 tatgtcgcgt tgctgctcgc gatcgcaatt ggcctgatgg tttggaatga tgatgcgact 
85441 cgtcaacgat ggttggaaag cctgccagaa aacgtccgag cgaagctgcc cctgttgggc 
85501 tgacgctcaa gttccgtacg atgggattcc aatcccgtcg aaatggaaac ccaaagcgtc 
85561 agcgaggctc tccaacgttc acgacggccc catccggggc gacctcctgt tcttcgccaa 
85621 ggactcgggg cttccacccc gagctatgga agacggcccc atccggggcg acgcagacga 
85681 atcacggcat tccacgcgaa ccttttgcgt cactattcgc acttgtatcc cgatcagcac 
85741 aattcacaat tggctcggta ttgccgatca tcgcacacga ggcgggcgtt cctccgcgaa 
85801 cgtgaacaaa ccctctttcg ccggccttga cgatcgcccc aaaaatggcg tttgatacgg 
85861 ggatgaacac acccaacgac tccaccgaaa ctcggcaact tggcccaaaa gacattgagg 
85921 ccaaaattcc ccaccgcgct cccatgctgt tgctggatga agtggtggag atcaccgaga 
85981 acacgctgca cgcccgcaaa acgttctccg cagacgagtt ctttgtgcag gggcactttc 
86041 ccgattaccc attggttccc ggcgtgattc agtgcgaatg ctgcctgcaa gccggagcga 
86101 tcctgctgtc cgaacacacg cccgctgtgg gtgagtttgt gcccgtggca actcgaatgg 
86161 acagcgtgaa gttcaaaaac atggtgcggc cgggcgacac agtcgacatt catgtgacgc 
86221 tcaaagaaca actttccaac gcttttttcc tctccggcaa aatgctgctc aacgggaagg 
86281 tcaccactcg gctcgaattc gcatgcagca tcacccagac tcaaccaacc ggttcctctg 
86341 actcggatgc aacatcgtga gcacggaacc aactccggcg atggatttct tagggatgtc 
86401 tggcaagaca ttcctggtca tgggtgtcgc caacaagaaa agcgttgcct tcgccatcgc 
86461 caagcaaatc gagcaggccg gcggagaggt catctatgcc gttcgcagtg aagctcgtcg 
86521 agatagcttg tccaaattgt tggccgaccg ccggttgatc gtttgcgacg ttgaacaaca 
86581 agaggacatc gaagcgatgg ccgctgaact tcaacgagac aacgtgacgc tcgccggtgt 
86641 tgtccactcg attgcctttg cggattacag cgatggcatc cgtccgttcc acgaaaccac 
86701 ccgccgtcag ttccttcaag cgatcgacat ctcggcgttc tcattggtcg cagtttgcaa 
86761 tgccatcaaa gatcagctcg ccaacgatgc cagcgtcgtc acgattggga tcagcaccac 
86821 gcggatggcc agcgaaagct acggcttcat ggcgccgatc aaagccgccc tggaatcgtc 
86881 tttggcattt ctcaccaagt cgttcagccg attcagccaa gtgcgtttca acgccgtcgc 
86941 ggctgggctg ctgaaaacga gtgcttcggc ggggatccct ggttacgtcg attcctacct 
87001 gtacgcagaa aaagtcatcc cacgcgggga agcagtttcc acgaacgagg tcgcgtcaac 
87061 ggcagccttt ttactgtcgt cacgcagcag tggaatcaca gcccaatcga tcgtggtcga 
87121 tgccggcatg tccatcaact attttgatgc cgacgtagtc ggagcggtca ccaacgcatc 
87181 ggtcgactga aaccgggttc ccaagctgag ttcgcccact caccgacaac acacatccat 



87301 cttctcacgc gaagacgcca agatcagtgt ctacgatcac ggactgctgt acggcgacgg
87361 cgtgttcgaa gggatgcgga tctactcggg caaagtgttc gcactcgaag accacatgac
87421 acggctctac gaaagcgctc gtgcgatcat gctggacatt cccatcgcaa tcgatgcttt
87481 gacgactgca gtgaacgaaa ccgttgcaaa aaatggtttg acagaaggct acattcgctt
87541 ggtcgtcacc cgtggtggca accaattggg gctcaacccg ttctcgtgtg aagatcccca
87601 agtgatcatc atcgcggata cgatctcgtt gtaccctgag aagttctaca ccgaaggttt
87661 ggaactgatc accgcgtcga cgatccgcaa ccatcccgcc gcgctcagcc cccgcgtgaa
87721 gtcgttgaac tacctcaaca acatcatggc gaagatcgaa gccattcgcg ccgggtgcat
87781 tgaagcggtg atgctgaaca ccaaaggcga agttgccgag tgcaccggcg acaacatttt
87841 catcgtccgt ggcggccggt tgatcacccc accaatcgat gcaggaatcc ttgaagggat
87901 cacgcgtaac accgtgattg atctcgctcg cgaaaatggg atcgaagtgg ccgaggaagc
87961 catgacccgt cacgacatct ttgtcgccga tgaatgcttc ctaaccggca gtgccgccga
88021 agtcatcccc gcggtcaaac tcgatgggcg agtcattggc gatggcaaac ccggcccaat
88081 gactcaaaag ctcaacgcgg ccttccgtgc atttgtggct cgttagtagt tcgccaagac
88141 cgtagcggaa gtcgccaaga ctttcggctg gagcccaatc gccgaaactc ttggcgagtt
88201 ccgctacatt ttctgtgcgt actttgggtc attgcgacgg aaacactcgg ttcgttttaa
88261 gaacgagaga gttcgccaaa gaacctccca actatctggc tcaaacaatc ggaggttggg
88321 ataaccaggg atcaggcatc cggtatagat ataacatgca aatccatcga ccggcacaag
88381 ccctttccgt tcggggttgc gagcgatgta ctccgcaaca tcttcgattg ccgaacactc
88441 gcattctgat tcgcgcagca catgatcgta aggctgccgc tgaaactgga aaccgattcg
88501 cttcaagcaa gagttccaat cctttcgaac cgccttcatc gcgagacgtt ggtcagtact
88561 ttcacttaat ccgcaccaca acagatgggc gtggtctggc atcaagcaga aaatggggca
88621 cgccaactgg taacgaaaca ttgcatgggc gaggacttca cggaacttgt acaagaaacg
88681 cgggtccaac caacctgttc gacgatcttc aatcgccatc gtccagtgga cccaagcaag
88741 cccgcgatag gactccggat caagccgttt cgatcgctgt ttctgtctcg cgtccatcca
88801 aaaatcctat cacccccagc cgtagggaac gtcgttcgcc cgcaggaaag gtagcggaag
88861 tcgccaagac tttcggccgg cccccaacgc cgaaactctt ggcgagttcc gctacgacgg
88921 cgagttccgc tacggcccaa tccctcacgc tgctgaaaac gcgtcgcgga gatgcttgag
88981 gttgcggggg ttttgaccga gatccccttt gccgacccga gactgacttt ccgcatcgac
89041 gcgagtgaaa acgtcaccgg aggacgaaac gtcctccgtc aaacgcacgt ggacgtcatc
89101 gacaaaccgc agccacttcg tccggcgggt caagtgaatc gattgcgttt gatcggtcgc
89161 caatgtgtcc attgggtcag cggtcatcga ctcaatttgc caaatggatt cggtctccac
89221 ccatcgctcc aaatcggccc gtacgtcagc gatacgttga tgcaatcgca tcggtctcat
89281 ttgcgggtcc gtggcggaat catcccactt ggcatgattg gtcgtccaat ctcgtcccca
89341 atcgtcgacg agagtcgcca cacgtgcgac aatcgccaac gttaccaccc cagcgattgc
89401 cgctcctgtc attgctcttc tcgtttgttt tcgcgagcgg tcatcccgct gcttactttg
89461 aattgtcacg tcgaccatgg atctatttgc cgatcaagaa gctgaccatt tgtttgctgc
89521 ccagcccttg gccgcgcgaa tgcgtcccaa gaagctatcg gagttcgtcg gccaacaaca
89581 catcctaggc gaagggaaac tgcttcggcg attgatcgcc agtggtcgcg ttggctcgat
89641 cctgctgcac ggcccaccgg gaaccggcaa gaccacgctg gctcacctga tcgcgtcaga
89701 gcagaacagc gagctgatca ctctcaatgc catcagcagc ggtgtgaaag acgttcgcga
89761 agtgctggcc aaagcacgcg accgcgtgtc ggcgggcgac ccaaggccac tgcttttcat
89821 tgatgagatt catcgattca acaaatctca gcaagacgca ctgctcgccg atgtggaatc
89881 gggcatcatc tctctgatcg gtgccacaac cagcaatcca tacttcgccg tcaatgccgc
89941 gctcatcagt cgaagccaat tgttcggctt agaacctgtt tcggtcgaag acatgcgcag
90001 cctgctgaaa cgggcgatca ccgaccgaga atgtggcctg ggaaatcaga atgtgaccat
90061 cgacgaagac gcgatcgact atctctcatc tgcttcggat ggagacgctc gcaaagctct
90121 cacggcactg gaagtggcag tgcacagcca tgagaatcca aaagcatcga tcacgcgaga
90181 cgatgttgcc gaatccatga ccagtcgcat cgctggttac gatgccaccg gcgatgatca
90241 ctatgatttg gccagcgcac tgatcaaaag cattcgaggc agcgatgtgg acgcgtcctt
90301 gtactggctc gcccggatgc tggagggcgg tgaagacatt cgcttccttt gcaggcgatt
90361 ggtgattttg gccagcgagg acattggcaa cgcggacccg caggcgttga tcatcgcggt
90421 cagcgccatg caagcttgcg agatgatcgg acttcccgaa gctcaactca cgttaagcca
90481 aacggtggcc tacctcagcc ttgctcccaa gagcaacgcg acgacttccg cgatcagcgc
90541 cgcccgtcgg gacgttcgag atcgccaggt cattcccgtt cccaagatgc ttcgttgcgg
90601 acactacacc ggcgccgaag aacttggtca tggcgacggt tacaagtccg cgcacaacac
90661 cgaggagggc gtggccaagc tggattactt gggcgtcgac cgccgctact acaaaccggt
90721 cgagcgagga ttcgaatcag aactcgcctc gcggctgcaa aaaattcgcg atcagttagg
90781 cagagagacc gagtgaatca catccgtgac tctcggacgg ggaaagcagc gtaacgaatt
90841 gttctattcg atcgttgctt caacaacttg tgagggattc actacagcca atccagctga
90901 gtgatccttc gcagtcggtc ctcgaaattc ccgagcagac cgccgactaa aacccattcg
90961 tgacgatccc tcagctcgat ttcgccgaac ttgttgattc gcaaaataga gtcaccattg
91021 acgccagctt catggagctg gcccatgtcg atttcgcctt cgttgataac gtccaacagc
91081 gatttgccgg tgagttcgat gaattgcatg gtcgcaacct agcagacgga ttcaaagacg
91141 agtcgtttct cattgcccca cggcgctcgt ccgtccaacc agcttaattc aacgcaaaag
91201 gttttcgaca accggcgtca aatctcggcc agaaaattcg atcgccgcca ctttgccctc
91261 ttgatcgatc accgccacaa acagcaacga aaccacgcca aaacggtatg caacggagtt
91321 ggcaacagac gcttccgggt cggaaacact ccgaaaactc ggaaacgaaa ttttgtttcg
91381 agattcaaac gcgggaacgt ctgtcgactg aacatccaag ttcattccca cgatcgcaac
91441 cttctccgga tactggcggg cgatcttttg aagattgtcg acgaccaaca aagagtctgc
91501 aaaagcggcc gaccagaacg gcatcagaac gactttgcca cgatagtcat ccatcgacag
91561 ctcatcgccc tttgtgctgg gcaggtctgg atcaaaagtc ttgccgataa ttttttcacg
91621 attgtcgcgg gcctgaagtg ccgtcctagc ctcacttcca cggtcgtctt gcatggatgc
91681 aaacttgtca tccaaaactt ggtaggtcgc ctcggacaaa tccaaacgac caaccgtctc
91741 cgcttccaaa ctcgcgcccg ctaagaactg tgtcgtcaag agatcgggtt gctcatctgc



91861 gttttcgttg gcctcggtga tgaatgtttc catcaaattc tgaacgcgag ctgtcgcttc
91921 gctgatttga aaacttgggc cggcgatttg gcgggccatg tccaccaatc cccccatcgc
91981 atcggactcg ccggcgattg cctcacctga ctcgacaaat tcttccagga tcatcgcgcg
92041 aacccgcgag gcttcttcaa catgttcgaa ttgcagtaac gcgtctttcg cttgccccat
92101 caccatcaag gttgctgcgt cgggagagga acttgcttgc agcaatcgat cgatttgaga
92161 aaccactcga ctgggcgcat ctgactttcc gtttcgcagg tcttcgatcg cgaaaccgat
92221 caacacaagc tggctgtctg agcggacttc cggatcggaa tcttcttgaa gcgactccgc
92281 caatagttgc aaactctcag ccgctttcag atcccccaaa gacgccaagt gtgaaagtga
92341 ttgcagctca cctcggcggc cgatcgcctt ctccttctcg gttgcatccg cgtgttcgat
92401 caagcgacgc gaagcctcac gtttccacgt ggcgatgcgt tttagctcgc tgaaagcggt
92461 ggcactatct tcgatgccgg actgaccgct tgtgatcatc cgcatctcga tatctgcttc
92521 ggagagaaac gtccggagct gctcgggctt caaatcattt gcgagctgct tcctgcgacc
92581 atccaactga gaagcaacgc tctgaggcac gtcaatcacg gctgggtcat cgacgactgg
92641 ctgcgcagca acctcgccat tcgtattatt actggtggcc ggtgactcgg gcgttgaagc
92701 agcgctgcta tccccgacca gcaatagatc gtcgtcggac gtctggtccg tctcctcgga
92761 ctgaccatcg ttggtctcca aagatggagc ccccgacgca ggatccgtcg actcagcaga
92821 aggctggcga gtgcagcctc caacaatttg ggtcaacaag atcgcaaaga cgcatgctcc
92881 gatggttttc ggcaacgtgc ccatcacgac ggcaatttga gcttgaatca tccaatgaca
92941 tccctgaaga acgaaagcca ctattctact ggccaacggg ccagttcgaa taccgttcgt
93001 tcaattctgg tcggattgga tgaccgacat tctgagatct gacaaaggca tttccggtca
93061 cgtccgttga tacgactcgg tagacgctcg aaataaacgc caccaattga atcaggaact
93121 gactgccgcg ttatgcggag gaccggttcg gtgcagacgc caacttggcg aacctgcaca
93181 aatcatcatc tcaggcagcg gtgtgctcag ccgacagagc atcgagcaag tcggcgaagc
93241 gatcatcggc ttcgcgacga cgacgcagac gttctgcgac gctccagctg ctgcggatct
93301 tcatgacacg gcgagtgatc tcagcgtcgg atggtgctgt ctcattctgc gatgcagttg
93361 agatcatcga gacgttttcg agcgatggca tggtttcggt caacattgcg attcttcctg
93421 aaacggggct ttcataaggc tcggtaccac gttgacgccg gtaccgattt ggatgcctcc
93481 atccgaggca ttggcgtctg gattggtggt aatgcaacca gggtgccaaa actccgaaaa
93541 ccagaaaaaa cgttccaact gtttcaatcg caccggagca aaccatgcgt ttcgcagcga
93601 aatcgctggg aaaaagagcc aaatgagtcg ccgcttgaat ccattcgttt tttggcggct
93661 gataaatcag cgatcagagg cgttggtcga acagacgtct ttttcaaaac cttgtaaatc
93721 ttgccatgtc cggtcgggaa tccgcatcag cgagaccgat gctggttcgc caaacgctga
93781 aaacagcgtt cgcagacgtt ccatcaacaa ttggcgcccc tccgcggatg aaccacggcc
93841 gtgtaagtca acgtactgaa tggcctgacc gttgtcgtag acggtgccat cgatttgcca
93901 gccttgcgac tgattggccc aagtaaacga accatcccat tcaaagaaca gtccttccag
93961 tgcctcgaat cggcctgcga cggtctccat tgagacgtca aatgcccccc gatcaggtcc
94021 gtaaacgtaa gtctgaaacg taaaagcagc agtcatcgga tagaaaatcg ccaattgagt
94081 tgggaatggg tctaaaacgc gacacttcgt atttgctagc aaaacagggc gtttcgcaac
94141 ctgaacccct ctaacaagcg accaccaaaa gatttttcac tgtttgtcaa cccggatttc
94201 cttgaccgaa ccgttttttg tcctagagtt gttggaccaa ggatctctgt ccgtactccc
94261 ttccacaacc tgtccactat tcgtcgttga ctaggcagtt gttactagga ggcaggcggc
94321 cgcaaaaagg aatgagccat gttggtactc tcgcgaaaga aaaacgaaag cattgtcatc
94381 aacaacgata tcaaaatcgt tgtggtggaa atccgtggag acaaggtccg cctcggcgtg
94441 gaagctcctc gcgaagtgcc agttcatcgt cgtgaggtct acgacgcgat ccagcggaat
94501 aacgaagcgt tcgatgcgaa cgcaccagcg gattcgaacg acgtttcgta ggttaatttc
94561 ctgctcactg gacgttcgtg aaggcacatt cgccctcctg acaacttttt tgttgtcgca
94621 aagcaaatcg gacgctcgta tcccttgacg ccttcatgcg atcttggttt gatactcgcc
94681 tcgttggcac gccagctttt gttggcaaac cgaccggccc cttcgtctag cggtctagga
94741 catcgccctt tcacggcggt aacacgggtt cgagtcccgt aggggtcact gaataagccg
94801 tctgcttgta aaagctggcg gcttattttt tgcgccgata aaaaactagc cagacatctc
94861 tggggctctc ggccacctgc agctagttca gctcagcacg gacacgctcg cccgctcgat
94921 gagcttcgct cgttcccaaa tcgaaggacg tgcttcgagc caacaagacg cgacgtggca
94981 tggccatgtc gctttctcgg gctactgaaa gtcagggtgg gtcgctcgaa ggatttccag
95041 gatctcgctg cgcattttgc cggtcaaatg ctgataagcc tctgactgat cttctccgct
95101 gaggatgtgt ttcagcttct ccagcacacg agatttcact tcggcgggca actccataaa
95161 agagtcggag tagatcaaat agctacacgg atatcggaac agccgtgttt ctagatcaaa
95221 gtcacgcagc gaccgctgct ccggatctcg gggtcccatc gaggcgaact ctttggcaaa
95281 tgtggaagtc cccgccaccg aatcggtcaa cgtgaactcg tcgcacatca acagatactt
95341 cagcactcga tcagcggacg actcaatccg tcgcgtcgcg ctctcgctga tgaaatcagg
95401 ttcgcgttcg agcaactcat tcatctgata ggactgatga agtgcctgcc gcgtttcgaa
95461 gttcgccgcc gtgatcgcat tgtgcatttg cgtttgatgc tccaacacca tcaacgccac
95521 aatgtcgcta tgcggtgtca gatgagaatc cgtccgaaag tactccccaa gatcgtcctg
95581 attcgccccg ctttcacggt ccagctcctg agcttcatcg gtgcagatga cgttgcccat
95641 atgacgcatc gaaccgtggg atccggtgac ataccaacca ccccatcgtt cgtcgaacgg
95701 gctggtgtga tccgtcgtga acgtcccgct accaagcttg ggacgccccg ccgcatcagc
95761 gaagacgctg cgaattaaga acccagggac attctgagtt cgcgacgacg cgtgacatga
95821 caagcaccca ccacgatccc gcacaaactc gggttctttt tcttctgatt gggacaacgt
95881 gtaaaagatt gccccctgct tggcatcggt tgaagcgaat tccaaaacat cgccatgctg
95941 gcaataacca acgtaaacgt cgtcgttgaa atacaacgct cgcggccgcc gaggcgaaat
96001 tcgatgcaac tgcaaactgg tctttgaaaa caccagcgtc tgcgaactta gcggcacatc
96061 cagagcttcg agaactgacg gcaaataccc aaactttgga tcgctgttca gtgccacttc
96121 gcccgcttcg atgcgttttg ctagcttggc gacaggatca ttgacctccg cattgaggta
96181 atcgatcggc ggacgttcaa aatcgggctg ggccgatgcc gaggacgcca tccaaacgac
96241 caagcaaaac agcccaccgc tgatcacccg cgatttcacc gcaaaaacac accgttcatg
96301 gaaggatgga gcggcaccat ttgaatgcag cgagaccata gaagattgct cgtacgtggg



96421 atgtagattt caaattcaca acaaacaact atgctttcca aaaccgccga atacgctctt 
96481 cgcgccatcg cctgcatggg cagccaggtc ggaaaaccag tcccggcgga tcatttggct 
96541 gagcaaacca aggtgcctcg gcggtatctc actcgtgtgc tgcaagacct cggtgccgca 
96601 gggctggtcc gctcacgacc gggtcccggc ggcggttacg aactgtgcgt tccgacggat 
96661 acattgacga tcctggacgt cgtcaacacg gttgctccag tcgaacgaat tcgaaaatgc 
96721 cccctcggcc tgaaatctca caccgatctt tgtccgctgc acactgaact cgatcgagtc 
96781 tacgccgcga ccgaagaagc ttttcgtggc gtcacgatcg atgacttggt caactcaacc 
96841 aatcccatcc tgccgctttg cgaaagttga gattgggcga ctccaagagc ggcattggct 
96901 tgcgtccagc agccactctt cgcaagcaga tctagctcgc atgaaccggc gagccaactc 
96961 gcgaaggctt taagaaagcg acgtggccga gcgatttcaa atccgaagcg gacttccgga 
97021 aagcggcctg cgtgttgaac aaccagagaa tccgtgtctg ccgattggct ggtcgcttgg 
97081 ggtgaggcga atcgccatcg gtaaacacga cgactccgtc atagccgcga tgctcatcga 
97141 tgaactcggt cacacaaccc aagtcagtgc cgcctcgacc ggtgatgctg acgtcgcgcc 
97201 gcgccgatcg aaacgtcagt ggttcgcacc ggatttgcgt atcaaaccaa acgacatcga 
97261 tcgagcgaat gccatactga aaaaaccgat tgataatcga gaatccgttt tgcaaatcac 
97321 ggtggctcat cgagcccgag acatccaccg cgaaaagcaa tcgcgtggtg aagtcgtacc 
97381 ggctacccat ctgagcgaac ccgtaacgcc gactcggccg catgcgagtc agccgacgat 
97441 caactgacag aacactctga cgaaattgcc gcaagatcga ccggtagtcc agtggtggcc 
97501 gaagcgttgc caacaaacgt tcttgagcgt gaccgggcaa ggttccccaa ccaccattct 
97561 ccgccgcgtc cgcgacggca gatttgatct cctcatgcag cagatcgtcg gcatcccact 
97621 ggtccgcgtt ttcagcaccc gtttggctcg gatcgacata cgattccaaa tcattgccca 
97681 ccgacgagtc cgctgcctgt tccagcgagg gtgaatccgg cgtgggctca ttcgactccg 
97741 ctgaagagcc gtcatcttct tccacggacg actcatcacc ggtttcgtcc ctttgctcgg 
97801 ccaccgcgtc gttgccagag tcgccagcgt cacttgagtc gtcgcccagc gtttcttctg 
97861 gatgactcgt gccctcgcct ggtttcgaat catccgactg atccggttca ggcgttgagt 
97921 cttccgaggg atcgggttga tcaccttccc gttcggcgag ctgccgatag taatattcga 
97981 agtattgatg atcgtgcgtt tcgcttccca agacttcccg cggtctgggc atgggcaact 
98041 tacttcgcaa gcattcttgc accgtcaaat tactggccga gtaagacaac tcggtcttcg 
98101 gttggcgtct ggcataggga tgccccaaca gaattcgcat cgcttcaaac cgcagtacct 
98161 cccccagttc atctcgcttg agcgacgaaa caaacgcggg attgaattca acgcggccat 
98221 gagccactcg caacgttgcg atttccggct tggccacgac ttcgtgcagc gtccacgcgg 
98281 caaacaacaa cggatcaacg agataccaac tctcaacgac acgagtgatt cgctctcgcg 
98341 ccttcgcgtc tcgtttcgaa tcggttttcc cagaagccgc tgagtcgctc actgaattcg 
98401 gatcccgtcg acgtattcgg tcagcaaggt tgtcagctcc attgactccg caaagaactc 
98461 catcgccggt tcgtacttcg gctgatccgt catggaaacc aagtgagcga ccgcttcatc 
98521 cagctttcgt ttgcgaagca gcttcaaata gctcagcatc cctttgagag ctttcgttcg 
98581 atcggcatcg tcgtatgggt caccgaccaa gtgaacgagc aactgttcgt tcaacatcag 
98641 cagttcgccc aacgccatct tctcaatcgc tttgcgatgc tttgtgaatt gcagcaacac 
98701 ttgatcgggc gacacctgca atcgcgatgc aaggcttctg cgaaacgcca tcgccgcggg 
98761 cgaacccacc acgccggcga tcaacttgac gtgcaccggt tggatcgttt cgacgccttg 
98821 aagcatactc gacacgcgag cccatccgcg gcggtcgggt gttttgacca accctgacat 
98881 tgaattcgcg tcttccgccg agggcgagat tccttcgccg tcgaggtact gaggctgctg 
98941 ctgcacaaat ccggtgactc gatcgtccac cttgtgctga tcagcccaaa gcaaccaatc 
99001 gtcgacggtg ggagcgaact gatacaggtt gaatcgtgaa acgagagcgg ggtcgagatc 
99061 ggtcagctga tattcatcgc cttcgttgac cgccgagacg atcacactgc cttggggcaa 
99121 tcgctttccc gcaagtgtct tgttcaatgc gagctccatc accgattgca gtatttctgg 
99181 acgagcccga ttgagctcgt ccaagaaaag cacgatcggc tcctcgtccg atggccacca 
99241 aaatggcgga agaaactcgc ttcgaccagt cttctcgtcc ttgtgcagca atccgatcag 
99301 atcgcccgga tcactcatct gccccaagaa aaatggcacc acacgcattc cgcgagagga 
99361 aaagtgttcg gcgatgatct gcgatttccc gattccgtgc cttcccacca acatgatgtt 
99421 ttgatcgggc ggagtgattt cgagtaactg aatcaactcg tgaacatcaa ctcgaacagc 
99481 caaaggaatg tctcaatcaa cgtggaaatc aaaatcaaaa atcactcttc gtaccgaatc 
99541 cattcggttc gccaatcagc ctgcatgaag ccgatcatct tgaaccgcaa cccgtcttca 
99601 tacagcgacc gcagcgttcg aatcgtctct tgcaattgcg gcaagacctt ttcgaaccgc 
99661 gtgaacggga tgtggatctc gatcatgtgc tgactcgaca acttcacaaa cagacgcagc 
99721 ttctgcgtgt ccatgctggt cgcgaagtca aattgttctt ctttggccag tgccttgacg 
99781 cgagcgacga tcgctttgga cttcatcgag cggacttttg cacgctttgc ttcacccgcg 
99841 tgccgcttga cgtaccgacg gagaacttcc gccaccaagc tctccacctc cccgaagtcg 
99901 ctgatcgggc ccgcgatttc gcgatcaaac tttccaaaca cgcattcgag ttcgccttgc 
99961 tcgttgcgtt ccagacgcaa ttcatcggac cgataatacg tctgcgatgc gtttgggtct 
100021 ggaatcgtga accgcaccca cgttggcgat cgcgagtcac gatcgccggg gtggttggtc 
100081 aatcgcacgc cctcgatctt ttccaacttg gatcgccact gtttgatcag cgccgcgcgt 
100141 tgccctggat caatttcaat cggtgactca cccgaaaagt gctctcgaaa gagctttcgc 
100201 acggagggca gcggcaactt cacgtcagcg tcaactcaac ggggcagtgg tccgaaccgt 
100261 gaatttcgca gcggatccgt gcatcggcaa cacgatccca aaaatttttc gcaacccaaa 
100321 agtaatccaa acgccaccca atgtttcggg cgcgagcatc acttcgatac gtccaccatg 
100381 aatagtgacc cgggccatcg tggaattgtc gaaacgagtc aatgaaccct gcctcggtga 
100441 ccgcatccag tcccgcacgt tcctgatcgc tgaagcccgc gttcttgcga ttggcctttg 
100501 ggtttgccaa atcgatttcc ttgtgggcac agttcacgtc gccgcagaac aaaaccggtt 
100561 tgcgtcgatt cagctttttg acgtagtcca aaaatgcttc gtcccattgc attcgataat 
100621 ccaaccgagc caagccccgc tgtgagtttg gcgtgtagac gttgaccaga tgaaagtcat 
100681 cgaaggttgt cgtcagcact cgtccttcgt tgtcgtgctc ttcgatgctc aaacccttgg 
100741 tgactttcaa aggctcaaca cgactccatg ttgaaacacc cgaataacca cgtttggtcg 
100801 ccgtgttcca aacctggtgg tatccaagct catccgccca agacaaatcg acttgctcgg 
100861 gttccgcttt tgtctcttgc aaacacagca catcgggctg ctcactctcg acaaattcgc 



100981 tttcaatcaa ctttggtttt ggaacacaac ttcttcgtac ggttgaacaa ccacaaagcc 
101041 atgtccttgg aaaagaaact gcaacgattc ccccgaaccg cgaccaatta tcgttttgaa 
101101 ctgcacatcg gtcttgagtt gaggttccag ttgcccggac cacagcacgg ttgcatttgg 
101161 gtccgtcacg acaggttgat gaggtgtgac cggcaacgtg atcggatcgt agtgactggt 
101221 gatcgcaacc atgcccgtgc cttccaggcg gatgttgaac aacccaccgg ccaacatcgc 
101281 ggtcatcttc ttcatcattt tgatgttgta gttcaatgac atttcgaacg cgaggaggtc 
101341 gtttccgttg acgcaaatcg cttcgttttg caactcgagg atggtgattt tctttccgct 
101401 gtcggcgcag aacacggaac ccttgccact aactttcgtc agcgccgtgc cttcgccgga 
101461 aaccgccttc ttcagcaaat tccccaatcc ttgcgacaga atcccttcgc gttcgaactt 
101521 cacattgccc gtgtaagcga tcattgaacc catttttgtc cacacctcgc cgttgaggtt 
101581 gatgtccaac attcggtccg attccagttc aaacaggccc tgattgaggt ccttgtcccg 
101641 cgttttctcc aagaagcggt cgagggaata gcgtctttca ctcatcgtga gatccttcag 
101701 gtcatggtga taaaaagaga acaacgtcgg actgcacgcc cacaacgcca ttgaacgcgt 
101761 cgggcgggtg ccacaccaac cgcagtgtag cgattccatc ggaaccggtc gtgcctcggc 
101821 gggtttcatc cgtgcgagcg ggttgctgag agcgtagaat catcctaccg atgatttcga 
101881 aatgcgatct ctcccgcctg cgttcccaac cacgcaccaa agcatgagac ctgtcctagt 
101941 gaatgcatct cgcttccttg ttgacaacca atacgacgat cagcttctcg aagtgatcga 
102001 cctgggccat cgttctcgta cgctgcatgg tcaattcggc caaaccatga ctacgacggc 
102061 taattgtctg ctcgtcgttt gcgttttgct tgtctcgttg ctcacgcccg cccccgtttt 
102121 cgcggccagc tacgacgacg cggtgcagca attttgcaac ggccagtacg acgccgcggc 
102181 aacgaccgcc gcttcggaag tcgagcgagg tgtttggagc gaacgttggc cgcggttgct 
102241 gatacagtgc caactgactc agggtgacta cgctgatgct ctgcaaactt atcgcgaagc 
102301 cctgcagcga tatccaacca gcatcgcact gcgttacatg ggtctggatg tattgcgatt 
102361 caacggcttg catgatgaag tcggcacggc agaagcagac ttgttcgcac aaatgcagcg 
102421 tgccttcgcg ggatacatca ctcgcgacaa cctgatcgcc gcaggccgct ttctgactgg 
102481 tcgtggcgaa gatgctcgca aggtattgga aatgttctat gaccgggtgc gtgatcgcga 
102541 cccggattac ctcgatgcct atttggccac agccgaactg gcgattcgaa aaggtgattt 
102601 ccaagtcgcc gcgaacacgc ttcaacaagc cttgaaactg gaggaagaaa ctcccgacct 
102661 gcatcacttg ttggccaagg cgatggaatc cagcgatggg caggctgccg ccgagcaaat 
102721 cgcaatcgca cttcgtatca acccacaaca tctgccttcg ctgcaatggt tagcggaacg 
102781 tgcaattgat cgcgagcgat atgatgaagc caaagacgtc gcgaaccaaa tgcttcagat 
102841 caatccgcat gaaccgtcgg catgggcttt gatggcggtc atctcgcact tgagaggcga 
102901 atacgatgtt gaaaaactaa tgcgtgcggc ggccttgtcg acatgggaac aaaacccaaa 
102961 cgtcgatcat ctgattggtc gcaaactttc tgagaagtat cgcttcgccg aaggtgctgc 
103021 gtatcaaaac ctggcactcg atgctgatcc tctgcacttg gccgccagct tccaactcgc 
103081 gcaagacatg ctgcgattgg gcgaggaaga gatcggttgg gaaatcgcgg gggaggtggc 
103141 caagtcagat ccatacaatg tggttgccta caacctgatg acgcttcgtg atcgaaccgc 
103201 gaagttcacc gttttgaaac aagacgggat tcaagtccgc atggacgcaa ccgaagccaa 
103261 actctatggc gatgcggtga tggagttgct tttggatgcc aagcaagtct tgtgcgagaa 
103321 gtacgacgag atgccggaca aaccgatctt ggtcgagatc ttcccacacc aaaacgactt 
103381 tgccattcgc acgtttggtc tgccgggtgg ggcagggtat ttgggcgttt gcttcggccg 
103441 agtcatcacg gcgaacagcc ccgcgtctca aggcgaacgc ccatcaaact ggaaaagcgt 
103501 tttatggcat gagttctgtc atgtggtcac gctggagaaa accaacaacc gaatgccgcg 
103561 ctggctcagc gaaggcatct cggtttacga agaacgccaa cgcaatcctt cgtggggcga 

103621 gaagatgtcg cctcaatacc gatcgatgct gctctcggac gatctgactc cggtcagcga 
103681 cctgagtgcg gcgttccttt caccaccatc cgcgatcgcg ttgcaatttg cttactacga 
103741 atcgtcattg gtaatcgagt tcctgatcga acaacatggt catgatgctt tgctggcagt 
103801 gctcgatgac ttgtccgcgg gcattccgat caacgatgcc ttaacccggc ataccggatc 
103861 actgcgacga ttggacgctc agtttgatga atacgcaaaa aaacgagcca accagtttgg 
103921 ggctctggtc gattggtcgc gtgacacgct gcccgaaagc ggtgacctgg cgacctggaa 
103981 aggttggacc cgagcgaacc cgaccaacta ctggggtttg cgagagttag ccaaaagtgc 

104041 catcgaaggt gaacagtggg agcaggcgct tatcccgctt tctcgaatgc agcaattggg 
104101 cgtgctcacc ggcgaacgcg gcggtccgct ggaatggctg gctcgagccc atcgcgaatt 
104161 gggacaagac cggcaagaaa tccgagcaat tcaagacaat ttgtcgcaat ctagcgatgc 
104221 tttgccggct ttgcggcgtt ggatcaatat tggccaatct gaagagcagt gggaaaacgt 
104281 tttggacgcc tcgcaacagg cgctcgcgat ccagcccttg ctgcccgaat ttcatctcgc 
104341 atccgccgtc gccgccgaaa agctcgatcg ccacgaattg gcggtggaag ctttgtcggc 
104401 gctgctcgca ttggatccgg tggaccctgc ggcgttgcat ttccgtttgg ccaatgccta 
104461 cgacgaacaa aacgagtcct tccccgcgaa acaccatgca ctgatggcat tggaattggc 
104521 ccctcgctac cgcgacgctc accgattgct ctggaaactg caccacgcgg tctcggaaga 
104581 gaatccggcg gacgcggaac agaatgatgc cgagaatgtg tcagccgtcg aagaggagga 
104641 agctctatga ctcctaaaaa cctcaaccta atgggcatct tgcgagcact tgctcggggc 
104701 agcgtccatt cccattcacc tctcccccga ccatcgtcgg gggagaggtc ggacgcggcg 
104761 ttcagccgtc gtccgggtga gggggcggat cacatggcgg tcgaaatcgc tatgccacag 
104821 aacgccccat gtcacgcgga cgccccctca cccgagcaac gcctgaaggg cgtgctcgac 
104881 ctctccccca atgaaattgg gggagaggtg acaactgcgt tcacggtcgg tgacaagacg 
104941 caaactcgcc agcaatccac ccctctaacc ttctctcaca aaccattcgg ccaccgtcgt 
105001 tggattttgg ccgcgttgat tgtcttgatg ggcggggccg cgtttgctca acgaggttac 
105061 tggggacgtc gcaaccgatc gcttcccacc gaccgcaacg gtgtcccaac atgggaagtc 
105121 gatccacgat tcccacaaga cagcttcacg ttcgctcgaa tcgaatacga ttcctacggc 
105181 ggtcgaggac gaggtggcgg aggctgctgg accgactatc ccgacagcga tctcaacttc 
105241 tcacttcgtt tgcagcaact gacgtcactg aaggtcaatc ctgatccggt ggtcattcgc 
105301 ctgaccgatg atgaattgtt cgactaccca ttcatctaca tcatcgaacc cgggggcctc 
105361 gtcttttcag aagaagaggt gaaagcactc cgcaagtatt gcctcaacgg cggcttccta 
105421 atggtggatg atttctgggg cgactcccag tacgaaaaca tgcggagcga actcaaacgc 



105541 tacgacctca aagaaaaacc tcaggtcccc 
105601 ggcagtgtcg gttcgtggga atggtcgcgt 
105661 cgagccatca ccgatgacga agaccgcatc 
105721 ggcgatggct gggaacgaga aggcgaagac 
105781 aaagcttatc cgatgggaat caacatcgtg 
105841 tgtatcaacc atcgacggtt ccgacgctcc 
105901 taccatgacc cacgaagacg aagcccaagt 
105961 aatcgttcaa gaattgtcta aggtcatcat 
106021 gatctgcttg tttgccggcg gtcattgttt 
106081 attgctggtc agcagcgtcg cgaaaatctt 
106141 ccccgatttg atgccggcgg acatcaccgg 
106201 tcaccgacaa ctgcaattcg tcaaaggacc 
106261 aatcaaccgg acaccgccca agacgcaagc 
106321 ggtcaccgcg ggaggccaac gatttgaact 
106381 aaatccgatc gaaatggaag gcacttaccc 
106441 tttcaatgtg ctgatcgact acttgccgcc 
106501 gacatcgacg aagcccgaac cgattcagcc 
106561 tcacgccgcg gttcgacgcg ttccgattgc 
106621 ggtcgccgcg actcggccgg gacgagatgg 
106681 ctggggtgca gggcttcggg ccgctcaaac 
106741 gctcaacggc cacgctcatg tccgaacgga 
106801 ccgacaccga gtcttgctga gctatcgagc 
106861 ggtcacacga cttcttcaag aaatccccac 
106921 aatctcgctc gtcggtcaaa gccaccaacg 
106981 cgaagtcgtc gcgggctgcc gcttcgccgg 
107041 tggacccgca tgctctgatg cggatcaaga 
107101 aaggtttctt tggcggactg caccgaagcc 
107161 aataccgagc ctactcgccc ggcgacgatc 
107221 gcacagatcg ctattacatc aagaagttcg 
107281 tagtcgacca aagtcagtcg atgggttacg 
107341 cgagaacgct ggccgcgacg ctcgcctact 
107401 tgatgacgtt cgacgataca atcgccgacg 
107461 tgcgtcaaat ccttgcctgc ttggctcgtc 
107521 gaccgatctc tcaaattgcc gcgatcaccc 
107581 acatgctctc gccgaccgaa gacctgcaac 
107641 atgaagtgat cgtgttgaga atcctggatc 
107701 ccaaggtctt ggtggacatg gaaacgggac 
107761 gcaaagccta tcaggccgcc ttccaagaac 
107821 ccgtaggagc cgcgatttac accagcacga 
107881 atttcgtctc cgctcatcaa cgtcgtgcgg 
107941 caggaggaac cttcggatga gcatgctcgc 
108001 cgcggcaccc atcttgtttc acctgattcg 
108061 ctcggtgatg ttcctgcgag aggtcccgcc 
108121 gtggccattg ctgctccttc gagcactcgc 
108181 accgtttctg cgtggtgctg acgaccagtt 
108241 gttggtcgac cgaagtgcca gcatgcaacg 
108301 agctgaaacg gttctgactg atctatcagc 
108361 caccaaaatt gaacgcatct gggaaacatc 
108421 cgcactcgct ctgcaagctc tcacggaagt 
108481 aaccgccctt cgcgaagccg ccgattgggc 
108541 agaagacgga ttagccgacg aagaagcgat 
108601 catcgttttg atttccgatc tgcaagacgg 
108661 ttggcccgaa caagttacct tggacatccg 
108721 gaccgcgatc atgatgaatg atcaaacctt 
108781 tgccaatcgc aagcttccgg tccgggtcgt 
108841 gacgttgcaa tggaacgacg ccagtgacaa 
108901 ccagatcatc cgtctcgacc agccggcgga 
108961 caactcgccc tttgacaata ccgttcactg 
109021 gctgcatgtc ggcgacgttc aagacgatcc 
109081 ggttccgctg gacaacgcaa cccgaacggt 
109141 gccgaccgac ttggatccac gtgaatgccc 
109201 cgacgccttc atacagtcac tggtaacgta 
109261 gttggatgaa tccgatgacg cgaaggattt 
109321 cgattcactc tccgcggaag aagccaccgt 
109381 tttttcgtcg ccgttgtttg catcgatggc 
109441 ccaattttgg aatcatcgaa acgtttcagg 
109501 aaattttgat tctggtgcac cggcactggc 
109561 gggttggcaa ccgtcttcga gccaactggc 
109621 tcaaatgttc accggcaacg aagggacgac 
109681 gccgcttccg tggcccgcaa ccgagggcgc 
109741 gccttacgag caagcctctg atgctgactt 
109801 gctaccggac gaagccccca tcacattcgc 
109861 cgacgcgatg gacgaagaca cgctcgaacg 
109921 ggcggaaaaa ctgactcaag cgactctgca 
109981 acgtttgtgg caatggttgt tactcgctgc 



gcgatcaact ccgccagacg ccgcgccgac 
gacggaagcg acaccagtgt tcctcactac 
atggttttca tttgccacaa caccgacctg 
caatggtact tcgatgagtt ctccgtcaag 
acttatgcga tgacccactg aatcattcct 
tccaacccga ttgcacttca tgacaacgga 
ggtggagcaa attcgcgacg gacgagaacg 
cggccaagaa gaagtcatcg agcagttgct 
gatcaccggt gcaccaggac tggcaaaaac 
tcatttgaac ttccagcgga ttcagttcac 
aacagaaatc ctggagcaat ccgccgacgg 
aatcttcgcc aacgtgatct tggccgatga 
cgcgttgctc gaagcgatgc aagaacacca 
ggaagaaccg ttctttgttt tggccacgca 
gctgcccgaa gcccaactcg atcgtttcct 
caaagacgaa ctggcggtgg tcctgcaaac 
aatcttcacc ggtgaagacg tcgctcgctt 
ggaatcgatc gccgcatacg ccgtgcggtt 
aacgccggac ttcgtcaacc aatacgcttc 
attggtgctc ggtgccaagg cgagagcttt 
agacatccag gctctcgcgc acccgacgct 
cgaggccgaa ggtttcaacg ttgagaacct 
ggaagtctga attgagctgg ttccaacgtc 
gcagcacggg tacactccag ggcaactcct 
cgggtggcaa cggtaaaacg gccgcctcga 
atctgcaact gcgatccaaa cttgtcgtgg 
ctcttcacgg ggcctccgtc gagttcagcg 
cacggggact ggattggaaa ttgttcgctc 
aggacgaaac caaccgtcgc tgctacttgt 
ggtcactcga atacaccaag atggaatacg 
tcctcactct gcaacgtgac catgttggac 
tcgtcccggc gcgcagccga gtcggccacc 
caacgtcggg atctggaacg gatgtcaacg 
gtcgccgagg tttggtggtg ctgatcagcg 
ggtctctcgc actgctgcgt tcgcgacaac 
ctaacgaagt gaatctttcg gtggacgaat 
gcaaaatcca agtcgatccc gatgcgactc 
acgccgagac gattgaatcg atttgcaacg 
ccgatcgccc tctgcaggaa gcattgtctg 
tgggtgccag tcgatcgggc atgctgagtt 
cccgtttttc ctagccggtg cactggccgt 
gcagcacccc aagggtgccg tgccgttcag 
acgactgacc cgccgcagcc gtttggatca 
gttgttgttg ttggccgccg cctttgctag 
caacaacaca cccgttcggc ggatcgcggt 
ggaagacctt tgggaacaag caatccaaaa 
gcaagaccaa ttcgcggtgt ttgtcttcga 
gctcgcggca acatccgatg aacgtcagaa 
tcaacccact tggcgagcca ccgatctggg 
acagtggccg accgagtcac cgacagcggg 
tacccgaggc cagctaccgg gtcctgccag 
ttccaaactc gatgcactgc agcaaaacac 
gcgagtggtg ccgaaggaag aaggcaatgc 
ggagaacttg caagaatcat ccgcgtccaa 
caattccaaa ctcgcttccc agttcgatct 
acaaacgatc caggtcgaag ctgatgcaac 
ttccgtcacc gcaatggtcc tgtccggtga 
ggttgctccc gagaaacgct cgtatcgact 
tcgcgaaagt ctgttttatt acctggaacg 
ctcgattcaa accaccaccg tcgcgaagtt 
gctcatggtt gtcagcgatg tcgcaaacac 
caccgaagca ggcggccgag tgttgtttgt 
gatcacttgc atcgcatcag tcaccgacaa 
cgatgactac gccatgtggt cgcgagttga 
ggacccgcaa ttcaatgatt tcagcaaaat 
acttgaagaa gccggttgga acatactggc 
tcaaacgggc aacgtttggt tgctgacaac 
tctgtccaca aaattcatcc cgttgatcgc 
gagatatgtg gctgggcatg tggtgggaac 
agaactggtt cgctcggacg gtgaatcatc 
catcgatcgt cccggtgttc atcaattgaa 
gagcaatctg tccgagtccg agagtcaaac 
tctgggcatt ggattgggcg aagccgccac 
acttcgagat cgcgaattgg aaagcaacca 
gatcggaatg ttggtgatcg aaaccctctg 



110101 gacgactgcg tttttggcaa tcgcttgggt ggctggcttg gaccatcacc ttgatcgttc 
110161 ttctgtggct ggtgtggacg cagcgaccgc ttgcattcct gtcgtcgatg acactttcaa 
110221 cctcttcgct ttccacgtcc gtcgcgatcg tatcgacgtt ggtgattgcg atggtggctt 
110281 gcttgatcgc cgcccgcttg cgaatccgaa acattcagtc cgtggctcga aagatcgaac 
110341 atcgataccc ggacctcgac caacgtctga tcaccgcggt cgacgattcc gttgccaaac 
110401 aaccagacaa ttatctgaac caacgtgtga tgcgagaagc ccgtgagcac gctcgtcgca 
110461 acgattggcc ggacgtcgtg tcgtctcgtt ggttgctggt caaccgattg tttggattgt 
110521 cgggaacgct agcgacgctt gccttggccg gcttctttct gctggccaaa ccggactctt 
110581 cggccaaagc ttcctcagtt gtgttgccca aacccgacgt ggtgatacag cccggcagca 
110641 tggaaatcga acgcggcacg aacttgttgg tcacggcgga attccaagcc gattggattg 
110701 gtcgccaatc cgcggtggaa ccagaactcg tcctcacagc gatcgacaat caaccagtcg 
110761 caaacgtcgc aaacgtcgca aacactgaaa cagttccgga agaccaagct gaaccacgtg 
110821 ttgcacggct gcccatgcgg cgaagcatgg ctgatccgat ttggggtggt ttggttcccg 
110881 aagtcgacga gtccttcact taccgagtcg tattgcctca ctggtcaagc gaggaatacc 
110941 gtgttgaagt tttcgaatac ccggccgtca cccgaagcga cgctgaaatc gattaccccg 
111001 agttcaccgg acttcagaac cgacgcattg aagacaccgt gcgagtctcg gtggccaaag 
111061 gcagcgaagt cacttggtgg gttcgaatca acaaaccagt cacgtcggcc caactcttcc 
111121 ccaagccata caaaaagaac gaggccaact acgaagactc cgctcccttg gagatgaccc 
111181 aagatcccga gcgtccactg tggtggtccg tcgcaatcaa ggtggaagaa agccaacgat 
111241 acgatgtgga actgaaagac gacaaagatc gcacgaacca gctctcgact cagttcatcg 
111301 ttcgaatgct acccaaccaa ttagccaaaa ttcgccccac ggtcggtggc gacgtttccg 
111361 tgtcggcatt ggaagagttt ctcatcgccg cggaagttcg ggacgatcac gggctcgctc 
111421 gcgcgggcgt ctcgctgaca tttgccggca aaccaccgca ggacatcgtg ttggctgaga 
111481 acatcgctgc caatcaaaag caaacgatcc agcacatggt cgacatggaa tcacttgagg 
111541 ccaaacccga cgatttgctg tcgtactctt tttgggccga ggacaaggtc gcaggtgaat 
111601 cttccgagac tgaagtccgt cgcacgcaaa gcgatttgta ctttgccgaa gtccgacgtt 
111661 tcgaagatat ttttcgtgaa ggcgatcctc aatccagcca accttcggaa ccctctgaac 
111721 caagcgaaca agaacaacaa gccgaagagc tgattgaaac gcaaaagcaa atcatccagg 
111781 ccacttggaa gctactgcag tccaccgttc ggtggaacga gcaaacgacg gcggacgttc 
111841 gcgtgatcgc ggactcgcaa caggatgtca ttgctcagtt ggatgaactg gctcaagagc 
111901 tgaacgaagc cgaagccaag cagcacgccg ccaacgttcg acaatcgatg caaaccgccg 
111961 tcgcggaact gaaccgagcg attcaaacca gcgatcgtga accattgtcg gctgccgtcg 
112021 tggcggaaca acccgcttac gcgggcctgt tgcggatgcg agccagtgaa ttcgaagtca 
112081 cacgttcaca acagcagcaa caaccgagtc aaggttcggc gggtcaacaa cgcaagcaac 
112141 aacaattgga tgaattggaa ctggaccagg aacaggaccg ctacgaaacg cagcaacagg 
112201 cgcaaatgac gccggagcag caagctcaac gtgagattcg gcaagtgctc agccggctga 
112261 gtgaactcgc ccgtcgccag caggatctca acgaggaact ggcccaactt caatcagcga 
112321 tgcaattggc tgacgacgaa gaggaaaagg aagaactacg ccggcaactc gaacgtcttc 
112381 gcgagcaaca gcaggatatg ctccgccaaa ccgatgagct gatggaacgc atgaactcac 
112441 ccgagaacgc gtccaacatg caggaagcca gcgaacaact cgagcaaact cgcgaaaaca 
112501 ttcgtgaggc tggcgaagcg atggaacagc aagacgctga aacggcttta tccagcggca 
112561 aacgcgccga gcgcgaactc gaatccgttc gagaagaaat tcgcgaacag gctgcgggtc 
112621 agttcaacga agcgatgcag tcaatgcgtg atcgagctcg cgaattgaac acgcaacaaa 
112681 aggaactcga ggaacaattg tccgaaatgg acgcacccag cgaatcgcct ggcctgcgtt 
112741 ccagcggcaa tcgcgaagaa ctcaaggacg cactgaccga gcagcgtgaa tccttgggcg 

112801 aactaatgga ggacattgag gaggtcgtcc aggaagccga agagtccgaa ccgttgctcg 
112861 cacaaaactt gtacgacgct ttccgcgaaa cgcaacaacg ccggacagaa gaacgactcg 
112921 atgccgccag ccaacttctt cagcgtggct ttgacgagca gtcgcagcaa atggcggggc 
112981 aggccggtga agcgatccag gaattaagcg agcaaatcga aacggccgcc gaaagcgtat 

113041 tgggcaacga agcagaagga ttgcaacgtg cagcgagttt ggctgagcaa ttggaatccg 
113101 ctgtgtcaga cgaagtccag caagcacgcg gagacgattc cggcggggca gggcagggcg 
113161 aaacctcaga gccaacggaa cgtcagccag gagaagacaa accccaacaa ggacagcccg 
113221 gtgagaacca gccaggccaa gggcaagaac aaggggaagg ggaaggacaa cctggccaac 
113281 aacaacccgg cgaggaaagc caaccgggtc aaggtggccc cagtcaatca caagatggcc 
113341 aagagccggg agaacaatcc gctggcgaat cagaaacggg tcaaccatcg aatccgcctt 
113401 cgcagtcaac ctcgccgggg cagggtgggg cagggcaggg cggacaagct ccagacgaag 
113461 aaccttcgac cgagccgact ccgggcgaac cgtctgaaaa caccgaaccg caagaatcca 
113521 cggagcgtcc gcaagggcca cctggtcttc gccctagttc gcaatcgcag caagggcaac 
113581 cacagccgaa ccaaccgcaa gaaggtcaat ccggcggtcc caatccatcc gccgcccaag 
113641 ctccctcctc tcccagcaac ttcacctcac cgctgaccgg cgaaggattt cgcgaatggt 
113701 ctgatggatt gcgagacatc gaagagatgg tttccgaccc cgagttgcgt tcacgtgcga 
113761 cacgaattcg ggaacgagcc cgcgacatgc gtgaggaaat tcgccgcgaa tccaaaccgc 
113821 cgcaatggga tttggtcgag gagatgatcg tcgaacccat gcgagaactc aaacgcgatc 
113881 tcgccgaaga atacatgcgt cgcagcgcca agaaacaatc gctggttccg atcgatcgtg 
113941 atccggtacc ggaccagtac tccgaagcgg tgcgtcgata ttacgaacga atcgggagcg 
114001 ggctttgaac gacgctttca attcatctga caactggagc cagcatttgg tttggggtgc 
114061 accggagtgg atctttcctg cgggcatcct agccgtggtg cttaccgccg cgattggttg 
114121 gtcctacgca cgttcacgcg caaccacgcc gattcgcgtg ctcgcggcta cgctgaaact 
114181 gatcgccgtg gcattgatgc tgctgtgttt gcttcaaccg atgtggcgag gcgaacgtcc 
114241 tcgaccgcag gccaacctgt ttccgatctt ggtcgacacc agccagagca tgcagctcca 
114301 agatgtcgac gaggaacttt cacgtgggca gaactttgcc aacgcgttgg ctgccgactc 
114361 ggactggcga gttcgtctgg aacaaacgtt tgacgtccgc ctgttcggtg cagattcacg 
114421 actgagccgt atggccgaca tcgacacaat cgaattcgac ggcatgcaat cgtccataca 
114481 aacttcgatc gatgcattgt cgcagcgact caaaggacgc cccgtcgccg gtgcgttgtt 
114541 attcagcgat ggcaatctca ccgatccctc cgcggaacag atcaccggca ccgattcacc 



114661 acgagtcagc gtgactcaaa gcgattttga 
114721 gatggcgacc gggctgaaag accaagacgt 
114781 ttccgtcgaa gagaaaactg tcaccatcga 
114841 tcgttttcgc cccgaacaat ctggcaccca 
114901 acaacgtcaa tcgatccgtt ctcaaatcga 
114961 cggcgatgaa gccacgttgc gaaacaatca 
115021 cccgttccga attctgtact tggccggacg 
115081 tgcgttgcag agcgatgcgg aaacccaact 
115141 acccaaattc agcttccgcg acaaaggcgt 
115201 aggcgaggac gaagaagaaa cccgcgagca 
115261 agtcaaagaa tccgaagaac tcagcaaagg 
115321 gtatcacggg atcattctgg acgacatcga 
115381 gctgctgcga caattcgtga gtctgcgtgg 
115441 atccttcaac gacgattcct ttggcaagac 
115501 accgcgcaat cggtccgacc aaccggctcc 
115561 catgctgcaa ccgtggcaac gacttcgcga 
115621 aaagatgccg ccgttccgcg tgacgcacga 
115681 attggcccaa ttggaaacga gttctggcca 
115741 cgggaaaggc cgaacagctg ccatgccgat 
115801 gaaccaatac gacaacacca ccgaggtcag 
115861 ggcgcgttgg ttggtgggcg aagtccctcg 
115921 ctcgaccgga tcgaccaacc tgcgagtctt 
115981 aaatgcgacg gtgacactca acgtgaccgg 
116041 tcgtccggat gcaaacgaac ttggcaccta 
116101 tcggttcacg gtatcagccg atgtcaaagc 
116161 aggtggttgg gtcagcgatc ccggcatgga 
116221 ttggctgaag aacttggccg aatccaccgg 
116281 cgacttcgtc agcgacctac ccaatcgaga 
116341 tctttggcac caaacatggg tgatgctcct 
116401 acttcgacga tggaagggac tcgcatgagt 
116461 tggcccaagc tttgtgtcgc caccgcgttg 
116521 caagagcccg ataccttgca agaatccggc 
116581 acgtcatcca acgaaccggc gacgaaaccg 
116641 tcggatgaga cccaggtcat gcgtggcgat 
116701 ccaacgtacg aacccctttt tcgcgaatgg 
116761 gcgaaactga acgccatccg gattggaagc 
116821 caattgttgg aacaggccct ccagggtttg 
116881 tcgttgatcg gccatggaac ctttgctcgc 
116941 gatgtttccg cggaccaaat gaaggactgg 
117001 gtcaacaacg cttcatcgag cggcccgttc 
117061 atcgtgaccg caaccaaaag cggcaacgaa 
117121 gcggaagcgc tcgcggcgat cgattctgat 
117181 gaagcatttg tcagagccag caaggacgtc 
117241 gcgacagaac acgcgttgat ggatgacaac 
117301 cttcgcaaga aagtgctcac accagaacag 
117361 cgaatgacaa tcgccccggt tgccaactcg 
117421 cgcgatgaat tggaagaccg gctggatgca 
117481 gcgttccctg acaaccaatg ggcggaccac 
117541 ctctaccacg aagttgaaag caactcgcct 
117601 ccgtctgcgt cgacgctgtc ctcgaaaacc 
117661 cggattgtgg ctaccccgcc ttcgaatttt 
117721 tcaatcaagc acgcgaagag ttgcagcttc 
117781 gcctggtgga tcccgacgtg cgagccgaat 
117841 cggctgggag cctgaattgc cccgtgctga 
117901 tcgatcgtgc cgagcaacgc gactgcttga 
117961 tggaagattc cggcgtggtg ctggcgatcg 
118021 gctactacct cgtccgcagt gacgaagcgt 
118081 acgtgaaggt gacatttgat atctatcacc 
118141 atttgacgca gaacgttgat gcgatcgctc 
118201 atgaactcga ccgtggcgaa ttgcattacc 
118261 gctactccgg ccacgttggg ctggagtatt 
118321 gcgaagcggc ttcgtggatc aagaaggttt 
118381 tgcggtgctt ccgttattgc gagacttatt 
118441 cttgccgccc gaacctccgt tgccagagaa 
118501 gttgctgcgg ccaccgtcgc cttcgaagtc 
118561 caaccgtttg ccggccaccc aagttttgcg 
118621 ttcgggcagg tcgatcaaag tgtgcgagtc 
118681 atcgatacca gcttcgtttg caactgcacc 
118741 ccaaccgaca tcaaggcgat atcgagtcat 
118801 tttgggacga cgagggcgat catcgccgaa 
118861 atcaaatgag tcacgaccgc gagaattgtc 
118921 acgttccttc aacaagaacg gacgtccgtt 
118981 catctccatc ggtttgccag attcttccgc 
119041 caaatcctga tcagcggaaa catcggtgat 
119101 gttgatatcg gcaaccgttg ggatgtcgat 



gacgtcgcca atgaccgttc gcgccgatgt 
tgtcgttcaa ttgctcaacg catccgggac 
aaaggaaggc gagctgcaat cggttacttt 
gttctatcga gtgctggcct tcactccatc 
tgaaggcaat gcgtggctgg aaggatcatc 
acgttggatc gacgttcaac ctcggcgtgg 
cccgaactgg gaattcaagt tcttacgtcg 
cgtgggcctg cttcgaatcg cggacaagga 
caacagcagc aacccattgt tcgctggttt 
atacgacgaa gcagtcatga tccgcctggg 
gttcccagag acacccgaag aattgtttgg 
accggagttt tttacacagg accagatgct 
cggcggattg ctgatgatgg gcggccaaga 
acctctgggc gaactgtccc ccgtctaccc 
cgctggcaaa tggtccatca cccgagaagg 
aacggaagca gctgaaaagc tgcgtctgtt 
ggtgggtgac atcaaacccg gagccatcga 
acccgccccc gccttcatca cccagcgatt 
cggcgacttc tggcgatggt cgatgcggaa 
tgatgatcca gctcaggcat ggcgtcaaat 
ccgcgtggaa ctgacgatcc aaccggatgg 
ggtacgagat gagtcgtatc ttccgatgga 
tgaagatggc gtttcaatcc cgctgaccgc 
cgccgcaacg tttgtctcgc gtgactccgg 
ggctgaccaa agcttcatcg gaaacgacga 
agagactcag cgtttgtcgt ggaacgaaaa 
tggcgaagtc ttgcgggtat cggacctagg 
aataccagtg accgagactt gggtctatcc 
cgcgatggcc tgcctgtgcg gcgaatgggg 
aatcgattga aaatgcttcg tcgaaaaagt 
tgcctgcccg tttcgacggt ggcctcgggg 
agcggccaaa cgtccgactc agggcaagac 
gatgtggcca gcgaatcagc tcgcactgac 
ctgatcgtgg tcgtgggagc ggcgggcgat 
tctgacactt ggatgaaact cgcgaaaacg 
gaatcgctgg agagcgaaca aactgaccag 
gagaccgact cgccacgacc gttgtggatc 
aatgttgcca agttcaatct gcgaggcccg 
ttggccggct tccaacgtcc ggtcgttgtc 
atcaatgcat tgtctgcccc caaccgagtc 
caaaattttg cgaggtttgg acagtacttc 
ctcgatcacg atgacgcggt ttcgattcgc 
gagcttttct acgaagcgga agatcggatc 
ggcgatggtt tgggcacgaa ggcggatgac 
ctcgcaaagc tcgatggaaa acaggcgggg 
ttgaattgga ccgacgatca gctcaaacgc 
cttcgcatgc aacggctttc tgatcccgcc 
ctcgagaaaa tcttggtccc gctcgccgaa 
ccatgaattc gcccgctcga ttgaaagctt 
tgtccaccgt cgatgcgatg aagttggtcg 
ggaagtggtg ggacaaagac ctggactcaa 
aggtcgccgc cgtctgcaca aagttcgtca 
acctcgctgg tttgaaggaa tccatcgccg 
tttcgcaggt cggcgatgca cgaccgggag 
tcgcgggtct aaaagaagcc gcaccgatgc 
aacctctcaa cgaattaatt gatcacgccg 
tctcgatcat ccaagaggtc gacagcccca 
aacaaatcag cgaaggtcat gtgatcgcga 
acttccatgc cgccggcaat cccggtcgac 
cgtccatttt cgctgcgatc gcggacaccg 
ggccggtgaa cgaagccgct gcgggacttc 
gaacttcttc ttgccgcgtt tcgcggtggg 
gccaaacgaa cgctttccgc cactgaagtt 
cttgcctttg cggaaaccct ccttgcggaa 
atcgacgacg tcacctgctt cttcaaggtt 
aagcgtttga tagatgtccg agggcatgcc 
gcgaatgttg atcgggccga tgtattcgcc 
aacaatgttg ccgggtttga ctccgtcctg 
gccttctttg acgggtgtga gtggacggcc 
ggatccacga ttgccggagc gatcgttcga 
tcggtcggaa cgttcgcgct gcttcttggg 
ctggctcatt tctgccagtg cggcagcgat 
gtactgagcg atcatgtctt tgtagaccgt 
gcgttgcttg aaacgccgca cgcgagcctc 
gatttcaatg ggttgcttgg tcgtgttctc 



119221 tcccgctcga ccggtacgac caatccggtg 
119281 aaagttgaac acgtgactga tccgtggcac 
119341 aaccaaaacg tcgagttgac cacgcttgag 
119401 tggcatgtcg ccgttcagtg cgatagcgga 
119461 actcagcgtg gagtcctttg tcttcgtgaa 
119521 caagatgcga gtcaacgcat caattttgtc 
119581 aaccgaagcg gcggtgatcg tctttgactt 
119641 ctcgtcagcg atcttgcgga ttggctttgg 
119701 tccttcaggc gatttttcga ggacaaactg 
119761 ttcgtcggct tcatcgagca caaggcaatc 
119821 gacgtgatca atgactcggc ccggtgtgcc 
119881 tttgagttgt ggttcgtagt cttgtccacc 
119941 atcagcaccg tacttggtga acgaccgagc 
120001 cagcacgagg acttgtgggc gacggctgcg 
120061 aaatgcagcc gttttgccgg tgccagtttg 
120121 catgtagggg atgatctcgg cttgaattgg 
120181 cgctttttga acttctgctc gcaggtccaa 
120241 cgactcaggt gccttctccg ccttcgcggg 
120301 tggcggagcc aattcttcgc gaacagactg 
120361 tgcgggcttt tgctccgcaa ccacttccgg 
120421 ttccgcagcc gcaacaggct cttcgacgac 
120481 ttccaccacg gcttcattcg caggttctgg 
120541 tgcatctggt gatgcaacag catcggattc 
120601 ttgaacgtgc tgctctggtt cagcgatttc 
120661 cgctggctct gcttgcactt cgacgatcac 
120721 atagaacagc aaggattcgt cgacagccgg 
120781 gttggcatag tgagccgccc attgtgccga 
120841 ttcaaatgtg tcctgagctc gttcttgcga 
120901 ttcattggaa gcgggatgat cagacatcaa 
120961 cccacgctgc gtttcgcagg cggtcatggt 
121021 aaccaagagg actctttcag agccctggat 
121081 aggaccttcg aacccttgca ttccgacaca 
121141 gaatccgtct caagcgatca atcaagacgc 
121201 accgagcgga tcacactgaa gacctgtcgc 
121261 cgccgccgac ttctgtggcc caggagagta 
121321 aactaaaatc tcattcgggt ttccgcgaat 
121381 cacccgctga agcaaccacc gcacattgcc 
121441 tgccggagta attctactta gcgcggtact 
121501 ttgcagcatc ttcggggatt cccgacccac 
121561 caatcggaga tgaaagactt tgaaccccat 
121621 atttcatttc gagtttggtt gctacgggcg 
121681 ctgatgcaca aacgtttctc gcggcctcgc 
121741 atcgcaatca tcggggtgct cgtcggacta 
121801 gcagcccgcc gtatgagttg cagcaacaat 
121861 taccactcgg cctacaacaa cctgccagtc 
121921 cgtgatccat gggacggcaa cccaaacgga 
121981 ctggtaggca tgttgccatt cctagaacaa 
122041 ttgagcatca actcggacgg caccgttcga 
122101 cccgatcgag tgcaataccc accatgggcg 
122161 gatcctggct ctggtttgcc atcgcttggc 
122221 tcagctgtct actctcgtga ttcgtacctc 
122281 ctcccctata tctcagacac tgcccacgcc 
122341 tttgttacgg gacgccacat gaagtttcgc 
122401 atgtgtgggg agattgctac cgacctcggt 
122461 aacgtgggcg cccatgctgc acccaatgaa 
122521 gctgaacctt acctcgaccc agagcgccca 
122581 gtggtgtggg gacgtggtta tcgttggcat 
122641 acgatcttgg ctccaaacgc gcaactctgc 
122701 tcgccacctt cgagtcgcca ccaaggcggt 
122761 aaattcatca ccgactcgat cgaagcaggc 
122821 gcaggctcgc ctgctgctgg cagcaaaagc 
122881 cgatccgcga aagaagtgat cgacggcgaa 
122941 attttgcaac tgcattcctt ccgctatcga 
123001 ttagttctca ctattggatg gattgatgaa
123061 cttgctcgtc gcaccactcg gatgcgactc
123121 ggacgcggaa aagttggccg aatacgaagc 
123181 aggatacgaa ggcacgaagt agcgccgctt 
123241 gcacctttgg attcatgcgc tagacagaag 
123301 ttgtatccag ccaatccgac tcggttaccg 
123361 catcgactta gccaggaaca cccacatagg 
123421 gcagcgcagg cagccattcc taggctgctc 
123481 attgaggcat caaatgcaaa ggaacagaaa
123541 ccttgtcgtg gtcgcgatca tcggcgtatt 
123601 cgcacgcgag gcagccagac gcatgagctg
123661 gatccacaac taccattcgg cttacaacaa 



cacatacgat tcgctgtcgt gcggcaaatc 
gtccaaacct cgagccgcga cgtcggttgc 
ttggtcaatt gtccgctctc gaaccttctg 
aaacccttgc tggctcagct tttcagccac 
gacgatcact ccatcggttt cttccacttc 
gcgtggtgaa acaaacaacg cacgctggcg 
gatcgtgatc cgagctgggt cgttgagata 
caacgtcgca gagaacaacg cgacttgccg 
aacgtcttcc aagaatccca tgttgagcat 
gagcgaaccg agatccaacg taccgcgttt 
gacgacgact tggacaccgc gacggagctg 
gtagatagca gcaacctgga aaccactcag 
cacttggatc gccaattcgc gtgttggtgc 
aacatcgaca cgcgacaaga tcggcaatgc 
cgactgtgcc aaaacgtcgc gaccatccaa 
agttggttta tcgtaacccg atgcggcaac 
atcgcggaac agtggctcag ccggctcgtt 
ctcttctgtt tgtgcggact ggctctgagc 
ttcttgttta gcaacgggtt ctgattgaac 
ttcaacgact ggttctgggc tggccgctgt 
aggtggcgcc atcgttgcaa cgggctgcga 
ttcgcgagcc acctcaggct cagcgatcac 
agctggctgt tcctgcgatt tcggttgctc 
gacctgaaca ggttctgtgt tcagttcgat 
gtcggacgct tcatcggtgc agtcggcttg 
cgatggcatt tcgctcggat caatcactgg 
aggatcgacg accgtctcgg ttgtcgacat 
attcaactcg gacgtaggat cgaccaacgt 
aaaactcgtg cgcttaaaaa ggtagccgca 
cgcggcagaa ttggtaagga tggaatgcca 
ctggttgtct gcggttcgat tcagtcctcc 
accacgacgc cgcttctcat tcaccagaaa 
tcaacggact gggtagccac tcgggtcgaa 
agaaaagtcg attcgcgacg tcaatttgca 
attgaatcat cgattttgca ataggaggaa 
gcggtttcct cttccctcac ctgccagcag 
tttgtcggca tccccccatc cgagacaaaa 
ttcattttga gcacgtgtta atcaattgac 
tattcgggaa aaaaggcttt ggattcttcg 
attaacatgc tcaaattgaa gcccgtccag 
aactagccca tctaagactc aattgaggta 
ggtttcacct tggtggaatt gcttgttgtc 
ttgcttcctg cggtccaggc cgcccgcgaa 
ttcaagcagc ttggattagc tgcccacaac 
caaggctcag gaacgtatgc cgttggggga 
gacatttcct caaactatcg cctgtcgtat 
caaggcctgt gggaacaaat ctcgaacccc 
agtccatctt ggcaggccat gggacctcac 
actgaacttc caactctgcg atgcccgagc 
cggacgaact acgcggcttg ccacggtgac 
gacgtcgacg aagtggccga aggcggcaaa 
gctgcttcga gcgcatcaca ccgtgggatg 
gacacattgg atggcttgtc caacacaatc 
gacaatgaca aacgaaccac agttgccacc 
aagaaccaat gtcgactgga tccgtcttat 
caattttgga ccagtacggt aacgaaacag 
gactacatgc ctccatacag ccaaatgacc 
tcagaaggaa ccgaccaccg agatgtggtt 
gttcacgttc tgatgggaga cggtgctgtc 
aacaagaatg cacctcaggt gtcggatcga 
ccatacggtc tttggggagc cctcggatcg 
ttctaatttc tcctctttag acaacaactg 
agggagtgca gttgttcgtt ttgaattgat 
aacctggatg ctgaccttcg gaatgctctg 
tggaagcaat agcaacatcg tcgacaactc 
tgcactcgca gaagcagacg caatgtcaga 
tgcattcggc gtgcgaaaac acgttcgccg 
tgagcggcca gcaaaagcgg gccgcttttt 
aagcaacaag cggaaaagaa ccacaaagtc 
gcgaactaaa ccacctggaa tgcgagttct 
tgtcgctcaa gatttctgga aactcgattc 
acggagacga gggttcacgt tggtggaact 
ggtgggcctg ttgctccccg ccgtacaagc 
cagcaacaat ttcaagcaaa ttggcttggc 
tcttccagtc catgcatcgg gaacctatgc 



123781 tttgtcattt ctggttggca ctttgccgtt ccttgaacag caaggccttt gggaacaaat 
123841 atccaatcca ctcagcgtga attcggatgg atcgaccaag agcccaccat ggcagcccat 
123901 gggtcctcac cccgaccgcg ttcagtatcc gccgtgggcc actgaaattc cgtcattccg
123961 ctgccctagt gatccaggcc gtggcttacc atcgcttggg cgaacaaact acgcggcctg
124021 catgggagac tcagcggttt actctcgaga ctcttatttg gacgtggacg aagttgcgtc 
124081 gggcgctaca ttgccctatc gaaccgacac cgcgcacgcg gcggcttcta attcagccca 
124141 gcgcggcatg ttcgtcaacg gtcgagaagt tcaatttcgc gatacgctcg acggactttc 
124201 caacacgatt atgtgcggcg agatcacgac ggacttgggc aacaacgaca aacgcacgac 
124261 ggttccgacc aataccggtg gacatgcggc gccaaacgaa aagaaccagt gtcgactgga 
124321 tccatcttat gccgagccct acatcgatcc tgaacgccct cagttttggt caccatccgg 
124381 cttgaccaag caagctgttt ggggacgtgg ttttcgctgg cacgactaca tgccacccta 
124441 ttcccagatg accacggtgt tatcgcccaa ccgcgaacta tgctccgacg gtcgagacca 
124501 tcgcgatgtg gtatcgcccc cttcgagcca acaccagggc ggagtccatg tgttgatggg 
124561 agacggcgca gtcaggttcg ttaccgactc gattgaagct ggtaataaaa atgcaccgca 
124621 ggtatcaaac cgagccgggt cgcccccgcc aggcagcccc agcccatacg gactgtgggg 
124681 atcactcggg tcacgtgctg caaaagaagt gatcgacgga gagttctagt cactcccacc 
124741 cgaatccatt gccaactatt acgaactgcc gccatgagtc actcatggcg gcagtttctt 
124801 tgtattcaga actctcgctc tagaccgact caggaaacag tgcgtccagt gtaagttggc 
124861 tccaggtttc caggtgctcg cgttcttggc ggagttgttt tacggaaacg aactgacctt 
124921 cggcgagttc ggcctcgttg ctggtgacga gtggttgctc caagatgaag cgatggacga
124981 ctcccaagtg cacccgacca acttcgttgc tcgggtcata aagcaaacct tctcgttgat
125041 cgacgtaggg tgctccaatc gtcacttctt cttctagctc gcgacgcatg ccgatggtgt 
125101 atgggtcttc ttcgccaccc gcatcttctt ggctgatgtg accaccgaca ccgatgctgc 
125161 gtttggcgtg cagacgtttt tcacccgaac caccaccgcg agtgtattga aaccatttga 
125221 ccgcgccgtc agccggatcc gtccactgca tcaccacgta ggggatcaat tgtttgaaac 
125281 tcggatcgag ctccattgca ccgcgtgggc gataagacaa ttcgtcgctg gcaagaatag 
125341 gctgcaggaa tcggtccacg ttgcggtcaa acccttccaa atgcccaatc gattcgatca 
125401 aggtcgcggg aatgacaagg acgtgttctt cggtggtggc gttggaggac attctgcaat 
125461 cgaggtcgtg gactagctgg aagttcaaac caaccggggc tgtcaaattc gctcccgctt 
125521 ggccgttggc agtgtaacga attctcgtca gagcgaatcc cgtcgaggtc gacgatggtg 
125581 acggaccaaa gcgctgacgt agcgattccc gtcaacgaaa aaccgcaatg gcaattcagc
125641 ggcctgcgat attccgattc gaggcgtggt ggtgacacga ctgctcggaa gcttgggacc 
125701 agagaaaaca caccagttcg gatcgcgaat cggatccact ccgtcgcacg tccgatcaat 
125761 cgcgagagat tggcacaaac ggccggggcc cgtcgtcaat gctcttccgt cggtcgtttc 
125821 cgatcggtgc agacctcgat gctgaaacat gcggtcgatg ccccaaaccg gctgaagagc 
125881 ccggatcaaa actgccgaac cacagccgac cgattcggtc accagattca cacagtgctt 
125941 ggcatgaatc ggatagacgt acaatgtact tggccgccca aacatcgaag cgtttcccgg 
126001 tttctcaccc cgcgcactgt gactcgcagc atcgcgactg gacaggtagg cttcggtttc 
126061 cacaatccaa cctccgaccc aaaccccttc gattcgtcga gcgaaaccgc aacccaaaag 
126121 ctggcgagcg acgacagcgg gacgacggtc aaaaaattgc gactgcaacg actgcgtcga 
126181 tccccaatcg atcagccgat cttcctcgtc atcctcacgg aactgatttc gctttgcatc 
126241 attcatcaaa acaaacttat cgcgcgcgtt gggtgatgcc catcgacggt gatccgattc 
126301 ccaacggaat gatgattgtt caatcgggcc ggatcgcttc gatcagttca ttccaagatt 
126361 cgaagaacga aggtgaattg atcgacttgg gagacgtcgc aattttgccg ggtttggtca
126421 acactcatac gcacctggaa ttcagcgacc tcccaaaacc cgtcggacac accggcatgg
126481 aattggccga ctggattcgc gaagtcatca cgactcgagg gatggtcgat caccaaacgc 
126541 gacaacaaaa cgttctcaaa gggcacgcgg aagcaactgc gtcaggcaca cagctgatcg 
126601 cggacatcga gaccacgccg atggaatcga tcgcaccaaa tacgatcgca ttcgcggaag 
126661 tcctcggact cagccaggaa cgaggcgacg aacggatgtc acaggctgaa cgacatcttg 
126721 ccaacaacac ttcgcctgat tcgcgatgtg acgtcgccgc gatcagcccg cacgctccct 
126781 actcgacacc tttgccgttg atcgagcgct gcgtgcaagt tgcaaagcaa cgcaagacga 
126841 cactcgccat gcacgttgcg gaatcacctg ccgaacgtga actactgaca aacggatccg 
126901 gcccatttgc cgaatcattg cgaactctag gattgcctat cgaggcgttc tttccctgga 
126961 caaaagcgaa tccagtctca cacctgattc agctgcttgc caaatcacct cgtgcgttga 
127021 tcgtgcacgg aaatgatttg agtgagaccg aaatcgagca agtcaaatcg cacccgaact 
127081 gcagcatcgt gtactgccca agaacgcatc atttctttca gcattccaat catcccgtcg 
127141 cagaattgct agccgcagga atcaacgtag ccatcgggac tgattcacgc gccagcaatc 
127201 ctgatctgga tctttggggc gaggtccagc acttgctcaa gcatcggcct gacctgaatc 
127261 cgaccgacgt gcttcgcatg gcgaccgcca acggtgccga cgcattggga cgctcagcaa 
127321 ctcacggacg gttggccgcc gggatgccag ccaactttat caccgtcgca acaagcgctg 
127381 atcgagtcga tcagctttgg tcggactttt catcctcatc cgtgtcatct tcatccatgt 
127441 cttccgccgc tcgcccagca ccccaaagct gaaccgcgtc ataaagagca tgagcagtga 
127501 tcggaatcag cagattgccg gtcacaatca gcaaggcacc aaagtacaaa cccatcaacg 
127561 cggtgacgac gatgtacagt ttggtgattg gatggaacat tccaaacgcg aatgaggaga 
127621 ccagccaggc tgcagttaat tcccatcctc cgacgcgaga ccaccaagcc agcaaacctt 
127681 cttccggcca agcgaattcg gtgactgagt ttggcaaact gcccagccat ccgccataag 
127741 cccaccatcg cgatggggct tccaaatcag gagcaagcga cgcggcgtct ccggccaacc 
127801 aaggcaacaa ccaaccgcga aacaagagtt cctcgccgac gccggcgcac aaactgatcg 
127861 cgaacaattc ccaggcgttc agtcgcagca tcaagccgat catggggtga tcgctgagct 
127921 tgtcgagttc ttcgattgcc gggtgtttga ttcgccgcaa cacggcaacg aacaacacca 
127981 atggaatggt cgcaacaatg cccagcccga taccgccaat gaccgcggac gaagcgactt 
128041 cgtttagcgg cggaaccaat tcacgggcac tgggaccgag caggtatccg agaatcaacg 
128101 ccagggcacc caagccggct tcgaccgcga cagcagtccg aaagacatcg tccggggttt 
128161 gttcgtcgtc cgcgttggtt tgcgtcacac gcctctctca ggatgggctt cggtgtcggt 
128221 agggattgaa caacaagtag ttcaagtggc gatacagatc actggaattc agcaactcgg 



128341 gtaacgtctt cagccggtga ggcaacatga ccaccaacga tcccgcatcc gccaactctc 
128401 gcagtgcgtt caaacaaggg tcttcattga catgctcggt cggggcaggg ggctctttcg 
128461 cgagcacgat cggcggccga tgcaacatcg cccgcgcgat cccgactgcg taacgaacat 
128521 cggtcccgag cgaatccgaa tcttcggtcg ctcccaaatt ggaccccggt tcgatgatcg 
128581 tctccaggcc gtcaggcaag cgcgtgattc gctcgtagat cccgagttgc tcgagcgttt 
128641 cgaccatgtc gcgattgtcg acactgcggt cgactcccgc catcaaattc tctcgcagag 
128701 taccttccca gagaggcccg tcgggaccga tccacatcac gtttttagcg agagcttgcg 
128761 ggtgaacgtc gagcaatgag atcccgtcga tctgcacctt gccgcgatgc gggcgaccaa 
128821 accccatcaa caattccagc agagctcgcg tcgaaacatc ttcggttcct aagaaagcca 
128881 ccagcgactt gggctgcagc gacaaactca ggtcacgcag gatcgccttg cccgaggaat 
128941 cttgcagcga cacgtcttcc atcgtcacgc aatcgcgaag accacccagt ccaacacgtt 
129001 gctcgctcgg cgacacttcg ctgttgggtt gcaggtacag atacaccgat tcacaagctt
129061 tgctgctgcg gcgaagttga cggttcagtt cgctcaatcg agcagccgcc aaggcagcac
129121 cggtcaacga aagaccaagc accaatgcgg agggaagact caatccttgt tcgccttgca 
129181 gcgtgttcag gccgagaccc aacaacatca cagcgatggc aacgccgccg gccatcatca 
129241 tgaccggcca cagtcgacct cgacgcgcgt catcgccagc gatccggcga tccaatgcgt 
129301 ccaactccgc ctcgtaggct tggtcagcca aaccaccggc ctgcagtcga gccatcatgg
129361 gtgcgtcgcc aatcaaaccg atcaagcgat cacgaatttg tgggatctcg aatcgcgaca
129421 actccagcca atccggattg cgaagcttgg cgtacaaacg ccacaacgcc acgccactga 
129481 tgaccgcgag aatcgccagc caaacgttca cgagcagtgc gagcgtcaca cacccgatca 
129541 acatcagcac gcttcgcggg atggatcgat accacagtga gagaccacgt ccgacttcgg 
129601 gcaagtggcg accaatgatg acacgggctc gttcacgctg agcagcggca ccttcgactt 
129661 ctgcgcgacg caaactttgc tttagcgttc gtgaatgcaa catcttgatc actcgccgcg 
129721 aacgagcatc ggcggatcgt cgatgcaacc aaacggccaa acaaaagaca gccgcgattg 
129781 ccaatgacaa tccaaccaaa ccaaacagct gagagacttt gctttgcgcc agaaagaact 
129841 ccggcacggg gatcgacagg tgtgttccca ggcgaacagg cgactgtgtc agcccatcgc 
129901 cttcatcgac caatgccgcg atgatgccaa agaccagcac cagaactgga atgatcaaac 
129961 ccgcgatggc ggcccaaacc aaatgcaggg gcgcgcccgc tgcctcgggt tcgcgacgtg 
130021 gttgtttcag actggaaaag cgattcaccg ttgcgagcct cgttggatat ctagccaccg 
130081 ttcgtttgtc atcacgaccg cattcgttgc gatcgacgat ccgtcacgac cggtcaccgt 
130141 tatcctcaaa cgagtttcca atgcaacaga tatggcgatc aagccaaatc gcgatcttcg
130201 aacaacaaca tcgcaaccat ccaaatggca atgacaaaga ccacaagata gttgaacgcg
130261 gcggccaagt agatcagcgg gatggcattt cccgcatcca ccgccgcctg gacgttgaat
130321 gagttcaggt tggggaccac gaccgctatc aatttaccaa caaatccgac cagctcggtg 
130381 ttttccctcg cagaagcgac cagcggactg agcaggttgc cgacaacata gatcgcgaaa
130441 caggtgatga agttcgccaa cagcggcaaa cgagtcgcca aagccaccgc gattccgccg
130501 atcgccattg tttccatgaa gtacaaaccc aagatcggaa gcgtcgtgat gatctcttcg 
130561 tgaccgactt gccatggtgg ttgttgctgg ctggtttcgc gagcgtcgta aattggcttg 
130621 taggacatca ccaccagcag gaccgccgcc aagatcacga acagcaccaa cactgccaac
130681 atgattccgg tgtacttgcc caagatgaac gaacgtcgac taactggttt gctgagcacg 
130741 gtcaaagcgg ttcggccatc aatttcatcg ctaaccgatg ttcccgcact ccaaacggcg
130801 agcaacatgc ccagaaccat gatcatcgtc acgccgctgt ccttcagcag acggatgtca
130861 tcaccgagtg tgttgaacgg gaagatccca aagagaagca cggcaaaaat gccaatggcc
130921 aacagcaaca gataaagcgg ctgagccatt tcgtttttcg cggtcgacaa agccaatgcc
130981 cagactcgcg gggccgcttg tcgcattgcc ataaagcccg tcagcgtcag gaacagggcg
131041 acggcggaga agacaatcga cgaagcctgc ggaatttgcg gcaggttttt aaacgtgaaa 
131101 gtcacttcag ccgggtcaat ctcttggttg tagatgtgaa gcttggccgg gtcacctgga 
131161 attggtggct gctcacggtc ttgatatttg taggtaaccg cttcggtgct gcccgggttc 
131221 aattcgatgg gagcacggct gaaggcttcc gctttgtccg agtcagccag gaagactttc 
131281 ttgtcactct caattcgcag ttcggcaacg ttgcggaagt tgtaagtgat tccggctggc 
131341 aggaactcgg ggatgtctgg gtcagccgaa ggttcgattg tcgcgacttc cacgaccgta 
131401 ccgtcaccga ccaaattgac cgcggggacg ctctcgagaa tcgccatcgg gttttcaaca
131461 aacggcgtcg atcccaaacc gatcacggca aaaagaccca aagtgcccaa gatgtacggg 
131521 atgacacctt cgcccagaat cgacatccac tccgatcgag tccgatgcca cattccgtag 
131581 atcagcccaa agcccaaaat cattcccaac gtcaccaagg gcaagaaaag caattggcca 
131641 ttctcgccgc cttgcggaac atagaaatag cacagcaacg cggagaagat cgcaccggca 
131701 atcagcgaca atgtcacgcc gcgttttgga tcgtctgcca aattcccgag accgggaatg 
131761 aaggacaggg cactcaagat gccgtggatc acagcggtca ccaaagcccc cagcagcaac 
131821 cccatcgaca gcacccaaat gggagtgacg aatacgggca accacttgcc gacctgggcc 
131881 agagggagca aacccgattc agcaccagca aaatcagaga acagtgaaat agcggaccca 
131941 gcaatcatgc ttggcgactc gtgtcgagaa aagtacgaaa tgaaccttcg gtcaggctgc 
132001 tacgtgagac caacccattt ggtttgccgc cgaggctcgg attcagtgga tttgttggcc
132061 ggcgtgatca acttgatcga acgagccatg aagcgagcca tgaaacggcg aatcaaacct 
132121 tcggactggg cttcaccccg ggggttttct ttaccccaac gcgaaaaccg ccggtcgctt 
132181 ttaaatgctg acgcagtctg ctccggcagt ttgctgcaaa gctcgcaacc gtttcggtgc 
132241 ggtaatcagg gtagctccat cggtgatgaa tccaacgccc accaacatag ttcagggtga 
132301 cttccgcaaa gatcccatca cgcaaataga ttcggtggtc gcggtccttg atcgtcgcca 
132361 acaccaattt tgcttggctg acgtaccccg gatccaaatt caacggccgc ggctcgggat 
132421 gatccgacag cgatgcgtat tcggattccc attcgttggt gacgtttttc cagtccgcca 
132481 atccggccgg atccgcgaat tcagcgattc cgatcagttc ttttgtcagc ccatcgccca 
132541 tttcaggttg gtaaaacccg ccggcttcga acggactggg ctcacctttt tcgcccagct 
132601 cgccccagcg ttctgccagt cgctgcttcg cccactgacg agcctcctga tgccgcgaaa 
132661 tcacggcgca gaaacggacc accggttcga tcaatcgaat ttcgctcatt ccgtcacggt 
132721 catccttcgc acgccagttc cccttcgcaa atcaatctct ccctttccgc cgtcaacaaa 
132781 ctcaccacta gacatccgaa cctagtcggg catccgaact tggccgggca tccaatcttg 



132901 atcgatcgag ccctcgcccc aaacacggac taccgcaagc cgaagcgaat gatatcctga 
132961 tccgcgaaac gagcatgatt tgcctgaggt ttcttccgca agcaaccatc gctttcaacc 
133021 ttcatccaac aggcaacgag tcaacatgca tccctggatc gcggatcgaa cggcctcctt 
133081 tgatagcagc ggaattcgaa aagtcttcga tctcgctgcg aaactgaaag acccgatcaa 
133141 cttgtccatc gggcaaccgg actttgatgt ccccgaggaa attcaggacg cgaccgtcga 
133201 tgcaattcga tcgggcaaaa acgcatactc gccaacccag ggtatcgcgc cgctgcgtga 
133261 gaaactgctg gcggaaatca acgccaagta tcccggccaa aatcgcgacg tgtttgtcag 
133321 cagcggcacc tcgggcggtt tggtcctttc gctgctttcg atgatcaatc cgggtgacga 
133381 ggtcatcttc ctggatccct atttcgtcat gtacccggct ctcgtgagct tgtgcggtgg 
133441 tatccccgtg acggtcgatt cgtacccgga tttccggttg gatccagcca agatcgaagc 
133501 cgcgatcaca cccaaaacca agatgatcct ggtcaacagt cccgccaatc caaccggcgt 
133561 gaccgcgtcg gagcaagatc ttcgagatgt cggcgatttg gcggccaagc acaacatcgc 
133621 gctgctttcg gacgagatct acagccgatt cttctacgac ggcgacttcg catctcccgc
133681 ggccaccaat cccgacacaa tcgtgatcga cggtttcagc aaatcgcacg cgatgaccgg
133741 atggcgagtg ggctacgtcc acggaccgcc cgagatcatt gcgacgatgc tcaagatcca
133801 acagtattcg ttcgtctgct cgccccagcc cgctcagtgg ggagctctcc gtgcgatgga 
133861 aatttcgctg gacggccaca ttgacgacta ccgtcgcaaa cgcgacttca tggtggaaca
133921 actttcgccg cacttcgaac tgaccagccc cggcggtgcg ttttatctgt tcccgaaagc
133981 ccccggcacc gaagggggga cggcgtttgt cgaaagagcg atcgcggacg gcctgctgat
134041 cattcccggc aagattttca gcagccgcga ttctcacttc cgcgtcagct tcgccgccag
134101 cgatgacacg ctgcgtcgcg gtgccgaacg cctgatcaag ctggctggcg acaatcgctg 
134161 accgatttgt tagcctgctg cgttcccaac gcagaaattt cctactttca aacatccctg 
134221 atcgagtttt tgcatgtccg ccgatccgac ccaaaatcat cgtctggcca cccgtgctct 
134281 ccatgccgga caagtcgctg acccaacgac caaaagccgc gcggtgccga tctacgccac 
134341 caccagctac cagttcgata gcacggaaca cgcggcggcc ctgtttggac tagctgaatt 
134401 tggcaacatc tacagccggt tgatgaaccc aacggtcgat gtgctggaaa aacgcatcgc 
134461 ggcactggac ggcggtgtaa cgggattgtg cttcgcctcc ggtcaggcgg ccatcacggc 
134521 agcggttttg gcgatcgcac acagcggtca aaacattgtc agcagcacct cgctgtatgg 
134581 cggaacatgg accctgttca cacaaacgtt gaagaacctt ggtatcgaag ttcgcttctt 
134641 cgatccggat caccccgagc aaatccacgg cttggtcgat gaaaacactc gactggtcta
134701 catggagagc atcggcaacc caagaaacga tgtccctgac ttcaaggcaa tcgccgacgc 
134761 agctcactcg gcaccacacg gggcgatccc agtgttgtgc gacaacaccg tgatgacgcc
134821 ttacttgctg cgtccgatcg atcacggcat cgacattgtg atctacagca cgacgaagtt
134881 cttaggcgga cacggcactc acatcggtgg ctgcatcgtc gatagcggaa atttcaaatg
134941 ggccgaccaa cctgaaaagt ggcctgagtt ctgcggcccc agcccttctt accacggtgc 
135001 cgttttcgaa gagcacctgc gtggcatggg caacatcgcc tacaacgttc acatccgcac 
135061 ccactggttg cgtgacaccg gcgcggccat gagccccttc gcggcgttcc tgttcctgca 
135121 aggcatcgag actcttcacc ttcgcatgcc tcgccactgc gaaaatgcga tgaaggttgc 
135181 tgagttcctg gaagctcacg acgcagttga gtgggtcaat taccccggat tgaaaacgca 
135241 ctcgcacaac aaattggctg agcagtacct gaccaacggc aaaggcgcga tccttggctt 
135301 tggcatcaaa ggtggcatgg aagcgggcaa gaaatttatc aacgcctgcc aactgtgctc 
135361 tcacttagcc aacattggcg atgccaaaac attggtcatt catcccgcca gcaccacaca 
135421 ccaacagctt gccgaagacg aacaacgccg agcgggcgtc agcccagagt acattcgcgt 
135481 ttcggttgga atcgaagaca tcgacgatat cctcgatgac ttgaaacagg ctcttgctgt 
135541 tgcgaccgcg tgaccgtcaa agcaatcgaa acgttgcttc cgaacaattg atcctcccgc 
135601 gaaatgacat ccgggagatc cactcgggta acgatgttcg agagccaagc ttctatgggt 
135661 gagccttcca acgaggatct gtccagcacc gatgatgtgc ggactgatgc gccattggcg 
135721 tacgccaagt acgtgacgtt tgaccagtcg ctgccgctcg aacgcggcgg cgaactgccc 
135781 gaaattcgat gctgctacga aacgtggggc acactcaacg acgacggatc caacgccgtg 
135841 ttggtctgcc acgccgtttc aggcgactca catgcggctc gccacgacga agacgaccaa
135901 ccaggttggt gggacggctt gatcggcccc ggacttccca tcgacacgga tcggttgttc 
135961 gtggtctgtc ccaatgtcct gggcggttgt cgcggcagca ccggcccagg tgacgcggac
136021 ccaacctcgc ccgatggcaa accctacggc gccaacttcc ctcggatcac catcggcgac 
136081 atcgtcgaag cccaaaaact cctggctgac cacctgggaa tccggcagtg gcgggccgtg 
136141 gtcggcggtt cgctgggcgg tcaccaagtc ttgcagtgga tcaatcgcta tcccgacgcg 
136201 gcgaaaacct gcgtcgccat cgctacctct cctcgactga acagccaagc actcggtttc 
136261 gatgtcatcg ctcgcaatgc gatccaaacc gacccgcact acgccggcgg ccagtactac
136321 gacaaagacc aacgcccgga cacgggattg gcaatcgcac ggatgctggg tcacatcact 
136381 tatctttccg tcgaagcgat ggaggccaaa ttcgaccccg atcgccacga cccacggcag 
136441 atcgcatcac agttcgagca acgattcagc atcggctcct acttggcaca ccaaggccaa 
136501 aagttcacga cacgcttcga tgcgaatagc tacgtgactc tgtcgatggc gatggatcta
136561 ttcgacctgg gcggcacgcg actgaagctg atggaaacgt ttgacgaagc aacttgcgat 
136621 ttcctgctta tcagtttcag cagcgattgg ttgttcccac ctgctcaatc cagagaaatc 
136681 gtcaacgctc tgaccgcttt ggacaaacgt gtcacttatg ccgaaatcac caccaatgcc 
136741 ggccacgatg cattcctgat cgcgaaagac attgcgactt atggaccgct gatccgcgaa 
136801 cgactgagag acaccgaaac gcatcccgcg gttccatccg acatcacgtt gaacgtggac 
136861 gaagaatcaa tcctggagat cattccagcc ggatccagcg tgctggacct cgggtgtggc 
136921 aacggacaac tgctcgccgc gattcgtgat cgacatcgca cgccaggacc gccaacaacg
136981 gaacatcgat tgatgggcgt cgaggtggca caggagaacc tgctggcaac ggcaatgcga 
137041 ggcatcgatg tgattgacta cgacctcaac cacggactgc ccgcattcat cgatgaccag 
137101 ttcgactacg tgatcctcaa cgcgacgttg caagcggttg agaacgtggt ggaactactc 
137161 aacgagatgc tgcgagttgg ccgacatgcc atcatcagct ttcccaactt tgcgtatcgc
137221 caactcagag atcactacgt gacacatggg cgatcgccga aagcccccgg cgagttcgac 
137281 ttcgattggc acaacacgcc taaccgacgc ttccccacga tcgccgacgt tcgcgacctc 
137341 ttgggccaac tgaacgtcgt gatcgatgaa gaagtctttt gggatgttga tcagggtcaa 



137461 cgcgaaaacc gttgaacttg attcacaatg 
137521 aatcgatcaa aattcggtat cgatgatttc 
137581 gacaccgaac ggtgacttcg aaccaggcgt 
137641 gctgttgctg tttccagctt cgatcgaatc 
137701 caggacgtgg actccacctt ggtggcgact 
137761 gttgccctgg aaacaaagct catcattggg 
137821 cggacgggcg agggcccatt tgaatccgcg
137881 cgagccatcg ttccgccaga actgaggacg
137941 cgcgtttcct tctcgccaga cctcaaccgg 
138001 gtcgcccaag tcagtgatga tttcgcccat 
138061 gtctcggaac ttcgtctgcg ttcgtggaat 
138121 aatccgtaga ccacgactgg taccattgtc
138181 cgagtcgccc aagcacacgg catagttcgt
138241 atcactgggg caacggaagc tcggaaggtt 
138301 gtggtcggcc aactgacggt tgggccatgg
138361 attgctgatt acttcccata aaccttgttg 
138421 agacaagtcg gaccggttac cgccgggtgg
138481 gtacgtgccc gatccgtgct tcggcatcgc 
138541 taaaccaagt tgcttgaaat tgttgctgca 
138601 ttgcacggcg ggcagaagca aaccaaccag
138661 ctccaccaac gtgaagcccc ggtgctttct 
138721 gaaagatgtc cgatgaaaag gcggggcgtg
138781 aattagggct tggtcggatc cgatggagaa 
138841 ttctcattca gaaactgcgt caactcgtcg 
138901 ccgcacccac tgatcgcggc gacaccgagc
138961 caaaaattca caggcaaaac tccatagact
139021 aaaacttctg atttttcctt ggccacaaac
139081 caaaaggtga aaagcaatta acgagcagga
139141 gattttcaca ttagcatgca aatatgacac 
139201 tatcgatttc aagaagtttg cctgttccga 
139261 gtcatcggtc gaaaacccgt gccaaaaacg
139321 gcgaaggaag gcaccgcaaa gcaggcctat
139381 ctgtttcgcg gcgaacggtg tacgagccag
139441 tgactcgatc cccgattttg accggcaccc 
139501 gctttttggc ctccgatgag agtcccatgt
139561 ttaacttgac gactaagtgt tctttcccac
139621 gaccggtgta ccgactgccg gagcgttcga
139681 cagcggtcgc cactagcact tccttctcgc 
139741 cctggtcatc tttgaatagc tggactccgg
139801 cttttccgtc ttgcgacttc acctgcatca
139861 ttttgatatc gcgaaccgtg cccttcactt
139921 ccgactgtgc gatcctggac ggtgggaccg
139981 agtcgtagct ttcgtacacg ccatcgccgt
140041 ttaggtaatc accgtcacta tcgatgtacc 
140101 cgcgatcctc gtaggacaca tagctttcgt
140161 ggatgatcag ctggtcgttg tctgagtcgc
140221 tctcgtcgtc ccaacgtccg atagcctcgt
140281 attccgagat gtcgtaccag gcatcgtcct 
140341 agtcgccggt ttgtcccatc gcgaccggcg 
140401 tcgcgaccaa ggaacgaagg agtggttgtt
140461 tttgattggt gttgtcgagt gcccgcgact 
140521 gaaggcaaac ggcgcgccac cttgcccgtt
140581 caccacctga tctcgcacca ctgcggtcgc 
140641 tcgactcgca cccgaaaaat gcttgcatac
140701 agacggtttg ccgtgtgctg aaccgaccgg
140761 taacaagaga ttgcgagcat gagctttggt
140821 cgcggtcgcg aatttcctgc gattcgtcaa 
140881 cagttgctcg aggtcgttcg gcgatttgag
140941 atcaacgacg cggcggaatg tgcgttcgtt
141001 cgagagattc ttgagcgagc cgggctggcc 
141061 cttcccgaag gcccccaacc actgctccgt 
141121 aacatcatcc aggcgtaccc cttgttcatg 
141181 atggtagaga acatcgactt ggcgatgaaa 
141241 accgaggatg acttggacgc tgacaatctt 
141301 cttctcaggc cgcaccgtac gatggttgat
141361 gaatcctatg ctggcgactt cccacggcca 
141421 ggacatcgca catctcatcc aactgccgcc 
141481 tccttacgat tcgtttttgc tggtgtcctt 
141541 gccgtttctc gaaaacgttt tgcgaggcaa 
141601 cgccgagcac tacaaacact tcggcggtgt 
141661 tatcgcagca ctgcaaaagc gatttgacgc 
141721 caatcggaac tgggatccat actttgccga 
141781 gaagcgatcg ctggcgtttt tcaccagcat 
141841 tcgcgaaaac atcattcagg ctcgcgaaga 
141901 agtccggatg ggcttcaacc atcctggctt 



acctcgggcg acgagcgtcg tccgaaggtc 
gcgtgcggcg cgagtgccca aggctcccca 
caaaaatccg ccaccttcgc gaacttgagc 
ggtgatgaac ttcaccgctc catcgcacat 
actaggcgtc acatcacctt cgcgttcttg 
cgccaacact gtgacgaaac cggtgtacag 
acgctgctcc gcatctccgg tgaagggtgt 
ctccacatcg atgaattgac ggcacgacaa 
actgtcgaca gcatgcgaac gcgtgtcgta 
agcgagggta ttggacaaac catcgaggat 
gaaagcacca cggcaggcac tctgctgata 
agcggtgtcg ccgacatcac acagcgttcc 
acgcccgtga gcgggcaagc caacgccagg 
cgtcaaccag gggtcgtaag gaatcgcggc 
ccccattgct tgaaaaccgg tgtcggtcgg 
ctcgatgaat ggcaacatgc caaccaaagc 
gttgatccct gtgtctggtg gatactgtgt 
gttgtacgcc gagtgatagt tgtggattgc 
actcattcgg cgagcggctt cccgggccgc 
aacgccaatg atggcgatca ccaccagcaa 
ttgaaaactc attgaaaaga ccttgaagga 
acaggtcaac gacgagcatg cccggtaaac 
acctcgggct tgggctcttt cagctctggg 
gtcgaggtgg cttccggctc ggaagaacca 
aacgcagcca agccgcacgc tttgaaacga 
tgtgattgat cgtgacaacg ccagatcaag 
cactcagcga agtgagagct caagattggc 
atcatgctca aaagacacgt taaatcaacc 
gttttcggcc atctttcact cccgaaactg 
tatggggcca gccatacgaa ggaggaaaga 
catgggcgga ctgaaaaccc ctcctcctaa 
agagagactc agcaacttag agacacaggt 
atcagctttg cttcttgtca gcgatcagaa 
cggtcacggt gacttggtcg ccctcactcg 
cgacggtcat catcttgtcg cccttggtct 
ggaccttggc agtcttggtt gactgaactt 
ccttgtacat tccgtgcttc ggtaaatcga 
ccctctttac catcggtcca tagaaggtgg 
ttgcatgtcc gccgagatca atccagacgt 
tcaacgagta ctcgttgcct cgttgaacgg 
catgacgttt gccgctcaag ccgtctttta 
cgttctttga cgtgctgccc agctccgaat 
cttcatcgtc gtaggtgacg taggcgtcat 
gcgtcgatcg ctcgtacgtt ccgtcctcat 
tgtagtcctc gccatagaaa gtatcggcgg 
tgcttgttcg agcgtcttca taggaaaaga 
ccgtcgggtt gtaatcgttg ccgtcaaacc 
caactttgct gccgcggtct tccgggtcaa 
tgcaagccaa cggaaaagag acggcggcgg 
ggcgatgaag gaatttcatg gggtgaacct 
gctgcggaca tgggtgaaaa caccaccaaa 
tacggccgca tgacagacct cacgcgattg 
cgagcgatca tttcgcatcg tcgcgcgaaa 
ttgcggcatc ccgcatcaaa tcggtcaacc 
tagactccgg cgattctgcc tttttgacgc 
gccggtgcag acacgcaatt cgaaaccatg 
ttcaccatct tcctcgagaa ccgagtcggg 
gggactggga tccgaattgt ggccctgtcg 
cgctttctga tcagcgacgc ggaccgcggc 
atcatcgaat ccgatctggt tggcgtggag 
gtttgcacgg cgttgttgca agcggaactg 
cgtccgcacg gcaagccagc ggtcgcgatc 
acgctgacgg agaaaggttt ccgaatcatc 
ccgccgcaag acatctactg agtcgacttg 
cgctccgtac gatggccttc caaggccgtc 
tctgaatcag caggttcgac agccttggaa 
ttcttgactt cccgccatga gtgaactgcc 
tggaggcccg gaggggcagg acgatgtgat 
gaacgtgccg cgcgaacgga tgctggaagt 
cagccccatc aacgagcaaa accgtcagct 
caacggaatc gacttgccga tctactgggg 
cacgcttcgc caaatgaagg cggatggcaa 
gttcagctgc tacagcggtt gtcgccagta 
agttggcgag ggagcaccgc tggtcgagaa 
catcgctgcg atggcggaca acgtttcaaa 



142021 catcccgatg ggcatggctg acaattgcga 
142081 tttggttgcc gatgcttgcg gtgcggtgga 
142141 gccaccatcg cagccttggt tggaaccaga 
142201 cgctaaaaaa ttggaatcgt tggtgatcct 
142261 ggttctgttt gacttggacg aggaagccgc 
142321 ggcacgagcc tccgcggcgg gaacccaccc 
142381 gcaagaacgg ctcggaaaac tgaacgaaaa
142441 tgatgtctgc ccgcaggatt gctgcttgta 
142501 gcgaccggtt caagccaatt gatgcggcgg 
142561 ggccacatcg gcttcgtcgt ctgtccacga 
142621 cgtgagcacc atgcctcgac tcgcccacca 
142681 cgaaaccatc aacgcgttgt tggacgattg 
142741 ggtgagcttc gccgtgggac gctgcgagcc 
142801 cgatgtggtg ctgcatacag tcttcgaaga 
142861 acctcggcac ctggaattca tcgagcgtca 
142921 cgattcgaat cttcgcgact gatgccatcc 
142981 aaaagctaat atcaccatga gtgttttggt 
143041 cgccgtcatg cccgatggca cgttcaaaga
143101 gtacgtcctg ttgttcttct ggccactcga 
143161 cgcgttcagc gaccgggcca aggacttcga 
143221 gatcgacagc catttcactc acttggcatg 
143281 tggcaagacc gcttacccat tggtcgccga 
143341 cgtgctgttg gacggcggcg tcgcactgcg 
143401 tgtgcgtcac caagtcgtca acgacttgcc
143461 aatggtcaaa gccctccagt acttcgaaac 
143521 agaaggctcg cgtacaatca aagcggacgt 
143581 atacaaaagc taagcaagca atcgagctga 
143641 gccgacccct ggctccgctt gtgaccagaa 
143701 tgcggacggg cacgactggg tttgccacac 
143761 cgatcccgat ggttctttgc acgtgaactt 
143821 cgccacggaa caatccgtcc gaggcgacgt
143881 ccagtactca cttcgtctgg ctctgcgagt 
143941 tgaacccacg ctgtcgattc aaacactgtt
144001 cgccttgggc gattgcggcg tgatcttcga 
144061 accgatttca accgttcaat tgcatggtga 
144121 caccgacgcc ccgttcacga ccgatttgcg 
144181 cggcgaattc ctagaaaaag gcgtgatccg 
144241 ctccgaacag ggcgtttccg aggcggatct 
144301 gcccctgcct ttgacggctt gagcttgcgt 
144361 agtttggctt ttcaagaacg cgtcgggcgg 
144421 cttctgattg gccgacagtt cagtggaacg 
144481 ggcgtttttc ctgatctaca tgtcccttcg 
144541 ggaagacgcg atcgccgaag tcgacggcga 
144601 cgacacggat gcatccgagc ccggcgaagg 
144661 tgcggttgag cgtcccagct cgccaacatg 
144721 tggctatgtg atgctggtga cattgaacag 
144781 gaccgaacgc aaagagaacg gacgtctgaa 
144841 cctcggctac ttggccgccg acccgaccgc 
144901 cggaccggga actcccgcgg atctggcgac 
144961 cggcgacatc atcaccgggt tcaaccaaaa 
145021 agcgatgctg gaaaccaaac caggcgaatc 
145081 atcgatcgac ttcacgacca cgctgaccga 
145141 acacggtggc gacgatgaag tcgaaggcaa
145201 gggacgcgtc ggtcgccgcg aaattcaatc
145261 caccggcgat ctgatttggg atgcctccca
145321 gttgtccgac tctgagatga agccggcttc 
145381 ctcgctgaaa ccagacagct actcgctgga 
145441 ggaagcccaa gagctggctt accgaatcga 
145501 gtggtacagc aacaaaatca gtcccaactg 
145561 caaaaccacc gccgaaggtc acgaactggt 
145621 gaacgaatcc gaagcggatg accaaacact
145681 caacctgagt tacatcggcg tcgatgccca 
145741 aggacaagag tcactcaaga cattccgtcg 
145801 tgctgttccc gacaacaaag aacgcgccgt
145861 cgcagacgtg cccccgggat cgtcgctcaa
145921 acagcccgac gtgatggaag cctacggttt 
145981 gttcgtcgct aaacctcttg gcgggttgct 
146041 cttggcaatc gttctgctga cactgtgtgt 
146101 agcggcgatc aacgctcaac gcatgcaaga 
146161 gaagcacaaa gacgacatgg aagctcgcgt
146221 cgggttcaac ccgatggcgg gctgtgcccc 
146281 cttgtaccga acactctccg tcgacattga 
146341 cacctgggcg tcaaacctgg ccgctcccga 
146401 ggactacttg ggaggccgcg gcaccggttg 
146461 gatcgtggtt tcgttgttcc tggctcagca 



ctacgaaaaa caacttcgag aatcatgccg 
ttgggacttg gtttaccaaa gtcgcagcgg 
cgtgctggac gcgatcgccg agatggacga 
tccgatcgga tttgtcagcg accacatgga 
ccagctttgc cgagaacgcg gaatcaaaat 
agatttcgtc gagatgatct gcggattggt 
acccgctcta ggtgaactgg gtccctggca 
cacgccacgg cggcctccgg tggcgggcgg 
ctagaatcac cgacgagcca tcctgattcc 
agcggtgttt ctgccccctt ttcgatcctt 
agtcttcttc acgaccaaag accgcagcga 
ccaaacctat ctgaacgatc atcccggttt 
cgaatacgat cgtccggtca acatggactt 
ccgtgccgcc catgacgctt atcaaacagc 
aaaacccaat tgggccgaag ttcgcgtgtt 
aaccttcaca accacgtttc actcttttta 
cacccaaaaa gcccctgatt tcaccgccac 
cgacttcaag ttgtcggatt acaagggcaa 
cttcactttc gtctgcccaa ccgaaatcat 
agacctcggc gtgaacattg tcggcgtttc 
gaccaacacg ccacgcaacg aaggtggcat 
cttgaacaag caaatctcgc gtgattacga 
tggtttgttc ttgatcgacc aagaaggcgt 
gctgggccgc agcgtcgacg aagcactgcg 
caatggcgaa gtctgcccag ccaactggca 
ggaaggcagc aaagagttct tcggcgccga 
tcccatgtgg caatttcaag atttcgcggc 
cgcaggccag tgcttcaccg cgcactggaa 
cgcgggcggt tccaaaggaa tcagcttcgg 
gttgccggtt ggcgacgacg cgcttcccgt 
ttggaacgtg acgcttcctc aagccgacag 
cgtggaagcc acggaaactc gactggtcat 
gctggacacg cacccaacgc tcgacctgac 
cgagggcgac accctgggtg acggcagtcc 
aaccggctcc atcgccgtgc tgctcgcacc 
tcgagaaggg cacctgcaac tgcgtctctt 
caaggctcgg ccttggattg tgctggatcg 
ccaagcgtat ttgaaacgat tgtgcgaaac 
ttttggacct ccaggctgcc tcgcgcattg 
caagaatagc gggcttcccc acccgcctcc 
ccgactattt tcgttcatcc tctcgtcgat 
gaccatgttc gcgccaccac tgccaccgga 
aaccgttgaa ccagccgagg acttggttgc 
tgaaggcgaa gaagctaacg aagacacgga 
gtcgaccctc ggttccatgg atccgaccag 
tcgcggcggt ggcatcgaac gaattgaact 
ataccgacgc gtcgatgttc gctccggcta 
cacggatctc gggatccgag tcaacgtggt 
cgcttccggc gtgcaaggtg gcctgaaacc 
caacgtcaac aacctgtcca tgcttcgcga 
cgccaccgtc acggttctgc ggaacgagaa 
acacccgctg gacttgattc gattggctga 
cctctcgcgg ctttcctgcc tgctgacggt 
cggcgagaag accatcgaag gcatggtcga 
agacggggac aacgtttcct accagctgca 
gggtaaatca gtcggactgc aacggaccta 
catggacgtt cagatcgaca accgcgccga 
aggtgccaac ggcatcacgc tggaaggctg 
gggcggctcg gccgctcgcg acatcgtcta 
ctccggttac gcactgttga agcgggcgaa 
gttcgctccc gattccgcac caccggctcg 
gtacttcacc gtcgcgatgt tgccaccgga 
tgccgccgcg aatattatcg ccgacccaag 
caacgccagc ttctatctcg acagtgccat 
acaatcactc cgattgttcg ctgggcccaa 
gggcgactgc atctactacg gttggttctc 
gcacctgttt tccaacgtcg gcaactacgc 
gcgaggtctg atgttcccgc tttcgcgaaa 
gttggcacct gagctgaaga agattgccga 
ccgagcccaa cgggaactgc aacaacgcgt 
catgttcctg cagttgccaa ttttcattgg 
actgcgacaa gctgcgtttg cctcgtggac 
catgatgtat tactggggcg actggatgtg 
gttgggtccg tacttcaaca tcctgccgat 
aaagatgttc atgccaccgg ctacggacga 



146581 cttccgtgtc ccggctggtt tgtgcatcta cttcatcacc agcagtctgt ggggaatcgg 
146641 cgagcggatc ttggtcaaga aaacgctgcc atcgaagcct cacttcgatc cagcaaccct 
146701 gcagggcgcc gccgcgggtg gcggcacggt cgatggcaaa gtcaacaatt cggcgggtgc 
146761 caatggaaag aaaagctccg acggcaaacc caaaacgatg gccgaccgac ttcgcgagcg 
146821 attgggaact cccgaagaag aagccgcccc gctacccaaa gatcgcaaac gcccaccatc 
146881 gaagaagccc ggcaacaaga aacgccgcta gcgtcttaag acccccgacc gcgcagcggg 
146941 agggttcgga taacgagcgt tcagcgagat ttctggggga gggcaatccg cgccacgtcc
147001 cacgctcggc cccctccctc gcgtatgcct gaacggcgtc gcttgacctc ccccaaaact 
147061 tcgtttcggg agaggtgcgc agggtgaaaa tccgccggaa agcgttctct gaaaacggca 
147121 ggccgcagct tcgcgtactg cgaggccaga gtagaaccgt tgtctcaact gttccgccgg 
147181 gcaaccctca acggccaggc ctcatcacca tccatcaagg tccgaagggg cgacagatcc 
147241 attgtcagag tcgtcaggcc ccggaacgat gaaccactca aacagctcag gccgacggag 
147301 ggcctccgtc ggcaacaggt aactcaacac tcccacgaca acctcgtccg tgacaatgcc 
147361 ggtgaatgtc gaccgcaacg cggtgcaggc gaatacgccg atttcttgac gttgcgatcg
147421 gggcagggca gggcggagtg tcattccgcc acttcccaac ctttctttgt caaaaccagc 
147481 tcctgcgact tgtccactgt ccttgcttcc aagccatcaa ctcgaaacgt ggttggaaag
147541 cagcgtttca tgctcacggt cgtccgtggc ggcatctcga actcgtcgac actctcgttg
147601 gtttcgatcg acaccgtttg agctttgccg gaacggttta gcaccgagat ctcaatctgc 
147661 gctgatttgg gtcgaccaaa caaatgaacg gcgtagtaag tcgtttgatc tttcgtcgcg 
147721 acagccgccc ccgtttgcgt gatgtgtttg gccacgatgt tctcatggtg cggcggtgaa 
147781 tcgatccatc ccggaacaaa gatatcaatc aaactcttcg cagtgacttc accagtgttg 
147841 gtgcgataag caatgttctc tctcacgacg cagtactcgt accccgccgc ttcggcacgc 
147901 tccgccggcg tcttgccatc cgcgtgatga ccgtacttgc ccgattcagc catgaactcc 
147961 gcaaacttct tcgcggttgc ctgcaactcg tcgttcacct gcaactgacc gagccccttc 
148021 tttttgcggt agtcgttgat ttgcttgacg attgcctctt ccacttgcgt gaccaattcg 
148081 ccatcgcttt ccgaatcagc agcgaccgcc gtcgattcct cttgagccat caacggaccg
148141 acaaacaaca gacatccaag catcaaccaa ccacgcactg catcactcct ggtagagaat
148201 ttgaattgaa accgaatcac ccaatgagca gcatgatgca atcgtcatgc ctattctttg 
148261 cctcgtctgt ttcgtcacag cgccgaccca aatcaatcgg tcacgaccag cgattccctc 
148321 cgccttcacc gagctggtaa aacgtgtcgc acccctgatc aacagcgccc ctttcgctgt
148381 ccccactcac tgatttgaca ccgtcccaac tcagcatgcg aattcaagct ccttctttcg 
148441 tcgatctcga aacgcccacc ggtccgatgc ggacgcattt gtttcgcccg gacggtcccg 
148501 gccgctatcc cggtgtgatt ctttacagcg agatctacca gatgacggcg ccgatcgctc 
148561 gcaccgctgc tgtgctggcg ggacacggtt tgctggtcgc cgttccggat gtctaccacg 
148621 agtacaccga actgggcgag tcattcgcgt acgacaaaga aggcaccgac cgcggcaacc
148681 gattgaagat cgaaaaggaa atcgccgcct acgatgccga cgctcgatcc gtcatcgact
148741 tttttggcag tgacgaaggc tgcaccggaa aagtcggcac ggttggcatc tgccttggcg 
148801 gccacctggc atttcgggca gcgatgaacg aggaagtcaa agccggagtc tgcttctacg 
148861 ccaccgacat tcataaacga agtctcggca aaggcatgca cgacaacagc ctcgaccgca 
148921 tcccagagat tcaggccgag atgctgatga tttggggtag gcaagaccca catatcccag 
148981 ccgaaggacg ccggatgatt tacgacgcga tgactgctgc cggcgtcagt ttttcctggc
149041 acgaattcaa tggcgagcac gctttcatgc gagacgaggg gcatcgctac gacccagaac
149101 tcgccttgct gctcaactcc atggcgtgcc aacacctcaa ccgccatctg cggtagcggc 
149161 tgagaaagcg aatcccccgc cgattcgcga tctcacttcg caattcgggg ttggtcggaa 
149221 aatcgtccca cttacgaatg agcgacgatc gcgtatcctt tgcggttcgt tttcgcgaaa
149281 ttcggaccat ccgccagctc ctgggggatc gtcccgtagc tttcttcccc tcgaaccctt 
149341 tgatttcgac ccgtgttacg aacccacacc tgcggcgcgc ttcgcaaatc tgacgtcggc 
149401 tcccccgtga ccctgtgcgg atgggtggac agcaaacgag accatggcgg agccgttttc 
149461 atcgatttgc gggatcgtta cgggctgacc caggtcgtca tcggaccgcc ggaagccaat
149521 gaagcactga tcaaacaagc tggccacgtg ccaaacgaaa gcgtgatcct gatccgcggc 
149581 gtggtggcgg atcgcctgga aggcaaaacc aacgcgaaat tggaaaccgg tgaaatcgaa
149641 gttcgcagcg aacacttcga aatcctttcg gcctccgaaa cgcctccctt cacacccggc 
149701 caatccgatc tgccaggcga agacttgcgt ctgaagtatc gcttcctgga ccttcgccgc
149761 aaagaaatgc agcaggccct gatccgtcgc agcgaaatca tcaagtgcat gcgggactac
149821 tttgccgagc acgatttcat cgacgtcgaa acgcccatcc tcggtcgcag cacccccgaa
149881 ggagctcgcg attacctggt ccccagccgc gtgcatccca gcaacttcta cgcgttgcct
149941 caatcgccgc agctttacaa acaaattctg atggtcgccg gtttcgaccg ctacgtccaa 
150001 gtcgccaagt gttttcgcga cgaagacttg cgagccgatc gccaacccga attcacgcag 
150061 ctcgacttgg aaatgtcctt tgtcgactca gaggacatca tcggattgat cgatggcttg 
150121 gtcgccaaaa ccgcaaagca ggttttgggc aaggacatca cgctgccgct gccacgcatg 
150181 acctacgaag aagccatgcg tcgcttcggt tccgatgctc ccgatttacg attcggtttg 
150241 gaaatcgtcg acgtcacttc ggtggctgcg aagaccgact tccgagtctt ccgcggaacc 
150301 gccgacgccg gcaacttcgt tcgtggcatc aacgtcaaag actcggccct caagttctca 
150361 cgacgccaaa tcgatgaact gaccgcgttt gttcaacaag acttcggtgc caaaggtttg 
150421 gcttggttcc gcgtcgaaga tgacggaacg ctttggagcc cgatcgcgaa gaacttcgat 
150481 gaagaacact tggctgaaat caaagcactc atgggcggcg aacccggtga cttgctgatg 
150541 ttcttggccg acacctggga agtcacctgc aaaggcttgt ccggtcttcg caaacgactg 
150601 gccgtggaac tgaagctcta cgaagacggc gagctgaact gcagttgggt gaccgagttc 
150661 cccatgttcg agaaggacga agaagccggt cgctacgtcg ccatgcacca ccccttcacc 
150721 gctcctctcg aagaagacct gccgcttctg aaagagtcgc ccgagaaatg tcgtgctcaa 
150781 gcctatgact tggtcatcaa cggatcggaa gccggtggcg gaacgattcg gatccacgac 
150841 agcaaggttc aatcgcaagt gttcgaactg ctgggcatgg acgaagaaac cgctcgcgat 
150901 cgtttcgggt tcttgctcga cgcgttgcgt ttcggtgcac ctccacacgg cgggatcgcc 
150961 ttgggtgtcg atcgctgggt gatgctgttc gctggcctcg aaaacattcg cgaagtgatc 
151021 gcgttcccca aaacgcaaaa ggctgccgac atgatgaccg gagctcctgg cgaagtggac 



151141 cgcatcaatc tagttgcccg gctcacctcg ttggttcact aaccaaatct tgctacaaac 
151201 gttggttgga gcgaggtcgg agcagggcag ggctgtcgtg cgttcactgc cgtctggtct 
151261 cgccgattga atctcttttc tatcgacgga tcgcatcatc gccgctaatc gccacctcgg
151321 tttgtttgga gccaccagcg tcggcgtcgg cgcgatcgtg ggcggcggga tcctcgcact 
151381 ggccggagtc gcgtttgcga caacaggtcc atccgcgatt ctggcgtttg gcctcaacgg
151441 cgtcatcgcg atcctgaccg cgctcagttt cgcggagatg gcgtcgaagt tccccgaatc 
151501 cggcggcact tacaccttca gccgcaaagt cctctcagtc gaatccgctt tcacggtggg 
151561 ttgggtcgtt tggtttgcat cgatcgttgc ctccgttctc tatgcgatcg gcttcggtag 
151621 ttttgcgacg ctgttgctga gcgaactgta ttcgactcag ggttcggtcc cacattggct 
151681 tggcgaaccc tggagcgtcc ctgtcgtttc gcttgcgacg accgtcggta tcggcggcct 
151741 gatgacgttt cgaacgtccg gtggaggagc ctggatcaac gtcgccaaag tggctgtctt 
151801 ctcggttctg atcatcggcg gattctggat gctgaccggg caacccgttt ccaaaaccac 
151861 cgcggaattg cgtccgtttc tcgcatcggg ctggggcggt ttggtacaag ccatgggata 
151921 cagcttcatc gccttgcagg ggtttgactt gatcgccgca gtcggcggtg aagttcgcga 
151981 accgaccaag aacatcccac gagcgatgtt gctgtcgctg gtcattgcgt tgctgatcta 
152041 tttgccactt ttgtttgtgc tgaccaccgt tggtaccgat ggatcccaaa gcattcgcga 
152101 attggccgca tccgatcccg aagccgtggt cgcactggcc gcgcggcact acctcggcac 
152161 atcaggctat tggctggtgc tgatcgcggc ggttctctca atgttctcgg ctcttcaagc 
152221 caacctgttt gcggcatcga gaattgctct cgcgatgtct cgggacaaca cactcccgaa 
152281 cgcactgagt cgtttggccg ccggatccgg ttccccctgg atctcggtgt tggtgacgac 
152341 cggtttggtg tgcttgctga ttcagctttt gccagacatt gcggcggccg gggcggcatc 
152401 cagcttgatc ttcctggtga cattcgcgat cgcccattgg ttggccatat tggtacggca
152461 acgcagcgtg ctgacaccgc caccgttccg agttccgggt tacccagccg ttcctgtcgt 
152521 tggcggcttg gcttgtttgg cgttggctat cttccaaggc atcgcggtgc ctgaagctgg 
152581 aatcatcgcg gtcatgtgga tcgcgattgg cggcgtactg ttcctatcct tgttcgctcg 
152641 tcgagcccgg ctgaccgatg tgtccaacat cgccagcaat cctgaactct cgcgattgcg 
152701 cggtaactcg ccgttggtat tggtgccaat cgcgaatccg aacaacgctc gcgcgatgat 
152761 tgcgttggcg gacaccttgg tccccgccgc cctgggacgc gtgctggttc aaaccgttgt 
152821 cgtcgcacct cacgactggg atccgttggt caatcaccgg ccctccgctc aacttcactc 
152881 ggtgatgaac gaaatcttgc acgcctccgc gagtctgggt gtgcggtgcg aaaccctgac
152941 cacggtttct gcggaaccca tgatggaaat cgcccgcgtc gccaaactgc accaatgcca
153001 atcggtgctg ttgggcttga gcgagatcac tccggaggca cgcgacacgc cgctggaagg
153061 tttgctcggg caactatcca gcgatgttgt cgtgcttcgt gctcccaagg attggcagct
153121 cgaccaaagc cagcagattc tcgttccggt cggtggacgt ggcggccacg actacttgct 
153181 cactcgactg ctgagcagtc tctctcgtga acaacaaaga caagtcaagt ttctgcgagt 
153241 gatccccacc gatacacttc gtgccgatca aaaacgtatt cgcaaagagt tggatcggac
153301 aactcgagtc aacgcgggca gggtttgtga acgcgaggtc gtgcggagca atgatccagt
153361 caaaacaatc gccgaccgag ccggcacggc aggactgatc attctgggtg ctcaacgact
153421 gggaccacgg caaaagctat tcggtgactt cacccgcaag gtcgcgctcg aatccgactg
153481 ccccgtgatc atcatcagcc gccgcggcta aggaacgaaa acctataaag tcgctggcaa
153541 ctgacagact ccagacatcg gcaatctcat tgcctcgcac ctgctggaga gcccaatcgc
153601 ctgctgcgat tggtcgcgtt caggccgacg gagggcctcc gtcggcgaca agtaactcga
153661 catctcccaa gacggccccg tccggggcaa ggaccgcgag gccagagtag aacagttgtc
153721 tcaactgttc cgtcgggcag ccttcaaagg ccaggcatcg tcggcaacca agtccacttc
153781 caaccgttcg cgtcacgcga ctcgacgaca gcgacggccc acgcaaacat ttggggacaa 
153841 atgttctact ctgtttccca cgacagcccc gtccggaggc aggccgatgc aacgaacgaa
153901 tgagccacat tgccaatcac catcgttcca cccaacaagg tccgaagggt cgacacagca
153961 attcccccga gcagaaggtc gctggcaatt gacagatatc agacttcggc gagctcattg
154021 tctcacacct gctagggagc ccaatcgctt gctgcgattg actgtgatca cgacaactct
154081 tttttcagct ttgcgtcggc agtgttcatc tcatcctgcg acttgaattc gaccacgatc 
154141 gtcttcagcc gagaggtttc gccggcgacg atcgccacac ggcccttgct gacgcgcaac 
154201 catttcgcaa gtgaagcgat gaccgctttg ttggctttgc catcttccgg cacagtgtga
154261 accgacactt tcaaagctcc atcgtgcagc ccacccacgg atgccttctt ggctttggga 
154321 gtgacccgca cgcgaaacct ccatgtcaat ccatcgcatt gtcgatccag tcctgctcct
154381 ggctcgctca tcactcagca tcagttggct gtgacaactt ggctgatcga cgtccaaggt
154441 tgccatagcg gtgataatca tgcgagatcg attgctcgtt caggtatcgc aagcattcaa 
154501 tgcgtccgtg atccaccact ggttcatcaa tcaccgtgat gaagcggcgt ccgcaagctc 
154561 gacgaatcga ttctttcgcg tgggtcagcg tacgcattcg gctaacacca ccgccttcga
154621 ttcgctcgat caactgctcc acggattctt cgatgggatc aatcaatccg ggcaaccaat 
154681 cggcggtggt ttccagcatc gccgacttcg accgatccag ctcaggtgac agcgaaagcg
154741 tgaactgggt gcccacgcaa accgccgccg agatcgccac gatcgcatcc acaatgtcgt
154801 cgtccttctc gattcgaatc gtcatgccat cgacggatcg ataacgacga atattgtctt 
154861 ggccggtcaa tcgataggtg tcgtgttcca acgcgaaatc agttgcattg gcgatttctt
154921 gcgacgtgat cgctcgttcg attcggaact tctgaccatc gcccaacatc tcgtcggaat 
154981 agcgaccacg ctcgatccaa tcccgcatcc gatcgagctg gtcgtgcgaa gcttcgactt 
155041 cgtccgtcgg tggtgtccaa tcgctggcat gtgtgacctt ggaaagggca agcacataat 
155101 ttggaccgcc ggctttgata ccgggaccgt aggcggacag accgactccg ccaaacggtt 
155161 gccgcaacac gatcgcgccg gtcgttggac gattgatgta gaggttgcca gcaggaatgg 
155221 attctcgcca caatttgatt tctcgatcgt ccaaactctc cagtccgctg gtcaacccat 
155281 aaccggtcga acgcacgatt tcgatcgctt cttccagtcg accaaatcgc atcacaccca 
155341 tcaccggtcc aaacaactcg gtcgtgtgag tgaagctgcc cggttgcacg tcccatttga 
155401 cgcccggccg gtacaacgtt gggttgccat cgacatgctc ggacatcacc aaccaagatt
155461 caccaccctc gagttcctgc aaaccacgtc gcagtttgtc ggttggcgga tcgaccaacg 
155521 gggtcacgcg cgtgtgcaaa tcccaggcgg aaccaaccgg cagacttcga accgcgtcgg 
155581 ccaagatttc tcggaattcc ggatcgtcga acacttcgtt ttcgagcaac aacagcgacg 



155701 agtcgcgatc cgccatcgcg gtcacgatcg tgacattctt gccacccgtt tccgccaaca 
155761 agtgcaaatc ggggcgaaca tccaacatgc gtttcgcggt cgatgtgcca ccggtcagga 
155821 tgaccgtgtc cacgatcgga ttcttgacca actgctcctc agcgacctca tcatcacaaa 
155881 tcacaaattg caatgcgtcg cgaggaacgc ccgcgtccca gaatgcctgg cacatcagcc 
155941 acgccggcaa caaagtttgc atcgatggtt tcacaaccac ggggtttccc gctgcgatcg 
156001 cggcagcgat tccgccacat gcgatcgcca atgggaagtt ccaaggcgtg atcacagcaa
156061 cgactccgcg tggctcacac tggatgcctg cacattcgtc ccaagctttc atcgtcagcg 
156121 ggtagaactc caggaagtca atcgcttcgc taacctcagg gtccgcctca gccaccgttt 
156181 ttccagcacc ggccaccatg cttccgatcc aatccccacg tctttcgcga gccaattgcg 
156241 cgactgaacg taggactgaa taacgctcat caacagatag atccgccgcc caggtcgatt 
156301 cgctggatgc tctgacggcg gattcaactg cgggcgaagt tgcattcttg taacgacaag 
156361 tcacgacgcc gggacgcgag acatcgtcgg tggtgaacca ttccgctgaa acgcgatcgt
156421 ctccgtccag cgaatccacg tcgggtaaac caaccagcac cggcgtcaca cgagttgcct 
156481 gctcgtcaca aagtggcttc caacgatcta atgtttcttg tgcccatttc gaattatgag 
156541 gcagcgacca atcggtatcg ggttcattca cgaactgcgt ccaatgctcc gccgcgggtg 
156601 gctgcacggg agtctggtat cgattctggg ttcgtcgagg ctgcacggaa accgtttcgc 
156661 aatccatcaa cgcgtccttg aaccgtcccg ccaacaattc aaagtcgggt gagtccggct 
156721 gcaatcgata cgcatgacgc aggaagttct cggcgccggt gttttcatcc aagcggcgaa 
156781 tcaaatacgt gatcgcatgc aagaaatctg attttttgca agctggagcg tacaacaaca 
156841 tcgccgcgtc tcgttccgtc aacgcccggc gttgatgatt ggccatgccc tccagcattt 
156901 cgaactggac cgaatccgag gctcctgatc gctgagtcca aatcatcgcc agcgccacat 
156961 caaacaagtt gtgcgaagcg acaccaacgc gaatgttccc atccgcagca gcactgatca 
157021 gctccctgag catgcgtttg aagttcgcat ccgtatcgtg cttgttcgtg tacggagcca 
157081 gcggccatcc accaacggaa gcatgcactc gttccatttc gaggttggca cctttgacca 
157141 aacggatcgt caacggagca ccgccaccgg cgactcgctt cgtcgaccac tcaatcaggt 
157201 ctcgcatgac taaacaagaa tctggcacgt aggcttgcaa cgcgatccca gcttgaacgg 
157261 aatccagatc gccgcgttcc aatgtctcgc gaagcacgtc cgcggtcaga tacaaatcgc
157321 tgtactcttc catgtccaag tacacgaact tgtctttggc cttggaacca tcgtccatgt 
157381 ggttggcgtt gcgatacaaa cgttccaagc gatcggcgac catgcgaatc gtgtgttctc 
157441 gtgccagcgg agacacttgg ctgtagagcg tggtgagttt gaccgagatg caacgcacct 
157501 cgggcatgcg aagcaattcg tagaacgact gcatccgctt tcgcgtttcg tcgtcaccca 
157561 gcaacgcttc gccgagcagg ttcacattca tcgccacgcc gtgacgctgg cgagatgcca 
157621 agtgcggagt caatagatcc ggttctgctg gcaaaatcac attggccgtt tcacggcgca 
157681 tcttttcttt gaccaacggc accgccacgc cgggcagata ctcgccaaag gattggaacc 
157741 ctttcagcat ggcctgttcg accggactga agaatcgcgg aataccttgc agatccaaca 
157801 agtgggtcat ctggtccgcc acacgggccg gcgtgtgtgt gcgaaacgct tggtcggtca
157861 tctcgaccaa agtcgccttg tcggcatcgt ggccgatcat tcggtccagt tccgattgct 
157921 ggcgacgttc ctggggtgtc tgcaatgcac gcgactctcg caaaagatca gccgccatct 
157981 caatcgcggc agcggcatct tgctcggggt caaacccctt ccatgttgac gacaaggttg 
158041 agtcgggctg attcaagaca agatccttca caagagattc tgcggaaacg aagatcgagg 
158101 acacgaggtt cgccgatcaa ccattgacgg ctaaactttc gcaacttgca accgcctgtt 
158161 ttcacaacga tttctttatg gcattccgcc gcgacccaaa gacgcctgca atgacgactg 
158221 atttgccgca acatgtgttc cgtttacgag ttcgctacga cgaatgtgat ccgatgggac 
158281 tggttcatca ttcaaattac ttgcgatatt tcgaaatcgg acggaccgaa ttccttcgat 
158341 cgtcgggggg ccggtaccgt gaagtggaag aagcggggct gtatgtcgtg gtggtccaca 
158401 tcgattgtcg ctatcgtgcc tccgcacgtt atgatgatga aattgatatc gtgacgagaa 
158461 tcgccaaaat cactgcggcc aaaatcatcc atgaatatga gattcgccgc ggagacgagg 
158521 tgctcgtgca agcgaccgtg accttggcgg tgatagataa gaccggtcga ttgcaacgtg 
158581 ttcccgaagc attgttgacg taaagctctt tgactctttc cttcctggtg ccctcccaat 
158641 gcgtcctgac caacctaccc gatcgctcag caaccgccgc caattccttc gccgcatcgg
158701 agtcacatcc gccgtcggct acgccgccta cttggctcat caagtgagtc catcaccact
158761 gagcgtcaac ggtttgccgg ggcagggcag ggcactttgg gccgaagaac tcagcgaatc 
158821 caagacggaa accagcactg gttatctcaa gcaatcattg aaagccggca tgattcgggc 
158881 cgggaaatcc gacgcagcca acccttggat cgaacggttt cgaattgcca aggatgccgg
158941 ctttgaagga gtcgaaccca acacgtcacc cggtatggat gtcgaagcga tggtcgccgc 
159001 ctcggcggaa accggtttga cgattgacgg aaccgtcggc ggctatcact ggggtacaac 
159061 gcacaccagc tccgatgcag cgactcgcaa gaaagcacag caattgctcg aggagtcgct 
159121 gcaacaaacc gccgacttgg gcgccaacac gttcctgatc gttcccggcc acggcaaaga 
159181 cggcacggcc gaagaagtcc gccaacgagc tttcgaagct cttgatcgcg ccgttcccct 
159241 ggccgagaag cttggcgtca aaatcttgat cgagaacgtt tggaatcact tcctctacga 
159301 ccacggtggg gacgaaaagc aatcggctca accactggca gacttcgttg actcgttcaa
159361 cacatcgtcg attggcgtcc aattcgatct tggcaaccac tggaagtacg gcgatgtggc
159421 cgagtgggtg aagacattgg ggcatcgcat cggaaagctt gacatcaagg gattctcgcg
159481 cgaacaggga cgcttcaccg acgtcaccga aggtgacatc gattgggcga gcgtccgaaa
159541 ggcactcgct gagattaagt tcgaaggctg ggtcgcggcg gaagtcggtg gcggcgatgc 
159601 tgaacgactg aagaagattc acggccaaat tgaaaccgcg ttgcactgct caaaatcgct 
159661 ttcacaaatc aaggcggaga ccgcgtgacg gcgtcgcaac cactggacca agccgaccaa 
159721 gactctgaat catcctccgc aggttcatcc gccgcggaga cagagcacgt cagcgtgatg 
159781 ccaaatgaga tcgtgcaatg ggttcgagaa atcaacccga ccacgatcat cgacggcacc 
159841 tacggtggcg gtgggcacac gcgtttgttg gccgaggttt tggccgacgc gggttccgac 
159901 tcagacaccc cacgtgtgat tgcgattgat cgtgatcccg ctgttgttcg tcgggatgaa 
159961 caaaagtctt ggatcaaaga tgatccgcga atcgagctgt tcctgggcag ctacgaaagc 
160021 tcgcccaagg cactcgatgc gttggacctg acgcacgcgg atgcgttggt actggacctc 
160081 ggtctatcca gtgatcaact tgccgaccgg aatcgcggct tcacattcac aattgatgac 
160141 gcggaattgg accttcgttt cgatccagaa aacggagtac cagcgcaccg ctggctgcaa 



160261 cgacgcattg ccaagcagat tttcttgcgg gctcgtgaac gcaatccggt gaccaaggtg 
160321 ggtgaactgg tggagatctg tcgccgctgt gtgcctcgtt cacgcaacca tgacattcat 
160381 cccgcgacgc gaacgttcca ggcgctgagg atcgcggtca atgatgagct cggcggattg
160441 acccgcacgt tgcagtcggc tcccgactgg atcgcccccg gtggtcgagt cgcggtgatc 
160501 agctttcact ccttggaaga ccgcatcgtc aaaaatgctt tccgtgagga tcaccgatgg 
160561 gaaatcctga cgaagaaacc gctgcgtccc accgacgaag aggtccaagc gaatcctcgc 
160621 agccgcagcg cgaaactacg cgtcgccaaa cgagtggagt gaacacggca acccgtagca 
160681 cgttggttcc gaacgtgaat ccgcccgtct cggtggagga ccatcgagca acagttcgaa 
160741 atgcctcatt cgcctcgaga aattagccaa ccttccggcg gtggtggtag gtcgaatgcg 
160801 ttgtgatagt ctcgctggcg catctcgatc caaaacaatg tctcttcggc aagtgacttg 
160861 ttcgcgggag agaacgccga ccagcgaacc gcctcagtca ctcgctcctt tgccatggtg 
160921 aagtcttgtc gtgctgccgc ggcgatcgac gcgttgatga gtgctgccgc attgcgtggg 
160981 tgcgattcca acacctgctg ccaacgatgc tccgccgccg gccaattgcc tgatcgggcc 
161041 aattcgtttc cacgtcgaat cgctcgactg cctggagaca atctcggcga agccaatgtc 
161101 acctgttgcc gatcaacgct ggcggcgagc agtcgtttgg tttctagaac aaccgctcgg 
161161 cgtgttcgtt cagcgacatc gggaatattc aacagatgcg gataccgttg cgagatcagc 
161221 gcctcgtcga ctgaaaccgg catcccggcc gattccgcat cggcctgcag tccagtcaat 
161281 cgccaagaca ggctgacttt atcgtcgcga tcctcgtgac cggtagcctg cagaatttca 
161341 ccatgcagca gataacgcaa cccttcgcgt cgagcggccg cgctgactgc catatcgttg 
161401 ggttcctctt cgaaacctga aacgaggcga atgttcgctg ggctttccag ttgctccgcc
161461 gcgatcaatc gccaacgcga gggttggtcc gtcagcagag catcctcaat cggttcggcc 
161521 cactcgtcag gccctgagat gcccataaaa gcgactccag cgggctcgac ggccagctgt 
161581 ggcggttgcc aaacgtggac tggggcggtg gtcttgcagc cagtcagcaa gatccagcag 
161641 gcgcacaccc ctatccaagc acctcgggac acccaaggtg agggccagcg tcgctcgatg 
161701 ggaatcgccg agcctagcat gcagaaaatt agcttccttt cgaacacgaa ccggatcccc 
161761 gcggaaatgg acccgaagcg gacggtagcc aaacagagcg tctcgcggca agttatcttg 
161821 tgtgaactag acttagaaac tgtccgtgaa actcgttctc gccccacgct gatttttttg 
161881 tatggcttcg ttgttcgtca ttcgaggacg tgaccaagga aaacacttcc agctcgcacc 
161941 gacggtcacg cgactgggac gtgaatccag caacaatgtg cagttactcg acaacgaggc 
162001 ttcgcgaagc cacgccgaaa ttcgcggtga cgggacgggg cgtcgttacg aactggtcga 
162061 cctgggcagc agtaacggca cactcgtcaa caatcgcaag atcacacggc acgtactgac 
162121 tagcggcgac cgagtggaga tcggttcgac gctgctaatt tttaccggca ccggcaacgt 
162181 gtcggctctc gatgcggctc acggcgtcga catcgtccgg caagtccgcg ggggcgatgc 
162241 cagcaacatt gtttcttcgt tgtcccgcga aggcgtcgtc tcgccataca caccgggtgg 
162301 ctcttcatcc aacggcccat caaccgataa acccgcgacc tcgccgacga ctctgccgcc
162361 accaatcccg ccacctgtcg aaaatcctcc gacgccatct gaggacgaaa gagtcgattc
162421 ctcggcttcg ggttctgatc cacctggtca gttccacgga ccgcctgtca tcggtcgact 
162481 cgatgcagac cgttcactgg aagtcatgta cttgaccgcg atcgcggtgg gacggaccga 
162541 cgacttggat gaggtgctgg accgtgttct caagttagtt tttgactggg tcgaagccga 
162601 tcgcggatgc atcatgcttc gcgacccgga atccaagcga ctgaccccgg ccgctcgctg 
162661 cgatcgcgaa tcagcgaccg gtggaacaaa acaaaaccct tcatccaaac cacaagaccg 
162721 tatccaaatc agccatacga ttttggatta cgttctgcaa cacaaacagg gcgttcgcac 
162781 aaacgacgcg ctcgatgacg aacgattcga ttccgccgct tcgattgttc aaggcggcgt 
162841 tcgcgaagcc ctttgcgttc ctttgcaggg cagatacgaa atcgtcggca tgttatatgt 
162901 cgacacttac accacaccag gagaactggt ccgcaagggc ggtgcgacgc gattccatga 
162961 cgaacaccta cggctgatca ccgccgttgg gcaccaagcc gcactcgcga tcgaagacac 
163021 gttctattac tcagcactcc tgcaaggcga acgactggcc gcgatgggcc aaacgatcgc 
163081 tacgctttca catcacatca aaaacatcct tcaaggcgtt cgaggcggca gctatctgat 
163141 cgaatcaggt ctgcaaaaag acgacaccga tgcgatccgc cggggatggt ccatcgttga 
163201 tcgcaaccaa gaacgaatca gcaatctcgt gctcgacatg ctgacgttct caaaggaacg 
163261 cgaaccggat ctgcaggaag gcaatctgaa cgaagttgtc agcgacgtgg tcgaactcat 
163321 gcagagcagg gcagggcaga gcgagattga actggtttcc gacctggatg ccacattgcc 
163381 aatcgccacc tttgatagcg aagcgattca ccgcgccgtt ctgaacttgg tcaccaatgc 
163441 aatcgatgcc gccacgaaaa cagttttggt caaaacgttt tacgatgcgg ctcaaggctg 
163501 gatcgtcgat gtggaagacg atggcgaagg tgttcccgaa gaagaccgtg agcaaatctt 
163561 ctcgctcttt gagtccaaga aaggtgcacg aggcacgggc ctgggtttgc ccgtcagcgc 
163621 aaaagtgctg cgtgaacacg gaggcacatt gaccgtcgac gatgcggcaa gtggaggagc 
163681 ccgtttccga atgatcttgc cggccaccgg caccgatatc aacgaactct cacgcagcga 
163741 aacccaagcg taggggtagt tagaacggcg tgaaggacat tcgaagactg atgatccccc
163801 caattcgatt gtccacaccc gggcgagcct tttcatcggg gaattgagag aacgacttcc
163861 gctccttgct tccaaaagca acacgcggat ggtgcaattc atcagttcag gcttcgttca
163921 aaacctcgtt gggtgccatt gcaggctgcc gacacatgag cccaccattc cgattccagc
163981 accagtgttc gatttggcaa ttgaatacgg gaaagttaca ctgctgtggg ccgattggac
164041 tcaacacgcc ggagccgaac gatgaaaaca cgacgattat cacgcgaaga gatcggtagt
164101 cgtggctcgg agatctatga aacgcaattg aaaaaactgc ttgaacaagg ctcgcgagac 
164161 cagttcgttg cgattgatgt ggaaaccggt gactatgaag tggcggatga agctcatgct 
164221 gctggaacac ggttgcgcca gcgtcgttcc gatccacagg tcttcatcgc caaggtcggc 
164281 cattcagctg cattccacgc cctgaggctt actagcggtg acgggatcga gcggaatgat 
164341 tgaaggcagg atcgacaata acgttgaagc cttcgttgac ttagagattc aacttgatga
164401 cacgacccac acgatcgctt tcttggttga tactggtttc aatggattcc tctcgatacc 
164461 attccgcttg gtccaccaac tcggcttgcc attgtcggat gttcagcagg gcgtgaccgc 
164521 cgacggacgc tcaagcttct tcgacacggt tgaactgacc gtgatctggc atggaaatcc
164581 gctgaccgtt caagcccaag tactcgacga acccttgctc ggaacgaggc ttctgcgcgg 
164641 ttccaaaata gaagccgctt ggattcaggg tgggatgttt caaattgaat tgattttaga
164701 tggctgatct tttgtgatcg ttgttgcaga gcttcacaaa atggaagtca aaaaacagga 



164821 tgcgactcgg tagcccgttc ggaccagcga gctggatcga atcgatcgct cgacctcgga 
164881 caacaatcaa ggcaagggtg agcggataat gatccatcca agggaaatgc gaaagcaatc 
164941 agtcaagacg atgaatgaac ttggattggt ttgcctttca ggctgagcga tcagggcaag
165001 atgcttttcc tgtgcgagcc tacgtgattc gaaagcgtcg tcatgcactt agttgaatct 
165061 cgagcaaaga ggcatggggt aacgtagttc ttgataccta ggactattgt catacgttga 
165121 ttcgctgacg tttcatgagc gagacattcc ggccaagttg agatgccgtc acttaaactc 
165181 caacgagtca aagaactccg atggcttgtc caattcactt tcgttggatc gtggcttgcg 
165241 ttggccgcca accagatctt tgatttcgga gtacggcgag tacaccgcga ccacaaggtt 
165301 tcgattgtga acatagattt cgcagtccga cccttcgatc gctcccacac cggattgcat 
165361 ggtccccgcc attccctcca caccagatcg ggtaacggga cgaggattcg acagacggat 
165421 tctcatgtct cgcaacaatt ccgcgatggg aatgttcgga tcgatgacag ccggcaaatc 
165481 agccaccacc agtgaatact gagatccggt ttctttcctc cggccggtga gctcatgaac 
165541 ggtcgtgcct ggaaactgct gcgacttgtc tctccggtct tcacccgcag gcatgctgac 
165601 ggtgtagcct ttgtacgtta ccgatgtcca accactgtga cgcgacccca gatacccaac 
165661 cgcaaagatt ccgccggcgc acaacaagac acacaatagg ccaagccccg cgatcgtgct 
165721 gatgatcttc acagcagcct tggatccacc acctcgcttt ttcgcgggct tgggcggttg 
165781 gtaaacggtg accggtcccg agggacgaaa gactgagctc ggcggtgccg ctgccgaaga 
165841 acctaaactg ggcaggttat caaacaggtt ttcttcggga acatggaccg tcacgggcgc 
165901 ccgaactgca agcatcaact cacacttcgg gcacttcact tttcctgcgc ccatcgtcgc 
165961 gggaatctgc aaagtggttt tgcattgagg acacacaatc cgacgtccgg ccatcgaaac 
166021 tcccgcggtt catcatcgag caatgagcgg gaaaaagtgt cacatacttt tgatgcaaag 
166081 cacgcagcag gccgttccgg caaaaagcac ctgacacttt ctcccaaccg ttcttcgaca 
166141 gtgatgcaac agttggattg gcccgtgtgc cagtttaccc cagcgcgacg gcgagtgctt 
166201 ctcgaacgat gggcagaact cggtcttgga cctcatccaa cgtgccttcc aaaaaggctc 
166261 ccgcggcagc tttgtggcca ccgccgccaa attgacgagc gacctcgttg caatccaagc 
166321 tgcaacgact ccgaaagctc aatttgaaac cacctcgcac ttgcccgacg aagattacac 
166381 cggcttgggt gccacgaatg gccaacgtca aattgatcgc gtcctcggtg tcattgggtt 
166441 gcgcgccaca caactcaaag tcttctttgt tgaccgatgt ataaaccaac gccccatcca 
166501 tctcggtggt cgttcgagac agaatcagcc cgcgaagttt cagccggccg agcgtgtcac 
166561 gttcatacag atcgccgtag acttcgctgg gaacgacacc tgcatcgatc aaacgagcga 
166621 tcgtgcggta tgtgtccgat gtaacgctcg ggaaccggaa ccatcccgtg tcggtcgcga 
166681 tggcagcaaa caaaggcgtc gccatcgttc gcgtcagtgg aacattcagt gcatcggcgg 
166741 cttgaacgac caaatgacct gtcgcttcgg attggtagtc cttgtacatc ctcgcaccca 
166801 actcatcttc acccacgtgg tgatccagca ccatcttgtc ggcgcgagac gcacggatga 
166861 catcgcccat gtccccaagc tgagcccatg cgctggtgtc gagaatcata atgcagtccg 
166921 ctttcacatc ttcggcctcc actgaatcgc ccaacacctc gatgttgcca gcgggatcca 
166981 agaacgacaa agccggcggg gtccggtgag cattgatgat gcgaacatcc ttcccaaccg 
167041 ctcgcaggac ttctgccatt cccaattcgc tgcctaatgc atcgcagtcg ggacgaatat 
167101 ggctgaccaa aacgaacgat tggtaatgtt gaatttgatc aacaaatgct ttccagttaa 
167161 ctcccacgag cgggtctttc tttcaaggtc gagattgaag agatgaaacc gtctcacgaa 
167221 cggtttgcaa atcgatttgc aattgttgtt ttagggcgga cgcgtcggcg aacttgatca 
167281 cgccccgaac ttgactcacc aaagtcagcg acaacgattt cccataaaga tcgccatcat 
167341 agtcgagcaa gtgggcttcg atctttgtct cttggcgttc gttgaacgtt gggttcggac 
167401 caatgtgaac cgcggcggga tgattggtcg gtccatcgca ttcatctccg cacgccagcg 
167461 ccgcgtaaac accgtgctca ggaagcaagg tcgtcacgcc gcccaaattc gcggtgggaa 
167521 atcccaatgt acggccgcgt tgttcaccgg tcgtcaccac accggtcaga cggtagcgac 
167581 tgccgagcat ttgattcgcc aagacgatat cgcctgatgc gattgcttcg cgaatgcgtg 
167641 aactgctgac catgcgttcg tcgcgaaccg atggaaccac gatctcgacg tcaatttgac 
167701 gctccgcagc caactctttc aaacgagttg tgttgccggc acgatcgcgt ccaaagaaaa 
167761 aattcgggcc ttcgatgatc gccttggctg acaactgttc cacaatcaat cgctggaaga 
167821 attcttcggc ggtttgattc aaaaaatcgc gagtggcctc gcaaaccagc aaatgatcga 
167881 tccccgaacc ggacatcaac tccgcccgac gcgacaaact ggtcaaactg ggtggtgccc 
167941 cgtgtggccg cagcaccgat gctgggtgcg gatccatcac gattgcgatt gctttaccgc 
168001 caacccgctg agcatgccat cggacacgtt cgagcaaagc acggtgtccc aaatgaaccc 
168061 catcaaaatt gccgatactg accgcaccac cctgaagcga tagatgcagg tcggagtcgc 
168121 ccgcaaggtc gctcaagtgg atcagggaag tcattcaaac ggagcagggc aggggagagg 
168181 caaaaagaaa cgaacagatt ggcctcagta tgcctgattc acctcgattt cgtcagatcc 
168241 catggcgggg cgcaatcttt gattcagtgg cttcagtccg caggctgagt catttcatcc 
168301 gcgaccaagg ccaacgcatc cagtgcatgc agagaagtca tgcccgcgtg actccatcgg 
168361 gccgccgcct cgtcaccaca acgaccatga atccatacac ccaacttcgc cgcatcggga 
168421 ccgctcaaac cttgcccaag caacgatgtc acgatgccgg tcaacacgtc gccgcagccc 
168481 gccgtggcca tccctggatt gcccgttgtg ttctgccaaa cctgctttcc tccgtcccca 
168541 ttttgatacg caacgtgtga cggtccacct tttacaacga tggtcaaacc caatcgacac 
168601 gccaactctg ccgcggcttc gacttgggca tcgacatcct tcgcagacgc tcccgtcaaa 
168661 cgctgcagct caccaggatg tggcgtcaga acgcacgcgg catcggcctt gcttctttcg 
168721 aaccggccat ctgaaagcca atcgttttga gcgatgatgt tcaacgcatc cgcatccaaa 
168781 accaacggca gatccttttt cgcgagcagc ccctccacga tcgtcgcccc acctgatccc 
168841 gtcgtcatgc cgggtccaca accgatggcc gcctgagctg acaatcgatc cttcaacgac 
168901 tgccacgccg gatcggcaaa cttctccccg tcatcgccca gcccaattgt catcaacgcc 
168961 ggatgaaaac cggctacaca atctagaatg caatcgggaa ccgctgccgc caccaaaccc 
169021 gatcccgtgt gcaatgcagc gatcgatgac aacgcgatcg agcccgccat gccgcgtgag 
169081 ccaccgatca acaaaacgcg accaaaattg cctttgtgag ccgatgcttc gcgatgcgga 
169141 atccgcagcg gcggcgggct ggtagactga gtctcaatgg atggcatttc aggctcgcag 
169201 gttgattgat tgtcgcacta tgcaggtcgg caggaatgat cgggtagaac ctcacgtggg 
169261 cgagtctctc tgagactcgc aaatgacagc gtctcggaga gacgccgcta cgtgatcaaa 



169381 tcccgtcgcg atcaacgtat caatcggagc ggatgaaatg caaatggcgt gcaaacgttg
169441 caccgctgaa ccggatcctg attcatgagt cgcgtggctt tcccgagcca aacttctcta
169501 cgtttaattc gtcgcaattt ccgtttgtta tgcagcccac accgctccgt cgaattctgc
169561 tcggcttggg cgtgttttcg ctgatttgtg cgatcgcgat catcgcttac atccaaatgg
169621 gatggtccgt cagcgacgcg atctacatgg tcgtgatcac gatctttggt gtcggctatg
169681 gcgaagtcaa acccgtcgac acacccgctc tgcgaacact gacgattgcg atcatcgtgt
169741 tggggtatgg agccgcgatt tacaccgtcg gtggattcat tcagtttcta gttgatggcg
169801 aactacaaag ccttttgagg aatcgaaaaa tgagccaagg cattgccagt cttcgcgcac
169861 acacgatcgt ctgcggattt ggacgcatgg gtgtccgcgt ggccgaagaa cttcaggagc
169921 ttggccaacc gtttgtcgtg atcgatgaaa acgcggcccg cgtcgaggaa gctcaacaag
169981 cgggaatgtt ggccatggtc ggcaatgcca cggacgaaga catcctgatg gcggccggaa
170041 tcgaccacgc tcgcggccta gcaacgttgc tgcccgacga cgctgccaat gcgttcatct
170101 gcgtgacggc gagagacctg gccgagaaag tcgaaattgt ttcgcgagcc gagaaccatt
170161 ccgctcaaaa gaaactgatt cgctgcggtg ccaattacgt tgtgatggca gcaaccatcg
170221 gtgcgatgcg ggtcactcag ttgcttgttc gcccgaccgc gtctgcggtg ctggaatccc
170281 atggactgtc gcatgggatc agcgaagaac tgtctgccat cggtttgaac ttggaggaac
170341 tgcgactgac cagcagctct ccattggtga gcaaaccgct cgccgaaatc caagttcgcg
170401 gcaatcgtgg ttttctaatt gtcggtgtga agcgtggcga ggatccgatt cagatgaacc
170461 ccagcggcac tttgattctg gaagtcaacg atatcgtgat cgtggtcggt caccaaaacg
170521 acattgccga attgtgcttg gctcacaaag tccaacgaca agagattctc tatcggggcg
170581 tcaaaggctg atcgcgatcc gccatcatgc ttcttctgcc gaagccgcct ctgatttcgc
170641 attggcaatg ccgctggcga tcaaaccagc gaagtagccg cctttgaatc ctgcatcaat
170701 attgaccacc gcaacattcg cggcacacga tgtcagcatg cccattagtg gcgtcaggcc
170761 cgcaaagttg gctccgtatc cagtgctggc gggaaccgcg atcacgggtg tggccacgtg
170821 cccggccaac gccgccggca gagcgccttc cattcctgcg accaccacaa tagcggcagc
170881 cgctcgcaat cgggggacgg cggccaacaa acgttgaggc ccggcgaccc cgatgtcgtc
170941 gatgcgttca caagcgaccc ccatccatgc caatgtctcc gcggcttctt ccgccacatg
171001 cgcatcggtg ctgcccgcgg tgacaacggc aacgtgggtt gagtgacctt cgacaccgat
171061 cggatccaaa ttttcgtccc cgcaaccgat tctcagcgtg cgtgcggacg ggttgtaccg
171121 agtgttttcg aaaactcgcc gcaactgagc cgccgcggtc gcatcaattc gcgtgatcaa
171181 acacccttgg ccgaccgcga cctgagtttg cacgatctgg gtcatcagat cggttgattt
171241 cccctcgccg tagatcactt cgccaaaacc acagcgggcg agacgtccca aatcgacggt
171301 cgcgccgggc accgtttgca tgtccgcagc tgacccgatc gagctggcct gaatggattc
171361 cgccgcggct tccacgtcgg tttgccccga agcgaggtcc tgcagaatct gcagcagggc
171421 aggggagggg ggttgattgg agggtaggtt catagcgtgc tgcttgactt tccggccagg
171481 atccttctga ccgtccttgc gtgtttcgca ccggacaact taaatgcgag gccttcaaac
171541 atagccgatt cccccgcatt tggatgcaga tctgagatga aaagttgtgt gctggcctat
171601 tccggtggtc tcgatacgtc cgtcattctg ggctggctcc aggaccaagg ctacgaagtt
171661 cactgtgttt acgtggacct cgggcagccc tgtgaggatc gcgatgccat catggagaag
171721 gctcgtacct gcggggccaa gtcctctcgt ttggtggacg ttcgtgaaga actgtgccgc
171781 gattttgctt tccccgtgtt ggcgtggcaa gccaagtacg aacaaatcta tctgctgggc
171841 acctcgatcg ctcgcccgct gatcagcaag gtttgcttgg aagtggctcg cgaagtcggc
171901 gcgaccgcct acgctcacgg agcaacaggc aaaggcaatg accaatgtcg tttccaactt
171961 gctgccgaag ctcttgatcc aaacatcgaa atgatcgctc cttggcgaat caaatcgttc
172021 cgcgatgcct tccctggccg caccgaactg atcgagtact gcgacgtgaa gcgaattcct
172081 gtcaaagcct cgactgcaaa gccatacagc agcgatgaaa acgtcctgca catttcctac
172141 gaagcgggca agcttgaaga actcgacgtc aacggcgtgg agctggtcga attcggaatg
172201 ggcgtttcac ctcaagatgc tcccgacaaa ccagaagaag tcacgatcgg ttttgaatcc
172261 ggcgtaccaa aaacgttgaa tggcaaagcc gtcaacgctt tggaaatggt cgagcaactc
172321 aatgacatcg ctggacgcaa cggcgtcggt cgcatcgaca tggtcgaaaa tcgtttcgtc
172381 ggcatgaaga gccgcggtgt ttacgagtcg cccggcatga ctgtgttgta cgacgcattg
172441 atgtacgtcg aacaactcac catggaccgc gacctgatgc accttcgcga ccgcatggct
172501 cccgaagtgg ctgaaatggt ttactacggt ttctggtaca cgcccaagat ggacgcactg
172561 atgtcgttca tcgaaaccgc tcagcgtcca gtcaccggcg aagtcacgtt gcaactctac
172621 aagggcaaca tcatggtgtc ttcacgaacc agcccaaaca gcttgtacga cgaagagatc
172681 gcaaccatgg aaggcggcgg ctcgtacaac caagacgacg ccgaaggctt cctgcgaatc
172741 caaggcttgc cgtcgcgagt ccaaggccgc gtgtcaccac gcaagttcta acttggatca
172801 acgattgaca gggcaacgac tcaagtcttt gccaactcag catcgcggcg aggttcaaaa
172861 cctcgccgtt tttttatgga cgcgagttat ctctccaatg ctcgactcaa ggttgcgtcg
172921 acgcccattc gtccagcaca tcgtccgata gcggctcacc tcccgtcgga gtgatcaatg
172981 cctggaccgt caacggtggc gttgtatcgg gcaagaagta tccgctgcct tccaccgtcg
173041 cgacgcagac acccccatcg gttgctcccc ccaaaccatt tcccgaaggc ggattgatga
173101 gtccaccgac cgacccgata tccaaatcgt agggctccat ccacgattca gtcatcgtgt
173161 tcatctgacc ctcggctagc aagatcgtct tcgtcgcccc atcggtcacc tgccgaggtc
173221 ccaatggtcc cgtactggga aacaatgtcc cggcgccggt caccaagtga tagacggtgc
173281 ctgttcccca actactgctt tccggatgcc gatagaccgc cggtgtttgg ctgtaaagaa
173341 gcatttggtt ttcgccctca ttccagggaa catccttgtc gatctggttg tacaaaccgt
173401 cctcgtccag atacggcaga atcgtcaccc gccacgagtg cagttttcgt cccgcggcat
173461 ccaccgtgta cggggcaggg tagacgccat gatcggcagc gtacgcattg agtgccgagg
173521 cgatcttttc cagattcttg atgctgctca atcgctggcg gcccgtgcga attttcttgg
173581 ccgtccggct gccaacttga accgctgcaa tcaaaccggc tccgatcagc agcaacgcca
173641 atccggccgc gacaaaccaa atcgccgact tgttgcgttt gccaggacga tttcccatcc
173701 gccgcgaacc agcaaactca ggcatcgtga tttcgcgacc gcatacgacg cagtcgcccg
173761 tgcgaccact gtattcatct tccacctcgg tttgcgattg gcagtgtggg caagtgaaca
173821 gaaacgtcat acgatccttt catctaatca gtgatgcacc gcgtgtgcgt ccagccacca



173941 gcagccaaca cgccagcgtt tctgtggatg gcaaaatcgt cggtcagatc gagcgtggat
174001 tggtggcgtt gattggcatc ggtcacgagg acacggaagc aaccgcatcc gcattggccg
174061 ataagaccgc tggacttcgg attttttcgg acgacaacgg caaaatggaa cgcaatgtaa
174121 tcgatgccgg tggcgacgtg ctcgcgatca gtcagttcac cttgctggca gattgccgca
174181 aaggacgccg gccagcgttc accgatgcgg caccgccaga tcgtgcgaac gagctttacg
174241 agcactacgt cagtgaactt cgcaaaacgg gattgtcggt gccttgtggg atcttcgcgg
174301 ccgacatggc tgtttcactg acaaacgatg gccccgtcac catcatcttg gaactgtgac
174361 gcacgttcaa aggtgagcgg cagttatcgc tggggcaggg cagggggcca attcggtcgg
174421 gcaactcgct ggcccattcc cggctgaacc acaccgggct gcgtcgcacc aggctgagcc
174481 ggtgagggcc agccaaagtt ggaaggcgac tcctgcggcg atgtcactcg cggttgccac
174541 tgcggcggct cgcgactttc ccagggcgcg aacaacgacg gattggtgta gtccggattg
174601 atctcatcct ggctcggaac ggcgtagcga tcgcgatcca ttcgcactcg ctggcgaagc
174661 cggtccaaaa tttcgtgatc gccccatgcc aaatagatga acaacccgag ctggccccaa
174721 acaaaaatgt tggccagcgg actgctgccg aagaatttca acgccgtgcc gaacgatttt
174781 ttcactcgca tcgagccacg actgaaatcc aacgaccaaa tctcgtccaa gatcaaatgg
174841 gtcaggaaac caaccagcac cgcgcacgac ttgtagacgc gaaccgcttc gctggcgcaa
174901 ggcatcgcca tgtatgcgat caatcccgcg accagcgccg ctggaatgct gtgccacatg
174961 cctcggtgaa ccgtgaactt acggaaaccc tccacggcaa caaatcgaat cgcgatgtag
175021 acgatcattg cggcgagcgc catcgcttcg tgtgacaacc ccaagtctcg ccatcgatcg
175081 atcatcagca tcggtacgac cgccgccaaa aacagactcg tttcgcgaag cggcacgccg
175141 gaatcgctgt ccaaatccgg caacatcccc gagacgctgc acagcccacc ggccaacagc
175201 gcactctcca tcgacatgcc ttgatcgaac acgccccaat atccataggc gcatccgaca
175261 acggtgctcg tcgaaatatg ggttttgaag ccagccaatg aaacgcgaac ggaaaggaga
175321 cgacgggcgg ggaagggtgg tccgacaaga tttcccgaga caccaaccgc aaggccaccc
175381 aaccacgagg gcggagcagg gcggtcgcgc accgagaacc attggcgtta ccaaattcgg
175441 gcccgcgccc gccaccggaa atttttgctc cgccaattgg gcccctgccc ccgacgagtc
175501 cccatcgccg gcccgaaaaa aattgctgtt cgccctgccc cttccatttt gctacaagcg
175561 tcaaaacggt ccgaatccgt tgcagcggtg aaattgcgct catgcagact tacgacattc
175621 tgatgacggt catcctggtc ggcgcaacct tgttgggcgc gattcgtgga tttgcatggc
175681 aactcgcatc aatcgcttcc atcgtcgtca gctattgcgt cgcgtaccac tatcgcgaac
175741 cattcagcca gaacattcat gcagctccgc catggaatca attcctggcg atgttcatcc 
175801 tgttcgttgg cacgtcgttc gtaatctggg tcgcactgcg aatggtcagc gggatgatcg 
175861 accgaatgcg attgaaagaa ttcgatcgcc aaatcggagc tctgtttgga ctcgccaaag 
175921 gagcactcct gtgcacgatc atcacgctct tcgcggtcac gttgtttggc gaacgaactc 
175981 aacgagcgat cgtcgccagc gaaagcggcc gcctgatcgc tcgcgtgctc gccgagtcca 
176041 actcgatcat gccgccggaa ctggacagcg tcgtccgacc gtacctggac cagttcagcg 
176101 acgaagaact gggcgagcca tcggcctcag aaggatcctg gctctcacaa actccgatcg 
176161 caccaaacat cgatccaaac tggtcccatt cgaacgctcc aacggcgaac aacttcgcgc 
176221 cgcaaaacgg ctttcaacaa gcccaaaatg acccgcgatc gcagtcgcca tttggtggat 
176281 ttggaggaag ctcaacaccg acgcccaacg ctgatttcaa ctctgccgcg ggctcgcaac 
176341 aaccggctcc gacttggcgt caatccgcga cgccgctgtg gcaaacgcca cgacgatgat 
176401 cgcccgccat cttggaatcg cgatgacggc gaacgatcgg ggcggggcgt attcgagatc 
176461 gccggggcac agcgatctgc aacgtcacca agccaaagcc cagacgcgga aaacgcctga 
176521 aacacgggtt ttacgcacct cgaatacaac ataaggggca ttatcggacg ttgcggggcg 
176581 gtcgaaatcc gccctgattc gacgcgtcga tcccatctgc cagccctcgg tatggcgaca 
176641 ccgggcggct cgcgccgctc cgctgagaga cgcgacgact cttcagcgtc ggcctcagct 
176701 ctcttccaag aagacgccga gtttgcggaa cttttcgtag cgttgctcga gcatcagatc 
176761 gaccggcatc tcttccagtt cagacaactg acgcgacaga taagtcttca tccgagtcgc 
176821 catttgatga tggtcccgat gagcgccacc gagcggttct tcgagcacat catcgacgac 
176881 gccgagacgc aacaaatggt cgcttgtgaa ccgaagtgcg gccgccgctt tgggcgcgtg 
176941 ttcgtggctc ttccacagaa ttccggcaca gccttccggg ctgataacgc tgtagtaagc
177001 gtgctgcaaa accgcgactc gatcgccgac gccgatcccg agagcgccac cggatccgcc 
177061 ttcgccgatg acgacgcaga tcacgggcgt cttcagatca ctcatcatga acatgctttc 
177121 tgcgatcact tgagcttggc cacgttcttc ggcaccgatc cccggatacg ctcccggcgt 
177181 gtcaatgaaa caaatgaccg gcaagcgata tttctcggcc attttcattt tcaccatggc 
177241 cttgcgataa ccttccggat gagcgcaacc aaagtgacaa gccgcacgtt ccttgtaggt 
177301 cctacctttt tggtgaccga tcaccatcac cttgaatcga tccagcttcg cgaaacccga 
177361 cagcatcgcg cggtcgtcgc caaagtgttt gtcaccgtgc aattcgacaa attcgtcaaa 
177421 cgctaagttg aggtaatcgc gcgtgtaagg acggttctta tgacgagcca cttggaccgt 
177481 ttgccaaggg tccaacgatg agtacgtctc tttcaatttg gcaacacgag ccaaacgcaa 
177541 cgatcgaatc gctgagtcaa tttcttcact gcgatcagtg ttccgttcca gcgaagcgat 
177601 ttgctcctcc agatccgcga tctcgttttc gaattctaat ccgggtccgg ccatcgctta 
177661 aatttcctct tcttcgtcgc tggtaggttc ggggcgtttg atcatgtcct cgggcgactt 
177721 catggaagtg tcatcatcga agatggtcac ttcaggtcga cgaggacgtt tccgccaaac 
177781 ttcgatgttg cggaagactt tcaagaattc ccaaaccgat ggcggccgag catcgggcga 
177841 cttcgccatc atgcttttga ccaagtccgc aaaatctttg gtcacgttgt cgttgtagac 
177901 aattggactc gggatcgatg ccgacaagtg tttggaaagg agatcgttgg gagtctgacc 
177961 ggtgtaaggc ggcttgcccg tgcaggcttc gaacagcaca catccaaacg aataaatgtc 
178021 actgcgttcg tcgcaaagct tcccgcgaat ttgttcggga gccatgtagc tgcgcgtgcc 
178081 ctggatgttc tttgccttgt agaacatctt gctcaaaccg gttcgcttcg gttcggtgat 
178141 tgtgaagtca atcagcttgg tctcgccggt atgagaaacc agataattgt ctggtttgat 
178201 atcgcggtga atccaattct tcgtgtgcat gtagtagagg ccttcggcgg cttgctcgat 
178261 gattttgtcg agcatgtagg ccagcgattc ggggcccttg cgaagtgcct gtttcatgtt 
178321 catttcactg aacaattcca acaccaaaaa cgggcgaccg ttttcgacgc ggtgatcgat 
178381 catcttgatg atccgcggac tgttcagcgc tttgccgacg ttgtactcat gtttcagtga 



178501 ttgatcccct tcatcaatcg cttcccaaac ttcggcggtc gacccaatac ggatcagacg
178561 agcgagtcgg taaggaccta gaaagtcgcg tgatttagac atcgagcggc tggggctttg 
178621 tcggaaacgg gacgatcgag ttgaacgatc aaaacgattt tgtggcatcg catgatagtc 
178681 caaccgcccc cacagccgaa agacgtgtgc ccgacaccgt aaatcgctcg gcgggaggcg 
178741 ggataccgcc atgggaatcg gcgtttccgc tcgataccac cctgccctcc tcggctcacc 
178801 aacgcagttc caaacacggg accgcccggt agcgaacccc cgttcagcga gcagaaatca 
178861 gatccgcata cagctcgcgg atccgttggg tcatcgtttc gtggcggaac aggtcggtga
178921 atcgagttcg cccctcttcc cccattcgca ggcgaagctc cccgtcgcca accaaacgaa 
178981 tcattcggtc cgccaagtca gcgacctgcc ctgccccaac caaataccca gtttggtcat 
179041 cgatcacgac ttctcgagct ccgtcgatgt catagctgat cgccggacga ccggcgatca 
179101 acgcctgagg caacgctcga gccaggcctt cccggtagga cgcgtgaacc aaaatgtcca 
179161 tcgctccaat catggcgggg acctctgacg gagggaccaa gccggtgaag atgaagtgtt 
179221 ctttcagtcc cagcgattcg atttgttgtt ccaattcccc tcgcaatatt ccgtctccga 
179281 ccagcagaaa ccgaacgttg ggatgccgat ccgcgaccaa cttggcggca ggaaccaaat 
179341 cagcatgacc tttcaaatga aacaatcgag cgattttgcc caccacaaca tgctcgtctc 
179401 gcaagccgta cctttgccga acagcctcgc ggtgatccac cgcatgaacg aatgggtcga 
179461 cattcatccc gctgtgaatc gtcacaaatt tttcgcgtgg tgcgacgccc gcttccacca 
179521 tcaaatccgt catcgcgtcc gcgacggaga tcagcttgtg gcaacgcgaa gcggcccacc 
179581 gttcacaacg aacaaagaag tcatgagcca gtttggattg gtactcatgg aacggtgccc 
179641 cgtggaccgt atgaacgaca acgggacgct tgccggtgga ggctcgtttc aatccccaac 
179701 ccacgtgtcg gcctagcaga ccgccctttg cgctgtgcgt gtggacgacg tcgggatcaa 
179761 agtccaatag agtccgccgc agatcacgcg tggcattcca gtcgtgaagc ggatggattg 
179821 cccgacgcag cgagtcaatc aactcgattg gcaggctgtc cagttcgaac cgccctgccc 
179881 tgccctttga atcgttcgcc gaatctccaa tgtcactgga gtctttgcga acacgttcgg 
179941 agagcaagtc tccttccggt ccaacagcag gcccggtaac gagcaagacc tcgtccccat 
180001 gaaggtgaac gaggtcctgg cagttgtgca gcgtgttttc ttgcgcacca ccaatgatca 
180061 ttcgggtgat cacatgggcg atacgcatga aaggtcttaa gccccgaact ctaagcgacg 
180121 aacgccgagt tgctgttgca aacgattgac gatagcggtg tgcatttggc tgacgcgaga 
180181 ttcagacagg tccaacgtgg ccccaatctc cttcatcgtc agttcttcgt agtagtacag 
180241 gatgatgatc agacgctcgt tgcggttcaa ccctttggtg accaaacgca tcagatcgtt
180301 tttctggacg cgacgagtgg ggtcttcccc cttcttgtcf tccaggatgt caatctcacg 
180361 aacatctttg tagctgtccg tttcgtacca ctttttattc agcgagacga cgccgacggc 
180421 gttggcttcg ctctgcatct tttccacttc tttgacgtcg atctccatat gagccgacaa 
180481 ttcttggacg gttggtgctc gaccaaagcg ggcttccaag gtcttcgtcg cggtggccag 
180541 tttgctggct ttgctgcgaa ccaagcgagg cacccagtcc attttgcgaa gttcatcgag 
180601 cattgcccca cgaattcgag gcacgcagaa cgtttcgaac ttcacgccgc gttcggtgtc 
180661 gtaggcgtcg atggcgtcca tcaggccgaa catgccagca ctgatcaagt catctagttc 
180721 cacgccatcg ggcagacgtt gccaaatgcg ttcgccgttg tagcgaacca attgcatgta 
180781 gcgttcgacc aactgattcc gcagcggctc gtaatcaggc gcatctttcg tggtggattt 
180841 gaaagaagtc cataccttca agatcgggtc gtcagcagtt gcggtagccg gcatagaatc 
180901 ctccgtgatg atcactgatt gctaacttca tgtccgcatc ggaaatccgc tgcgtgtccg 
180961 tcactagcaa tgaaataagt cttggtgtgt ttgaacgtgc gagtcaatca gcatccgtgc
181021 cgtcaacagt ccgccgtcac ttggtttggt tttcggtttc ttccgatgcc gtttcggcga 
181081 caccttcctg aaaccacttc actcgtttgc gatacaaatc ttccactccg tctcggatca 
181141 actgatccgc aattgcccct gccaaacctc cgatgcccat gaagacgatc agagttccga 
181201 tcgcttcgga agcaacgtct ccggccatct ctccatgcca gattccgcga aggacaacca 
181261 gtcccatggc gaaggatcca agcgatagag ccaatgatcg ggtcacgtca ccgtccaaac 
181321 agtggaggca ggccagaaaa taaaaacggg tcgaattctt ctatcccaaa ctcgttaaac
181381 ggacacttca aaaaaagtct caagtgactt tcgtcgatca cgtgccgtcg cagttcagag 
181441 tatcggtcag cttcccgttg atacttcagt caaacttttc gaaattaggc cgcctcgccg 
181501 tagtgcaaag attcgtaccc gtccattgtt tcaaatccca tctcccccaa tccagccgta 
181561 ggcatgtgcg gcaacaattg tgccaccaga cccgaagcgt cagccacggc gatgtcatcg 
181621 gggacttgct gaccattggt caaatagctc agcggcggcg cgacgacgcg atcacgcccc 
181681 gtcaacgcag ccagcacaga tgccatacaa ggtgtctcat caagcttggt cagtgtgacc 
181741 gcattggctc gcaccggtgc gaaaccttct agcgtcgttc gaatgttctc acccgaactg 
181801 gtcgcactga gaaccaagtg tgtttcgtca ggatgcgccg ctcgcagaaa ctctgaaagc 
181861 tgttcaatgc gggcatcgct tcgtggactc cggcccgcgg tgtcgatcag gaccagatca 
181921 acatcgccga gcgcactgag cgccgtttcc atctgttcgg gtttctcaac gacttgcatt 
181981 ggcaagtcca tgatctcggc gtaggccttc agttgctgga ccgccgcgat tcggtaggtg 
182041 tcaatcgtta gcaaaccaac acgccgtcgg gcttcgatac gaaaacctgc cgcgagcttg 
182101 gcgaccgtcg ttgttttgcc aacgcctgtt ggtccaacca acgcgaccac gtgacgatca 
182161 cctggctgag ttcggatggg tccacacagg tttaactccc gagccactgc tcgttgcagg 
182221 tgttccatcc aacgctggtc agactgaacc gctgattcag aattcgcgat tcccgcagca 
182281 aaactggacg ccgaagccat ccatctgcga gcggtggatt cttccactcc agcctgaagc 
182341 aatcgcaacg aaagcgaatc tgcaggactc agttcacccg atgcgtaacc ggcaccgtag 
182401 aaaccgccct gccctgctcc attgcccgcg tttccaaaac cttggatccc ggtgcttgag 
182461 gaatgttggt gtgacgcatc ataattttga tacgccgcat ccatgggatt ctgtcccgaa
182521 gaaaccggtg ctgcgccggg attgattgtg ccggcgttga tattgggtgt cccggtcggc 
182581 gacgtgccat aaccagagga tgccgcgacg gagttgaccg aacggacatc agcgggctgc 
182641 aacggttcgt ccgcgtagtt cgctgaaacg cgactgccgg atgcatcgtg cgaagttcca 
182701 tcctcaggac cgctgtgcaa ccccgcagtg acttccacat gagtccggcc aagccagccc 
182761 atccagccat tgcggacctg ccgcgtatga agcacggaag cttccgggcc catttgatct
182821 cggatgtctg caagggctgc ttgcagattc gctgctcgaa aagttcggat atgcatcaat
182881 cactcaccac gccgtgtgac tggatttttg tgtcttgtgt gatttcgttg tacgacaaca 
182941 cacgaacgcg gggcatcgat cctgcgatga tctggcgaag tcccggacga atccgcggac 



183061 tgtcgcaagt cgagtccacc gcttgaggac tcatccgaac aaaaagtcct cgctcggtat
183121 gttcgattcc tgccgcaatt cggtcttcca ttgctggatc gagcgccacg acgtgcagtt
183181 ggcccaattc gtcgcgatag cgtgtgctga tcgttcgcgc caaacggtga cgaacgtatt
183241 cgctgagcca aatcggatcc ttcgtccgcc ccgcgtgatc acccagcgtc tccagaatga
183301 ttgccaattg tcgaatcggc acatcttcac gaagcaatgt ctgaaggact tgttggacat
183361 cgctgatctt cagcatgcca ggaatcaatt cttccacgac tgcgggagcc gcgtctttca
183421 attcgtcgat caaatgcttc gttgcatcgc gggtcagcaa ttcgtccgcg tgacgacgag 
183481 ccaactcttg caagtgagtc gccaagaccg caccaggttc gaccgttgtg taaccataaa 
183541 tcgcagcttg ttcacggcgg atcggatcaa tccaaaccgc tggttcgccg aacgttggat 
183601 cttttgtcgg ttcgccgtcg atgacaccgg tcgtgtgccc gctgtcgatt gctagcaagc 
183661 gatccggcag gacgcgttgt ttggtcagtg ggtttccagc gattcgaatt tcatattcgt 
183721 agtcgtgcag attcatgtcg tcgcgaatcc gcaccttggg cagcacgaca ccgatgtcgc
183781 ccgcgattga atgccggaca cccgtgatcc tctgcatcaa atcaccgcca cggttcgggt 
183841 ccgccagcga caacaatccc agaccgatgg ccaattccat cgggtccacg ttcaagtagt 
183901 cctcgactcg cttttcagcc ggcgcggacg cggccgcctg ctcggcttct tcagcgtcca
183961 actcatcttg ttcgtcttgt ttgttttgtc ggttcatgac aacggccaaa ccgatgcaac 
184021 tggctcccaa cgtcgccatc ggcagggctg gcaagctggt caaaatcaac aacgacagaa 
184081 atccaccggc aacagcgagt gcctttggat tgctgaacaa ctgctgcaag aactgctcag 
184141 gcaaactggc tttgcgagca ctccgggtga ccaacaaacc cgcggccagc gagatcaaaa 
184201 gcgcgggaac ctgactgacc aaaccgtcac cgatggtcag tttggtgaac agttcggctg
184261 cctccgacac ggtcatgccg tatcgcatca ccccgatgta cagaccaccg gccacgttga 
184321 ccaacgtgat cacgatgcca gcgatggcat ccccacgaac aaatttgctg gcaccatcca 
184381 tcgccccata aaagtcggcc tggttggcga tctcttcgcg acggcgttgt gcttcttttt 
184441 cgtcgatcaa acccgcgttc atgtcggcat cgatcgccat ctggcgacct ggcatcccgt 
184501 ccaacgcgaa tcgggcggca acttcactga tccgagtcgc acccttggtg atcacaatga 
184561 attgaatcaa aacgatgatc acaaagatga tgatcccgat ttcgatcctg tccccggcaa 
184621 caaactcacc gaaaccttga atcacaccgc cggcggcacc gccgctgccg gattcggtac 
184681 tggtcaaaat caaccgcgtc gtagcgacgt ttaacaccaa ccgagcaagt gttgtggcca
184741 gcaacaacga tggaaaaatg ctgaattcaa gcggcgtcga gacgtaaacc gtcgtcagca
184801 aaacaatcac acccacggtg atgttggttg ccaacaacaa gtccatcaaa aatggcggaa
184861 gcggtacgag gatgaccacc aaacaagcga tgatgccaat cggcaacacc aaatctcggt
184921 agcgcatcaa taactgcatc gccggtcgtt ccttcgggct tgagcgtact cttctgatgg 
184981 cggcaccttc ccggtgctac ccaagcccga cggtatcaaa actctttttt cgtcgaaagt 
185041 gaagaatccg actcaagtta ggaagatgaa caaccgaaag agcacacgtc ctttttgacc 
185101 ggaccagttt cacgccgtgg tgccaagcca gagcggcttc aagccttcac ttgcaatcca
185161 gaaaacgctt tttgcagccc tagtttgctg atttcggctt ggatcgcttt tgcatcagcg
185221 ttttccaacg tcaacgagat ttcgtcggcc aacagctgag cgaacagtga ccgggcgatc 
185281 ggcaactgac gcttcacctc ggcggcggac acgctgcgtt cggccaactc agcgactcga 
185341 ccagcaatca cctcggagga ttcccccgaa tggtccgcga ccattcgcat cacatcgtga 
185401 accaaatcat tgatcggctc gtcggccacc acatcggcat cggcatcggc atcgcccgca 
185461 tcgacttcgc ccgattcaat ttggtgagat cgcgttcttt cgcgatgctg ttcacgttcg
185521 agagcccgcc aagtcctctg cagcaatcct tctcgccagt aactgagcca ccggcgtgat
185581 tccgttttgg cggaatccaa atccaagtca cgcggtcggc acgattcgcc tgctttcttc 
185641 gccttgctct caatctcggt tacccgagcc ttcgcggcgg agcgaagccc cagaattaaa
185701 aaatctcgca atcgaccctt cgcgtgcccg gaatacccag ctttgaccaa atgtgaaatc 
185761 agtatcgcca acgcccgatc cgcatccgat tcgctgccca atacatccgt cagcaaccga 
185821 cgcatcggcg gcaagtaacg cagcacaaac ccggcggcgc tgcccgcacc ggtccatgtc 
185881 tgccgagatc gactgagccg aacctccgaa gagccggtca gcgatgaagc cgacagatcg
185941 cccgcattcg ctgacttcgt ttgctcttgc cgagccgaag cgtcagtgcc agtcgaagtg
186001 ttgggatgag taacggccgg aggattcatc ctcgtattct aactcagtcc ggtcgccagg
186061 tagacgtgta gctccccgaa accccgcatc ttgttgccaa attcggttcg caatgacgtt
186121 cccactttca agaaaccagc ctcctctcag tccgttttcc aacgaaccgc caaaccgggg 
186181 gccaattcgg gcgaacggtc tcgcgagttg aatcgagtgg gatctcccaa ccaaccttcc 
186241 tgcgacaaca ccagggtgac gacaccgggg gatgatcgca gaaaatcatc caacgctctg 
186301 gaaacaaagc ggagtccgtc gggctgatcc ttcaggcctt ccccagaatt atcgaacttc 
186361 aattccccaa gcggcaccag atcacccacg ccattggcca ccggactggt ctcgaacgtg 
186421 agcgacgctt ctccagtttc gctccaatct agtgatccga tgccaccggg agtcgtcaaa 
186481 tccgatggcc aaccgtggac caaaatccgt cccacacctg ggtgcgagtt cggagccata 
186541 ctcagcaaca atgcggaacg atcgattcgc ttcagaacgg cgcttgccga atcagctccg 
186601 gcacccgacg gcggtggcaa ctgcgtcaga tcaaacctta gaaaagcgat cgccaaaaca
186661 ttctgatcgt tcgccccatc acgatgagcc ataacggcgg gctgctttcc aaagtcttcc 
186721 ttgcttcctt cgcgaacata ggtatcggca cctttcccag cacgcgttcg gatggtttca 
186781 aaacccaaca gatcgccggt ctgcaaattt tcagtgactt caaattgaga tgtcaacgag 
186841 tccacatgtg cccccgccgg tgattgcaat tcgaagatca catcaaaacg ctcctgagcc
186901 aagttgttgg tcaggaactg aatctcaatc gattgcgtct cgcctggccc cggcaactgt 
186961 cttggcgaaa acgatcgggg aaaagcggga gccttgcgtt tgagctgccc ctcttgctgc 
187021 aaataggtgg tgatgcgagc cacctgcccc gcttgcgaat agcgaatccg cggatcgctt
187081 tcacgtgagt ccgcctgatt ggtcaactgc aaccgaaacc tggccatttc gccaacggga
187141 atttccggat cagcaatgac cttccctgtt aactgcaatg cgcccagttc agcagccaac 
187201 accgcgggtg aggtgtcggg cgtcgaagcc gacttcaccg attccatcga cgtactcttc
187261 gaagttccat cgcgaagcat ccaccacact ccaccggcaa tcagtgcagc aacggccaaa
187321 ccaacggccg caaagaacgc tttgcctgtt ccaccacttg ttgtctttgt cgtcgcagcc 
187381 ggttctgccc ccgctggcac cgacgagccg ttgctcgaat tggcatcgat atcgattgcc
187441 aatcccgctt gcttggattg ttgaatctgc tccgacaccg ccaacaatgc tttgcgaaag 
187501 tgccccgccg tttcgtatcg ctgcccgggc tctttggaca acaaacgcat aatcagtcgt 



187621 ggttcacggc agagaatgtt gaccagttga gtgggcgtgt tggaagcaag caatggaact
187681 ttccccgcac acatctcata catcaccacc ccgagtgaat agagatcggt gcgttcgtca 
187741 acggtctcat tgcgagcctg ttcgggtgac aagtagcccg gtgtgccgac gacgctgtca 
187801 ccgcgagcca agccttgcac cgcagatcct gcgaccgcca agccgaagtc caaaatcttg 
187861 acccgttcgg aaggcgattg aatccacaag ttcgccggct tgatgtctcg gtgaatgatt 
187921 cctcgatcgt gggcggcttc caatcccaag gcgacttgat ccgcgatcgc gatcacttct 
187981 tccgagctga attggcgacc ggattgcagc aactgctcca gcgttcctcc ctgcaacagc 
188041 tccatcgcca agaatggtgt gccattgcgt tgcccaactt cgaacagcgt cgcaacgttg 
188101 tcatgatgga ccgcggccat cgagcgagct tcctcgacga tccgcttgcg actattcggc 
188161 gatttcgcga atcgctcgtt catcactttc agagccacgg tgcgctgcaa ccgagtgtct 
188221 tgcgctcggt agacatgccc catgccgcct tggcccagca attcgatcac gcgatacggc 
188281 cctagtcggc ccaattcgcc agctgcttct gggggatcca gaaactcagg gggtgaatca 
188341 ggtgcgggcg gtgaatcggt catgaggtgc cgggttgcga tgggaaacca atgcattcca 
188401 ccggtcgtcg atgcggttga ccggctggcc agccgccccg cccaactcgt tcggcagaat 
188461 ccaaccctgc agggtaaaca ctggcacatc gcccaacaac tcttccgcca cggaaatctg 
188521 tccccatgga atccatgccg ggtcttgata cgattggccg ctggattggg ccgagaccac 
188581 cgccctgccc tgtcccgctc tgaacatcta ttcgccttct acccgcgacg agcccgtgtc 
188641 acgaaccgct atccatgatc tcgccgacgg aatgacgttg gcccagccgt tccaagctgc 
188701 tgacaaacaa ctgcgagtca atcgtcaggg tggcaagtac atcctgctga agttatccga 
188761 ccggtcgggc acgatcgtcg ggatgatgtg gaatgcggac gaaaccacct tcgacacctt 
188821 cgaccgcggc gactacatcc attgcaccgg tcgaacccag gtgcacaacg gttccctgca 
188881 ggtgatcctc accgagatcc agcgcatgga cccggcggaa gtcgaccaaa gcgatttcga
188941 ccgcttcgat gccaaccaag cggacaacaa cttcacgcgt ctgagagagt tgctggccac
189001 gatctcacaa cccgcactgc atgcgttggc tcagtcatac ctgaacgacg attcgtttgt 
189061 tcaatcgttt cgccaggccg ccgccgcggt ctccaatcac catgcctatc cgggcggact 
189121 gctcgaacac accgtcaaca tgatggaatt ggtccgactg atctcaccgc gatacgccaa 
189181 agtcgacacg gacctgttgt tgatgggagc cttccttcac gacttgggga agattgatga 
189241 actgcgagcc agcggcgaga tcagctacac cgaccgaggt caactcgttg gacacatcgt 
189301 catcggcgtt cagcgtttgg gcgaaaagat cgcagaaacc gaagcctcta ccggccaaaa 
189361 gttttcgacc ggactgcggc accaattgga acacttgatc gtcagtcacc acggtgtctt 
189421 ggaatacggc agtcccaaga ttccagtcac gctcgaagcg gttgcgttgc atcacatcga
189481 caacctcgat gcaaagctcg cttcgtactc ttcgatcatt gattccgaca tcgctgctga 
189541 tggaaactgg acgaactaca ccccatcgat cggacgcaaa ctttggaaag gcgactcgtg
189601 aattccacct cgcaatcaaa aacggaatcc gcccaacaag atgctggcaa agccgtcgtg 
189661 ttgctgtccg gtggactgga cagcgccacg tgtgtggcca tcgcccgcga ccaaggattc 
189721 gaagttcatg caatcagctt ccgctacggc caacgccatg acggcgaact tgatcgtgca 
189781 gcaaaacaag ccagcctgat gggcgtcgta tcgcaccgcg tgatcgacat tgacctggca 
189841 cagcttggtg gatccgctct ggtggactcg tccatcgccg tcccaaaatc ggatcatgtc 
189901 gacaaaatcg cgggcgacat cccggtgacg tacgtcccgg cccgcaacac aatctttctg 
189961 tcgtacgctt tggcggttgc cgagacgttg ggatcgcgag acatcttcat cggcgtcaat 
190021 gcactcgact acagcgggta cccggattgc cgtccggaat tcatcgacgc atttcaaacg 
190081 atggcacgct tggccaccaa agcaggtgtg gaagacgaac attcgcttac cattcacacg 
190141 ccgctgcttc actggaccaa ggccgaaatc atccagcgtg gaatcgaact gggagtggac 
190201 tactcacaaa cgctgtcctg ctatgaccca caaggcagca gcgacgaaat gcgtccgtgc 
190261 ggtcaatgcg acgcctgttt gctgagagcg aaaggcttcg cagaaaacga gatcgccgat 
190321 ccggcaatcg gctgagacaa cgatctggat gacgtggact tggttgcgat gtggcttggg 
190381 cattgagggc tgcccgacgg aacagttgag gacaactgtt ctactctaga ggtgcccatc 
190441 tcaacgcggt cgcgattcaa cgacattgcc ccggggagga ctcggtgagt ggattgcaaa 
190501 gtgcaaattg atcaatgcaa attgcaaagt atttccggct cggcaactga cgtgacaacg 
190561 cagacctgca actgccaatt tgcaatgtcc actttgcaat ttgcaatcgc cccgtctcgc 
190621 cattcctctc atggcagtca cgccagaccg ccccggacgg agccgtcgtc cttagctcgg 
190681 ggtggaaacc ccgagaccga acgagaagaa ccatcagtcg ccccggatgg ggccgtcgtg 
190741 agcgttcccg cgttgttgaa aagcacgacg cgaacggttg gaactggact tggttgcgat 
190801 gtggcttggg cattgagggt ggcccaacgg aacagttgag gacaactgtt ctactctgga 
190861 ggtgcccatc tcaacgcggc cgcaattcac cgacattgcc ccggggagga ctcggtgagt 
190921 ggattgcaaa gtgcaaattg atcaatgcaa attgcaaagt atttccggct cggcaactga 
190981 cgtgacaacg cagacctgca actgccaatt tgcaatgtcc actttgcaat ttgcaatcgc 
191041 cccgtctcgc cattcctctc atggcatcac gccagaccgc cccggacggg gccgtcgtga 
191101 ttgctccggt atcgtccagc cgcacgacgt cagcggttgg acctggactt ggttgcgatg 
191161 tgccttggct gttgagagtg gcccgacggg acagttgagg acaactgttc tactctgctg 
191221 cggtccattt caacgcgatc gcgattcacc gacatctctt ctactgaggt ctcggtgagt 
191281 ggattgcaaa gtgcaaattg atcaacgcaa attgcaaagt atttccggcc cggcaagtca 
191341 catgatttca ccgacctgca attgccaatt tgcaatgttt actttgcaat ttgcaatcgc 
191401 cccatcgcat ctcgcgtgtt tccgcgttag ccccccgcag gttttcccta cttcaacagg 
191461 aacccgttga gattccaacc ggtctcggtg gtttcaacaa atccgaacgc gttgcggaag 
191521 tacttttcaa cctcgctgaa cgctggccat ttactgacgt tcaaacgctc gatgggttcg 
191581 tccgctttgg gctcgtcgcc ttcgaacatc cgccggacca acgacgccaa gaccgagtca 
191641 ctgttcttca attcgccttg acgttgcagt tcgtacttcg cacgcagcga ctgacgcaga 
191701 cgcaccacgc gttgaaaagc cacttcatcg gctcccattt cgctggtcag tttcaagagg 
191761 tctttgactt cgtccaagtc ggccagcttg tcgccctcgc cggtttgcat tcgcttcgcc 
191821 acatcgacca accattccac atggctgcta aacaccaggt gtgacttttt ggaacccttg 
191881 ccagcaggaa ccatggcgat ggcccaggtg ttcaagagcg gagtttgctc ttcatcgatt 
191941 tcttcttcga aacctaggtc atcgaagaac tcgtcatcca gctcgtcatc accactgccg 
192001 tgtttgacct gccaaatttc aacgccggga acggtgtcaa ctttgaagac gtcgggttcg 
192061 acctccatcg ctttgcgaac cgccgttccg atcgcatcag cattgagaac ttcgatcgca 



192181 tgctcagaaa gattcggcaa aacgttcttt tcaatgtcga tttgcggccc ttcctcatca 
192241 tcgcggatgc cttcaatgct gggccgaaag atttcattgt cgaacgcttc atcgaccaaa 
192301 gtctcgaccg cccaaaacgc ttcttcgatt ttccagttca ctcgcgtgat actggctgta
192361 ttggcaggaa cccatgccgg aattccttcc agcggcgtgt ttggaaattg gagcattcga
192421 gccgccaaac gatagcggct gggtttgtcg gtcaccggag gagcatgcac gtagccacga
192481 tgcagcaagt cgaatttgcc ctgacccacg accgcgatac cgccagctgc ttgcaaggca
192541 tcgaaccctt gattctccaa cagattcaaa atgcgaacct tgttgcctcg atccactttg 
192601 gcgacgtcgc gaatgatccg gcccatcgtc aaaggtttga caaaccattc gacctgcgaa
192661 ccttcgactt caatttcgct acgtgaaaga gcctgcttcc aaacggttcg caacaatggt
192721 gccttgtcca aggaatcttc ccggccatcg tttgcaatcg cgtccagaat ttccagcacc 
192781 accgaatcac gatccgatgc gatcacgcga tccttggtca gacagacaac aacctgttcg 
192841 atcttcagtt gcccaggtcg cgtctttggt cggtacacgc gtacggtttc accagcatga 
192901 gtgacatcgg tccgggtcgc gccgtcggct ttcagatctt gatcaactcg cttcaagaca 
192961 gcttccgctt ccgcgtgctt tccgcgaatg tcggcgacaa cgcaaacgga agacggtcgg
193021 cggttgtcgt tttcaaacgg caaccaaacc gcgatcactt ctccggtcgc gatctcgtac
193081 agttccttcg gacgcaatcc gactttgtca cccagttttt cccacacggc accttggctg
193141 ccgaaattgg catcgatcaa tggcttcatc gagggatcat ccgtgatgcc agacagagtt 
193201 gtcttcttcc acgaatcaca cagcacggtc aggttgggaa tacgtactac accagcggcc 
193261 gtgtcgggca acaacttcaa cgccgtcatc aaaacgggtt tggcaccgcc ctgctctgcc 
193321 ccggcttgct tcgcaggccc agcgttcttt tcgagcccgg cttcctttgc gggtgccgca 
193381 tcttgggcga tcgaatgtga tccaatcaaa cacacggcca gaacggctgt catcacatgg 
193441 agcaccgcga ggcggcaagc ctgtgttcga ccgtcgaagt tggaaaaata tgttcgcgtg 
193501 ggattgatca ctcgtttgtc cgccacttgc acccgttgaa aattcattca agatccgtct 
193561 gaaatcatcg gaatccatcc gagcccgcgt ctctgcgcga ctggctcagc cggccgatat
193621 cgtaaaccct tccatttcgg gccgaaatgc cgatgtcacc atccatcgcc tgatttgtgc 
193681 ccttgcaaga gaccaaaatg accgaaacgc aggaaactcc gtaccaaaac cccctgattg 
193741 agcgttacgc gtcacgagaa atggcgtttc actggggtcc ccagcggcga ttcgcatcct 
193801 ggcgaaaggt ctggattgcg ctagcggagg ccgaacaaga gctcggaatc gcaatcaccg 
193861 atgcccaaat cgaacaactg aagtcgttcg aaaacaaatt aaatctggac gcggcggcca
193921 aatacgaacg tgaacttcgc cacgatgtga tggcccacgt ccacgcttac ggcgatcagt
193981 gcccggacgc ccgtgggatc atccacctcg gcgcgaccag ttgtttcgtg accgacaacg
194041 ccgacctgct gctgattcgc gaagcccttg agctcactgc taaacgcttg gcggccacga
194101 tcgaccaaat ggcaaaattc gcggccgagc accgcgacct gccctgtctc gcctttaccc 
194161 actttcaacc agcccagccc acaacggtcg gcaagcgagc atgtttgtgg atctacgacc 
194221 tagttcttga tctcgaagcg atcgaacatc gtctcgaaac tctgcgtgca cgctcagcca 
194281 aaggcaccac tggaacccag gcgagctttt tggaattgtt ctccggcgac cacgacaagg 
194341 tccgtgcgtt ggaaaaacga atcgctgaaa agctctcgtt tgagtcggtc tacgccgtca 
194401 ccggacaaac ctaccctcgc aaagtcgacg ctcaattgct cgacgccctg tccgggatcg 
194461 ggcaaagcct gcacaagatc gccaccgaca tccgcttgct ggccggtcgc aaagaagtcg 
194521 aagagccgtt tgagaagaag caaatcggca gctcggcaat ggcctacaaa cgcaacccga 
194581 tgcggagcga gcgaatctgt gcactcggtc gatttgtcat gagcctgcaa agcagccccg
194641 cgatgaccgc agcgacccag tggatggaac gcactctcga cgacagcgcc aaccgccgct
194701 tggtcatccc gcaagcgttc ttggcaatcg acgcggcact cgtgttgatg caaaacgttg 
194761 ccgacggaat ggtcgtctac ccagccacga tcgcaaaaaa cttgggtgcc gagcttccgt 
194821 tcatggcgac cgagaacatc ttgatgcaag cggtcgctgc gggcggtgac cgtcaagatt
194881 tgcacgagca aattcgcgtg cacagccaag cggcagcact agaagtcaaa cagaacgctg 
194941 gcgacaacga cctgctggaa cgtctcaaag gcgacgagaa cttcgccggc atcgacttgg 
195001 aagctgcgat tgatccgcac gcttacgtgg ggcgagctcc ccaacaagtc gacgagttca 
195061 tggaagcgat catcaccccc atccgccagc gttactctgg cagcgacagc ttgagcgttg 
195121 aagtcacggt ctgatttcga cgggcgactg cccctaaacg caaagatctc gccggcgtgt 
195181 catatcgccg gcgagatcaa atgtctttct tggatcgtct gttctagaaa caataagttt 
195241 tagaacaaga agatcgtgtc aatgccgaat gtggtttggt cgtcttggtc gttttccaag 
195301 atgccggtca cgtcggctcc gtctgcccag tcccagcgaa tttcgggacg gatcaagacg 
195361 ttggcgtgcg gtttgacgtt ggcacccaag gttaatgcgt agacgtcaac gttgccgttg 
195421 aagacacccg cgtcggcgtc ataccactcg aagcgactgc ccaaagccaa acaatcgctg 
195481 accgtgtaga tcaggtagtt gttcaggccg taggtttctt tgaagttgcc accattgccg 
195541 ttttctgcgt ccatgtaatc cgactgcaag acatactgca gttgatcggt caacgcgatg 
195601 tcggtgacga tcgagtgcat gtagccctgc gacgtggtgc cggctggctc atccgcaaaa 
195661 cgtccagcga cagttgcgta ggtcacagtg acgtcgtcac tcatggctaa cgagacaccg 
195721 cccaagaacg agtcgccgtt gtcttcgaaa ccgctgtccc aacccaaggt gtaaccaccg 
195781 taaacggtca agtcatcgct agcgttgtag gtcgccaatg caccggtgtg ggtgaagggc 
195841 tcgctgttgt acatcgtgta agcgtggctg tagaagaagt tgtcaggagc ggcaacaact 
195901 tcgtagccaa tgatcgtgaa gaagtgacca aacttcaccg acaggtcgcc gtagccaact 
195961 tcaccgtaga cttgtggcag tgcacttccg tactcgccat gcttccaacc ggtgtcccag 
196021 tggccattgt cgatgccgaa cgcctgtgtg tcctgagcgt cgatgccgta gacgtaatcg 
196081 atgcgtccgc cgaagtcgaa accgttcgag gtgtcgatgg ccttctcagc gtagaaccaa 
196141 gcttgatgca ggttgtattc gtcgggacga ttgttgaaca aaggcaatgc tttgttgtgg 
196201 tagcccattt gaacccaacc gccgacggag tagccacaag cttcaccgaa gagcgtgaac 
196261 ggatcgccga ggtcacaatc aaacatcccg caaccgccgt catcaaacag gccgccaccc 
196321 atgtcgccac agcccatgtc gccgcaacca ccatcgcagg ccaactcacc gcagccacat 
196381 tccatttcag gaaatgtgtc acaggattca tcacagctgg catcaaagtg cccgactgga 
196441 acgacttcgg cctgactggt tggcattccg ccctggtaac ttgcgtcggc cacgaatcgg 
196501 tcggcagcag aagcgccatt gaggcttccc aatccaacca agccaccgag ggccaaccac 
196561 aaagcagctc gtttcgattt catcttgctc gtccctgatc gatttcaaat ggattcgccg 
196621 cgtccacagt gactgccacc ttccatggca gcaaaccagt caaccgtgtc gaccgccaag 



196741 ccatagaaac cccagcccga atcaccggaa aatccggcac agcgttaaac tttgcgggtt 
196801 aggggattga tccggcgatc cacgaggcgt cggccgatcg gcctgagatc ttgagcgcaa 
196861 acgaaaaaaa gccggctcat cgcatttctg cggtgaaccg gcctcttgga tctcacgaaa 
196921 tcttcgttcg gggcagctta gaacgtgaag atcgtatcga aaccgaaggt ggtttgatcg 
196981 tcgtcatcat tttccaagat tccggtcgtg tcgccatcaa cccagtccca gcgaacttcg 
197041 ggacgaatca agacgttggc gtgaggacgg tagttgatac cagtcgtcag agcgtagacg 
197101 tcgcttccgg tgttgttgcc atcatcgaag acgccttcgt tttggtacca ctcgaagcgg 
197161 ccacccaagg acaagcaatc gttgacgctg taaaccaagt actggttgat accgaacgtg 
197221 tcacgtgcgg tggcaccagt ttcgtcttcg gtgtccagca agtcggactg gatgatgtac 
197281 tgcaagttgt cactgacagc gtagtcagca acgatcgagt gcatgtagcc ctgctcaact 
197341 ccaccagcgt tgtcttcgcc gaagcgaccg atggtggttg cgtaagtcag agtcaggtcg 
197401 tcggtcaaac cggccgagaa cccgcccagg aagttgtcgc cattgtcttc gaatccgctg 
197461 tcccagccca tggtgtatcc accaaagacg ctcaggtcat ccgagacgtt gtacgttgcc 
197521 aatgcaccgg tgtgggtgaa tggctcgctg ttgtacatcg tgtaggcgtg gctgtagaag 
197581 aagttatccg gagcggcaac gacttcccaa ccgatgatcg tgaagaagtg acccactttg 
197641 accgacaggt cgccgtagcc catttctgtg tacagctgag gaatggcgtg accgtagtcg 
197701 ccaccgttgt cccagtcgtt gtcccagtga ccgttgttga tcccgaacgc ttgcgtgtca 
197761 ggtccgtcgg taccgtagac gtaatcgata cgtccaccga tgtcgaagcc acacgaagtg 
197821 tcgatggctt tttcagcgta caaccatgct tggtgcagct gaacgttgtc gggacggctg 
197881 ttgaacaaag gcaacgcctt cgtgtggtag ccgatcgacg cccatccacc gacttcgatg 
197941 ccacaaacat cgccgaacag cgtgtaaggg tcatccaagc agcagtcgcc caaaccgggc 
198001 aagcagcttc cgaggccacc gccgcagagc gaatcacaac cgctgtcgca acctgcgttg 
198061 cagccacagt cgcccaagca gccgacggtt tcgcaaccgc aagtcacttc gttctcgcaa 
198121 ccggtgtcgc agccgccaac aggctcacaa ccacacacgg gctcaccaca atcgcaaagc 
198181 gacgaggtgc aagagaccaa tccaatatca tttgctgaag cgccggtcag gttcgtggcg 
198241 ttaacgccgc atgcaaccgc tgcgatcaga gctagtttac taagcttcat tgctttctcc
198301 gttgatatcc caatccaatt gggtcgtttc gtgtcaccaa gattcaaaat ggatgctgtt
198361 agaacgggaa gctgtgcccc caaaatgtgg ggagggccga tctctcgctc acaacgatga
198421 aacaaacgca gtccatcgct ttttcaacgc acacgctgcc tgccgccgat ggcacatcga
198481 aacttccagg caatcgcttc ggaattcgaa tcaatcacct cgaaagattt cgagtgagtt
198541 gaaagcaatc catcatgtcg tcgggagaat tcatcggtgg gtttgcgaga tctgtcctca 
198601 accggtgctc aaaacgactt agaaaccgcc cgcggggtgc gcaacgcgcc tgttaatcct 
198661 catctgcggg tcgctcccaa cgttccgtct gggcaaactt tgacgcttcc cgacccgaac 
198721 acgccccgca tctgaaaatc ctgctttcaa taaaacgttc cacccgctat cctgcggggc 
198781 ttctccctga aatcagggat tttgaccgtc tcaaatcaca ttcatgacaa tcaccgcgag 
198841 cggtggcgga gaatcagatg gcacgacagt gcgaagcgtg cggcaaaaaa gtccaaatgg 
198901 gcaaccgtat cgaaacccgg ggtaaagcga agtacttggg cggtgtcggt accaagatca 
198961 ccggctgtac ccgccgtaaa tttgtcccca acctgcaaaa agttcacgtg acattgccaa 
199021 acggcaccaa caaatcgatg cgagtttgca cgcagtgcat ccgcagtggt gtcgttcgca 
199081 aaacggtgaa aacaaaaccg tttgacgtca gcggcagcaa aaagtcctga tgtccagttc 
199141 cggcagtagc aacggcggct cggtcgacat ccaaaaattg gctcgactgg ctcgccttca
199201 gcttacggag caggagcaaa ccgatttcgg tccgcaaatc acggacattc tcggtttcgt
199261 cgaacaattg tccgaactcg acacttccgg tgtcgagccg atgacgagtg cgctcgacgt
199321 cgagaatcga ttccgcgacg acgttcctga ccaaagcctg tcagccgaag tggcgacacg
199381 cacggcgccc gcctctcaag atggcacgtt tctggtgccg ccggtcctgg gaaacacttc
199441 ttccaagaag tgaagcatca cgagcaacgt ttgcgggaag gattcatttc caggaatccc 
199501 gcgaacttcg ggccggaagc atcgctgcga acttggcgga caatcgataa atcgctacgg 
199561 ctataaatgc cactcatgcg tgaatgatgc atatggattg gacaaaaacc caggctgctt 
199621 gatcacgcaa gcgaagcgac acctagattt cattcgggaa cgacgaaggc accagacgcg 
199681 aatcttttat agatcgacga ccgtatcgct cgccgaatcg tcctgatgct tccgcaactg 
199741 ctccatcgcc gccaaagtag cgtcgcaata cgatgagacg gctgttcccc aggcctgttc 
199801 tttcgtcgcc tggcgagctt gctgcaaaag ctggtcgacg ttttgccagt ccaacatcca 
199861 atttcgctct tcggaagtct gacgaagctg agtgatcaca tcgtccattc gaagcaagaa 
199921 cgcttcatgc gaattggctt taaaacggcg gtaaggcccg gtccccaaag cggtttcccc 
199981 gcttccgata tcgagcgatt cacccgaatc ccaggcatca ccagccaagg ccggatcgct 
200041 cgaaccaccc tgccctgccc cggactgacc gggcatcccg gggttcgatg acgcacctgc 
200101 cgaattccgg tcccgcacgc gacgcgtttt cagcggtcgc ctgcgttcat tcggcttgaa 
200161 gactccactc gcccaaaccg cggccgaaat tgaccccagg atgaacgcaa cgattgccga 
200221 gcccacaaac cgcgggctga ccagagcggc cgccaaacca aaaccgactg ctcccgtcca 
200281 acaaaatgcg gtggttccaa tgagcgcccg agaggaaacc tcatccgctc ccagagaacg 
200341 ggctttccgc ggcgcaccgt cattctgagt cagatgctca tcatcgactt caaccacgat 
200401 caacgtgata ttgtccggac cgccccgtaa attggccagg tcgatcatca cccgtgccgc 
200461 tttatcgaca tcgagcgaat ccagcagaac tgcgagctcg tcgtcttcga tttgcccagt 
200521 cagcccatca ctgcaaacca tgaagcggtc gccaacttca aggtcaaacg gcccctccag 
200581 atccaccaac acttctggcg aaggccctag cgaccgagtg atcacgtttt tgggaattga 
200641 cttactgaag acactgtcgc tgtgaacttt gccgctggct tgcatttccc aaaccagcga 
200701 gtgatcaaac gtcaattgct caagcacgcc ctgccgaagt cgataaactc gcgaatctcc 
200761 gacgtgtgcc acataggcgg caccattgac cagaaccagc gaactggcgg tcgtccccat 
200821 gttcaggaac tcgggattgc tctgtccgcg ctcgtaaatc gcggcattgg ccagatgcac 
200881 ggcctcggaa atcgcgtgct caggagcctc ttcccgcgat tgatagtagt gcatcgaaat
200941 gcgttcactg gcaatctgcg acgccaactc accggccgcg tgggctccca ttccgtcggc 
201001 gacaacgaac aggtggccgc gttggttcag acgttcggcg gactcagcga ttaaaaccgc 
201061 cgatgaatct tgattgttcg cacgtcgcat cccgacatgg gtctgctcag cgaacttcac 
201121 tccggggcgc catgaacctg acaaacgatt tgtgaccgag aacttgtgac cgagaacaca 
201181 ttcaaccgac gaacttacac gcctcttaca gactacttgg caaattgtaa gtcgccacgt 



201301 ttgggcgtca cgcggatcgc tggatccaca aaaccgacca agtaatgagc tgacactcgt
201361 ttcccgaccg ttccgatgca cgccgctcag gcaggggcga tcaactccgc agccaatcga 
201421 cacgcgtcca agaactcgtc gaccatcagg tactcgtcga ccgtatgaat ggacgcctga
201481 ccacacccca atgtgaccgc ttcgattccg tgcagcatca accaattcgc gtccaaaccg 
201541 ccattggcga cttcacactg tggagagcgc cccaattgcg aaatcaattc cgttgccgcc 
201601 agaactgacg gatgatcctc cgctaatcga aacgcttcat aatcgactcg gctggtgaat 
201661 tcaaagctgc ctgttcggcc ctgaacgtcc ttgaccgact gaaccgcatt ttcgatcgcc 
201721 gttcgcatct gagacacgat ttcggcccga aactcggcat cgtgactgcg agcctccgcc
201781 cgtagctgta cttctggcgt gatcacgttc gtcgcgtcgc caccctgaaa aacccccacg 
201841 ttcgcggttc cgcgacgccc gtccttttca accagtccca gccaaccgtt gcgatgcaga 
201901 ctgctgatcg cttcggatgc aatcacaatc gcactgacgc ctttctcggg agccacgccg 
201961 gcatgagcag cgtggccgtg aatggtcacc tgcactcgtt cgccaccgat cgctccatga
202021 cggattttat cgagtgctcc gccatcaaaa ttgaatgctc gatcaacatg gccgactttg 
202081 gacacgtcca aatgacgagc cccctccaac ccgacctctt cctgaatcag gaatgtgact 
202141 gcggcaggag ccagtttcaa atcagggttc tttgcctgcc gtgatagtcg ctcgacgatc 
202201 gcactgagaa tcaccgcgca accgctgcgg tcatcggcac ccaagccggt cggcccatcg
202261 gccacgacaa ttcggcccaa ttggtcatcg tctcgaagct gtggattgca accaacacag
202321 atcggaaccg tatccatatg agcggaaagc aacgtccgcg gcaacgaatc gtcgcctggc 
202381 aacgtgacaa tcaaattgcc cgcgttgcca tccaaacgag tcttggtgcc agcgtcatcg 
202441 gattgaatcc aatccggatc cactcccgca tccaccagca ttttctcaat cgcggccgaa 
202501 attgcggctt cgtcgccact cttaccagga atccgcgtta gctcgagaaa gcgatccaag 
202561 gcagcggact catcgatcgc agcccaagac atagcaccct tcccgccatc atttatcgct 
202621 tcggactggc tcatcgcgtg cgattggcac ggcgaggcga ctgatcttcc acacgaattg 
202681 gctgtgctgc ggtgccagtt tgcgactggt ttaaacgtcc cattgcgaac cacccaacgg 
202741 ccaacagcaa tccgctggcg gcaaacgagg ccaacattgg aacaacgtga gcgggttccg 
202801 tggcgtagtg agtcaaacct tctgtgctgc catgccctgg gtgagccgaa gcgggaacgg 
202861 caaaaaccaa agcggcaatc gagccaatga gtgaacgcga caaaatgaac tcacaagaat 
202921 caagagaaat gagaagagcc gattcaccgg cggatttcac cttcggcgac caaacccaat 
202981 gcacctgcca accgatgagc gggatcgtgg agccggatca gccgactgat tatacgccct 
203041 ggtcgagctg ttgttgacgc atccgtcgac gttttcgctg ttcgtaggtc acttccgcgg
203101 cacaacgcga acaagccacg ccacgttcgt aatcggaatg ctgcttcatt tccggggtca 
203161 acggccatcc acatccgtga cacaactcat acgccccagc ctgcaattgg tgatcgaccg 
203221 cgacacgatt gtcgaacaca aaacattcgc cctgccaacg agaatcctct tcgggcatct 
203281 tttccaaata attcagaatc ccgccgcgca ggtggtacac ctcttcgaat ccgcgttcct 
203341 tcagatacgc ggtcgacttt tcacaccgga tcccccccgt acaaaacatc gcaactttgg 
203401 ggtgcttctg cgggtccaag ttttcttcga caaacttggg gaattctcga aaggattccg 
203461 tatgagggtt gatggcaccc tcgaacgtgc caatctcgat ctcgtagtcg tttcgggtat 
203521 cgatcagcgt gacatccgga tcatcgacca aagcgttcca atcctggggc tcgatgtagg 
203581 tgccaaccga tcgcagcgga tcaatgctcg acacgcccat cgttacgatc tcacgcttca
203641 gccgaacttt ggttttgcga aacgggatct cttcgcagta cgaccatttg acatccattc
203701 cgcgaaaagg agtggcttgg tcatcaagct ccatcgaccg gagccagtcg ataaacggtt 
203761 ccatcaactc tcgaggtccc gcgatcgttc cattgatgcc ttcgccggcc agcaacaacg
203821 atccccgaat gccatcatct gacattcgct ggcggaacgg ctctcgcagc gattcaaact 
203881 gtggcaatgg agtgaagcaa tacagtgccg cgacggccac cggcaaatca gcggcaccag

203941 cctcggcagg cgattcggcg ggcaattcgg cgggtacgga agaagcgtgt gattcggtca
204001 tgaccgaaaa cttaccaatt catttgaccc attgcagcca agcatccacg taggcaatcg 
204061 gtgtcatgtt catccccgcc tgaaatgcct gcggaaacgg ctgccccgat tgcaggtttc 
204121 gcaacacggc gtcgaagccc cggcgattct gtctggacaa caaggactcc gcgatcgcga 
204181 cggccatccg gtcggctcgc tccggcggca gctttccgtt gaggaaatcc ttgcccgact 
204241 tcaagctccc caccgcggtg accaactcgg cctgccgtcg catattctca tttcgatcgc 
204301 gtttggattc gccacccgat attgcggtcc caagtccttg agcaaaccat cggggcacgc
204361 cttcgccgcg gctgaccaca gccagactgg caaccggagc cacgacccgc tcggcgattt
204421 cctcttcgct cgctcgttcc gatgcgacga ccgcgacata cgcgcggata ccgtcgtact 
204481 tccagtgaga ctgccattcg gcaggaacgt tcctaccttc gaccattttc gcaaactcgc 
204541 tgtagtcata ccgcttcggc aaaacataga tcgaagcttt gccatggaag aaatcctccg 
204601 gcgagccatc cgcctggggc aggaacttac gagcctgctg cagtgcggct tccgcggact 
204661 ctgcaaccgt ttcgagcgtt gccggcgaaa cttcccccca aaccgcgaag tgatcgctct 
204721 ggtgattttg catgcgattg acatcggcac ccgccaaccg gaattcatct tcggcaattt 
204781 ccgcacgccg ctgactgacc tcggaggaag ccgccgcgga caaccaagcc tgagcagtga 
204841 cgaccttcat tggagtttgc acaagagctt cgtccacttt ggcaccttcg ctgatccacg 

204901 ttgaaatcaa ctgaatatca ctgtcagaaa gaggaggacg cccaccggca ggcattcgat
204961 cgccagcact cccgcgaagt ttctgaacta gcaagcttgc ttctccgtca ccgggcttca 
205021 tcaccgatcc gctgtcaccg ccgcgaagca tttgagcaag ggtgtccatc cgcaaaccgc 
205081 cacgggtctg catcgcgtcc aagtggcaac cggtgcagtt tttaaccaac agcggtgcga 
205141 cgtccttcga gaacgatacg gtttcattgc cggttggctt cccgaccatt gcgggctgag 
205201 cgggctctgg cgttgcgggc gtgacgggag gctccccgcc gctcttcagg ctggtcagca 
205261 tcgcggtggg atcggggccg tcaaacttgg ctccttctgt cacccattgc ttcaatttgt
205321 ccaattgctc cttactgact tttcctcctc cgcgaggcat gtcgccggtt tcgatcgttt 
205381 cgatcaatcg actggcgatc acgtcaccag gaaaaatcac gactccctcg gggggccctt 
205441 ccatcaactt ggcaaacgtt ggcatcgcga acccaccttt gctgccgcga atgtgacaac 
205501 cgccgcactt gtcgaccaag atcggtgcaa cctccgagac gaaactgacc ccatcagtgg 
205561 cggaccggct tgaaggatcc gagcccgatg cctcggccat tggctctgcc gttgccttgg 
205621 gccgttccgg aattcggaat ggtggcagcg tgacgccttc cagctccagc atcgctcgag
205681 cgttgacgat tcggcgcatc atcggcagca atgcatcgtg caggtcagac gactcggcgt 
205741 tgccggccac ttcgatctgc ttgatcgctt tgcgaagagc atcaccggac gtctcgtagt 



205861 ttcgcgatct cgcatccagc ggtttggcga ccgatcgcgg agcggtttct tgtccagcgg
205921 ccggagcgat tccaccggga aaggtcaccg gtgccgtccc agcaagcaac atcgccgccg 
205981 cgaaaatgcg acgggccggg cggcgaacgt tccgttttga aacgatcgaa atcccctttg 
206041 aaacgacctg aatcatcgtc accaacatct ttgttcgagt atctgccaat cactccatca
206101 tagaccacaa acgcagttct tcccgaaaaa gactcactcg ttgcgcggtc atcgccacgt 
206161 tgaatgcctg caaccggcat caccctgccc tcgccatccg ggcctcccct gccctgctcc
206221 gccgcaatct tcgtactcga tcatcgctcc cgactccctt ggcccttccc cgtttccccc
206281 ctgctctttt acactgccgc attggccttt cccgcaatat ccggcaagaa cacgatgacc 
206341 tcttctgcga acaccacctc gcccgccatc gctgaaacca cttggctgac cgaccaacgc 
206401 atcctggtgc tcgattttgg atcgcaatac gcgcaactga tcgcccgccg cgttcgcgaa 
206461 caaaacgttt attgccaaat tgttcgccac gacatctcgg ccgaacgcat tgcggaactg 
206521 gccccaaaag gcatcatcct ttcgggcggc ccaaacagcg tttacgaaga aggtgccccg 
206581 aagtgcgacg aaggattgtt cgatctgggg atccccgtcc tcgggatttg ctacgggatg 
206641 cagctcgcct gccaagcgct cggtggtaaa gtcgacaaca cacccagccg tgaatatggc 
206701 cgggcgatgt gtgagttcac cgatcgcgat tcgatcttcc gcggcatgca agaatcggaa 
206761 caggtctgga tgagccacgg cgaccaagtg tcacagatcg ccgatcagtt caccgcgatg 
206821 gccaagacat cgacatgccc ctacgccgcc attcgtcaca acgaacgtcc ggtctttgga 
206881 atgcagttcc acccggaagt cacgcacaca ccacacggcg gccaaatcct tcgcaacttt 
206941 gtgattgacg tttgcggttg cgacggcagt tggaagttag gcgacttcgc cgatgccgcg 
207001 attgaatcga ttcgcaaaca agtcggtaac aaacgtgtca tttgcggcct gtccggcgga 
207061 gtcgactcgt cggtcgttgc cgcgttgctt tacaaagcca tcggcccgca gctgtcttgc 
207121 atcttggtcg acaatggact gctacgaaaa aacgaacaac aaatcgtttt ggaagagttc 
207181 agcaaccact tcaagaccga cctgcacatc gtcgaagccg aagaccgatt cctggcagac 
207241 ttggcgggga tcgatgaacc gcaagagaaa cgtcgccgca tcggccacgc cttcattgaa 
207301 tgcttcaaag acgaagccgc caaaatcgaa gacgctcact tcctggcaca aggcacgctc 
207361 tatcccgacg tgatcgaaag cggtgctgac aaagacggcc ccgccgccac gatcaagctg 
207421 caccacaatg tcggcggact tcctgaagaa ttggggtttg aattgatcga acctctgcga 
207481 gacttgttca aagacgaagt gcgtcgactc ggtctggaac ttggattgcc cgaacaactt 
207541 gtttggcgtc atccattccc cggtcctgga ctggccgtgc gttgtctcgg cgagatcacc 
207601 cgagacaagc tggtcgtgct acgggaagcc gacgcgatcg ttgtcgaaga aattgaaaac 
207661 gcgggactat accgagaaac cagccaagcg tttgccgtct tgctgccggt ccaaagcgtc 
207721 ggcgtgatgg gagatgcccg cacctacgac aatgccgtcg cggttcgctg cgtcaacacc 
207781 gacgacttca tgaccgcgga ctggagtcac ctgccgtatg aactgctggc tcgcatcagc 
207841 acccgaatca tcaacgaagt caaaggcgtt aaccgagtct gttacgacat cagcagcaaa 
207901 ccgcccgcga ccattgaatg ggagtgatgc acaccgctgc acacgccaac accccggctc 
207961 cgaccaaaca agacgagcta ccaaaagtgc aatcagacaa gcggatgaat tctcaccgct 
208021 gatcgccaca aacgaacgca ctgacaattc tgtcactttg caattcaaac aactgcaacc 
208081 gtattctccg cctccacctt cagcccacca agcgaaaaca tggccggtca attcatctac 
208141 caaatcaccg acctgaccaa gaaacacggt caacggaaga tcctcgaaaa cgtcaacctc 
208201 gccttttacc ccggcgccaa aatcggtgtg ctcggcccca acggtgccgg taaatccacc 
208261 ctgctaaaaa tcatggcggg catggacacc gaatttgaag gcaccgcgcg tttgggcaaa 
208321 ggcttcacgg ttggttacct cgaacaagaa ccaccgcttg atccaaccaa gaccgttttt 
208381 gaaaacgttc aagaagcggt tgctgaacgc caagccatcg tcgaccgcta caacgaaatc 
208441 agtggcctgc tcggcgaagt caccgacgac gacgagatgc aaaaactctg cgacgaaatg 
208501 gcaacgttgc aagacatcat cgacgccaac aacctgtggg aactcgatcg tttcgtcgaa 
208561 atgtcgatgg cggtgatgaa cctgccaccc aaggacgccg agatcaccaa cctgtccggt 
208621 ggtgaaaagc gacgcgtcgc gctttgccag ttgctcattc gccaacccga tttgctgctg 
208681 ctcgacgaac cgaccaacca cctcgacgcc gaatccgtct cgtggttgga acaacacttg 
208741 gccaactaca aaggcaccgt cgtggccgtc acccacgatc gttacttctt ggacaacgtg 
208801 gctcagtgga tcctcgaagt tgaccgcggc cgcggcatcc cattcgaagg caactacagc 
208861 gcatggctgg aaaaccgagc caaacgaatg gctttggaag aacgccaaca gaaagctcgc 
208921 gaaaagaact tggctcgcga attggaatgg atccggatga gccccaaagc tcgccaagcc 
208981 aaatcgaagg cacggatcaa atcgtacgaa caaatgtccg ccgaacagtt cgaggatcgt 
209041 ccagacgatt tggaaatcca aatcccatca ggcaaacacc ttggatcgtt ggtcatcgaa 
209101 gctcaacacg tcaacaaagc gtttggcgaa aagacgctga tgaccgacat gaacttccgt 
209161 ttgccccccg gcggaatcgt cggcatcatt ggtcccaacg gtgcgggcaa gacgactttg 
209221 ttcaagatgc tgaccggaca agaacctgtc gacagtggcg aaatcaaaat cggtgaaacc 
209281 gttgacctcg gttacgtcga tcaatctcgc gacaaactgg atcccaacaa aacgatctac 
209341 caagagatca gtggcggtca cgattcgttc gagatgggcg gccgcgtcat gcacgcccgc 
209401 acctacgttt cgcgatttaa tttcaaaggc cccgatcaag agaagaaggt cggtgtgctg 
209461 tccggtggag aacgcaaccg agttcacttg gcatcactgc ttcgcaaagg ctgcaacgtc 
209521 ttgctgctgg acgaaccaac gaacgacctc gacgtcgaca cactgcgtgc ccttgaagaa 
209581 gcgatcgcga actttgcggg ctgcgttgtt gtgacgtcgc acgaccgctg gttcctcgat 
209641 cgccttgcaa cccatatcct tgcgttcgaa ggcgacggca aagtggtttg gtgtgaaggc 
209701 aacttcgaca cctacgaacg caatcgtcgt gaacgcatgg gcgaagacgc cgacgacgat 
209761 tcgcccaaat ccagatacaa gagcattcac gcaggctgat ccagcctcgc gatcgctacc 
209821 tccgcgacac tcgcggagga tctcaatcat tcacccatct gtttcctttc caccttttca 
209881 aattcatgac gattctgcgc acaggcacca acgaaaagta cgccgagggc tttgaagccg 
209941 ctttcggtgg cggaactccc gcctcgaaaa agaccgcgaa gaaagcaacc aagaaggccg 
210001 caaaaaaaac cagcaagaag gccaagaccc agcccgctca atccaagaaa aaatcggcta 
210061 agaagtcagt aaaaaaagcg gccaagaaga gcgttaaaaa ggctgtcaaa gccaaagcga 
210121 cgccgaagaa ggcagtcaaa aaagccgcta aaaagtcggt caagaaatct gtgaaaaaga 
210181 ccgcgaagaa gtccgtgaag aaatcggcta agaaggcaaa gaagaagagc tgaatgcttt 
210241 ttggggagtg gccgggaaag ccaagcgttt tcccgtgccc ctcgacataa ccggcacgtc 
210301 ctccacaatg acgtcgcccc cgccaagcgt cccgccttcc aactatcttc ggttcccttg 



210421 gcagcctggt cgccgccttc gcaagacgtc cgcacggcac tccaatccct catcggatat 
210481 ctgaactttt cttctggtcc cgcacccacc gcggtgttct cggcctggaa ccaagttcac 
210541 gacgaagctt ccggcggaga cgtcctcaat gggccgccgc cgttcttgat catgcgagac 
210601 tggttggaag aagccgccca gtcactgcgg aaaagctcac ccgcgtttgc caacattgat 
210661 cgagccctcg gtttgatcaa tctcctttgg tcggggctgt tgccggccta cttggatttc 
210721 catcgcgacc tgctgttcca ccaaccgccg gaactgctct tcaacggttt cttcctggga 
210781 cgttgtgccg aagtcatggt gcaacaaatc ggcgaagacg agtccattgg caccgtggac 
210841 gacgaatccg ctcgcttgcg gcaaatcatc ggcacgctga acgactacgt cggctaccgc 
210901 ccagtcgcgg tcctcgaaaa tcgtgcgtgc caaccttacg cacacgaatt cgttcgaccg 
210961 atcccgcaaa tggtccaggg tgctggtatc accgccggac cctatcgcga attgatcgag 
211021 cgcgcggtgc gagcgatcga agaagctccc gaggacatcc tgcgttccgc ggctatggat 
211081 ccaaagcaac tgcgggagct ttgcttggac ccacgagcgt atgactttga ccatccggtc 
211141 aatcgtcgac cgaactatca ctttggcgga tgggatgaac atgcgatcga ttccgatgga 
211201 cgctacgatc ggttcatcct tcgcagtgtg accctggatt cattgctcgc acgagtcaac 
211261 gagacgccag aactgcctcg tgacgaacta ctcgaagaag ccgcttcggt gttggccggc 
211321 acaatgctca tggcatcggg catctccggt tggggccctg gttgctacag cagcgatgtg 
211381 acgcttcgat cactgatggt tccgatcgcc aacttccgtg atgagtttta caactggcga 
211441 atttcgcgaa tcggtggtga acatggccaa cgtttgctcg acgaagtcaa aacgcgacat 
211501 caagccttcg gtgcagctcg ccagaattta aacgccgctt tggcacgccg tcgtgccatt 
211561 cagttgcaac acgttcaact cgctcgcatc tacgcccgct tgggatatcc cgattcagcg 
211621 acgaagcaag ccgacgtggt accggccacc tccgctcgtt tgatgtgccg cattgactgc 
211681 gggatgacac tcggcctacg cgccctccgc acggatcgct tccccgatgc ggtggaagtc 
211741 cccgaacaaa cattcgattt gatccgccgc gccatcgaat gcggcgccct gatggacccg 
211801 tgggacatcc tcggattcac tggcaacttc agcctgtatc cagggatcga aaattcgatc 
211861 cacgacagcc gactcgatga gttgctgtac atcatcgaga acctcttcgg ctacatcgct 
211921 cgcgtatgga gtgaagcggc cgctcgcgat gacgaagttg cctacgaacg aatggatcgc 
211981 caataccgcg aagttgccca gtggtggcga acattcgcgg ctcacacggt tgaatccgtc 
212041 gaagcaacgg atccttggga atcgtacgag tccgctcgtt tggtcgctca agctctgcgt 
212101 ttgtggcacc gtggcggtgc acaacgaggc gacattgcgt tctgggcacc tcacgcggat 
212161 ctgtttgatt caccacgagc ctacatgctc gtgatctccg cactgctgga acgcaaagac 
212221 ttcatccccg cccaagcact cttgatccac tggctgtgcc aagccgatcg ggtgggactg 
212281 cgcagcggag cgaactcatt gcctcgtctg accgaacgct ggttgctgca acttcgcgat 
212341 cacatgcggg aggactcgca cgagctctgg ccacgtgtca acaagttctt tgactatctc 
212401 gaagccaatg cggaatcgtt ttggtccgca cctcgcttcg aagttggcga cgatgagccc 
212461 cgtgatcggc atcgcgacaa ctgggagcaa gaattagccg aagccaacga agatccgttt 
212521 ggatcacccg aagaagatga cgaggcggga ctgtttgatg cggcgtacga aaacgtcagc 
212581 tatcgcgaca cgaccgatga cggaaacgaa ggttcgatct ttgatactag cagcgacagc 
212641 agcagtgtcg acgaactcga agccgaagtg aaacgactcg ccagccgtct cgacttcttg 
212701 cagtcgctgg ctcgaatgtg ggcggttggt gccgatgttg cactgaccga aattcatgcc 
212761 gcgacggagt cgactgattc agctgactcg tccgacccgc aacgccgaca gcaacagatc 
212821 gaagcattgt ccgcatgggc tcgccgagcg gttgccaacc gaatcgggtt gctcgaactg 
212881 cttggtgatg tccgagctta caaaatcaaa ccggcgggca ccgacaaaga ttccatgcga 
212941 atctatgatc gccgccgcgt cctccgcgac agtctgatgg aacgggtcat caccaccgcg 
213001 gtggaaatgg ctgatgcgag acgattgatc gcaagcgctc tcggtgccct ccagggcaaa
213061 ctgccagaag acgtgggcga agaatttgcc gaagacgatg ctggcgcgat cgagctgttc
213121 tccgcgctca tcgccggcga cgccgaacga gcccgcgagc ggttcccgac gttcatcgaa 
213181 gcgatcctca atcgcagttt gctttacatc ccactttcac gtggcggtga cccggtcaaa 
213241 atttatgtga cccggttgcg agaacgtgtg ctccgacacc tgttgcactg gttgccacgt 
213301 cgcggcctgt tcgcagaaac ctgccggctg atcgaaaccg ctcgccaaat ggaacaacgg
213361 catccaatcg gtgccggtgc cgtgacagag tttgacggct tgtatcgagc tggttatcga 
213421 tctttggtgg ggtcgatcgc cgaagccgtt ttgcatgcga ccgccggcga accgccctcc 
213481 cctgccccga ctgaacctca atccgccacg cgagagctca gcgtcacgga ccaagacgcg 
213541 atcgccgatc ggttgattcc actactggaa cgactgaccg aaacgatgct cggcagctgg 
213601 ctcgcccaca gtcaaaccct tcgcctgtca ccgctcgaag ccgtcagtga tcctgcaaag
213661 tgggaacgtt tagttgaatt catcaaactt tacggcgacc caatcttcac ccaaggtttc 
213721 ctgcaactga gcaacgttcg cgcggtgctg caccaagacg tctcgtcatg gctggaacgc 
213781 gccatgaccg atggcgacga cttgttggaa gacacccagc tatttcgcga tctcgaaatt 
213841 ggcaagatca agatcaacga agcctcgcga tggatcacgc tggtgtttga atcgctgctc

213901 gatcaccacg cggaatacca agactacaac agcacgacga cacaaagcga tcgcggcgaa
213961 ctgatctaca cgttcctcga ttttttgcgt ctgcgaactc gatacgaacg cgtcgcgtgg 
214021 aatttaaaac cgatcatgtg gacccacgaa gtcctcgtgc ggatgggcct cgatcaaacg 
214081 tcactgggtt ggcgtgaaag cctcaacgaa cgcatcgcag ctgaagccga gctatatgtt 
214141 cagaagcttc gtgaattgca acgtcgacat tccatgcgaa tgccaacgat tgccgatcgc 
214201 atcgaagaac gattcgttca accaatgacg atcgaccgaa tgcgatcgct tgtcgaaccc 
214261 gccatgggcg atgcggaagc ggaccgcttt agtgccgcgt tcgaacaact cgaagcagaa 
214321 gccaacgaac tttcgcgaac tcccgttggc gtcggactcg atctgccgcc ttggttgagt 
214381 gctctggaag atgaagtgga aaagatcggg aaacgaaatc taatcagcga gattgacccg
214441 caagaattga tgaccattcc cgtcagccca ctgacgctgg acgaactcaa aacgcaactc 
214501 gcgacggctc aaacccaagg ccgccggttg cctcatatgc gtcgccgcaa atcctaagtt 
214561 gcgcgttccg tctggttcgc ggtcggcttt gcgcgtgacc gctgcacggg tgaacgaacc 
214621 gcaaacttct tcgcctcccg gttgcgtgca tctcgctttc gaaagtgaca agcaaaacgg 

214681 cgatgatttt tcagcggaat ctcgaaaagt caaaaaaact ttgactcaag tccgactgaa
214741 aatggcgtcg cgaggcgagg tttcagaggt caacccgaaa gacattgcag gaactcaggg 
214801 gaatgatccg ctcctctaaa gacccaaatc ctccccatcc ctcctatttt atcgcggcat 
214861 caccattcaa cagcgtgatt caagtgaagc gcacagggtg ccgatctaga ttggctctcc 



214981 tccaacacgc tttccgctcc gccgaatgct tctccgaccg acacctcgtt tccacatctt 
215041 gatttgtcat cggaactagc tcgtcacctg acgatcacgc taggacacga cttcacaggc 
215101 gactcgactg acaaacaaaa cagcgaatac ctttaccaag ccttggcgat tacagttcgc 
215161 gaccgcttgg tgccaatctg gctggaaact tggaagaaga cctgcctgag cgaagatcgc 
215221 aaggtctact acctgtcact tgaattcttg atcggccgtt cgctcaccaa cgcggtcgaa 
215281 aacctcgact tggacgaaga cgtccgcaaa gctctgcgag cctacagcgt cggaatggaa 
215341 gaagtcgctg acaaagaact cgatgccggt ttgggcaatg gtggcctcgg ccgcttggcg 
215401 gcatgtttcc tggacagctg tgcaaacctg cagcttccgg tcgttggcta cggcatccgt 
215461 tacgaatacg gaatgttcca ccagcacatc gaagacgggc gccaagtcga agacccggat 
215521 cgctggttgc gtgatggcaa tccttgggaa atcaaacgcc ctgaagacac tcgccgcgtt 
215581 cgcttctacg gccgcaccga aaactactac gacgaacacg gaactttgcg accacgtttg 
215641 gtcgactcgt acgatgttct tgccgttcca ttcgacatgc ctgttcctgg gtaccgcaac 
215701 gacacggtca acacgttgcg attgtggaaa gcgtcgacga ccgacgtctt caacctcagc 
215761 gaattcaatg cgggctcgta ccccgaagcc gtcgcggcaa agaacgacgc cgaacaaatc 
215821 tcgatggtgt tgtatcccaa cgacgccagc gaaaatggca aggaactacg actgaaacaa 
215881 cagtactttc ttgtctcagc atcgctccaa gacgtgatcg cccgttgggt tgaacagcac 
215941 ggtgaggact tcagtgactt tggacgcaaa aactgcttcc aactcaatga cactcacccc 
216001 gcctgtgctg ttcccgaact catgcgattg ctgatggacg agcatggact tgaatgggac 
216061 gacgcctggg acgtggtgac acgttgcatg gcttacacca accacacgtt gcttcccgaa 
216121 gcgttggaac gatggtccgt cggtctgttc agtcgcttgc tgccgcgttt gctcgacatc 
216181 atctacgaga tcaacgctcg gttcttgaaa ctcgtcgatc agcagtggcc cggtgacgtt 
216241 gcgatgcgtc gcgaaatgtc gttgattgaa gaaggtgaca acccgcacat ccgaatggcg 
216301 tacctggcca ttgtcggaag tttctcggtc aacggtgttg ccggtctgca cacacagttg 
216361 ttggaatcag gcctgttcaa acacttcaac acgctgtggc cccgcaagtt caacaacaag 
216421 acaaacggcg tgacccaacg tcgttggttg tcgcactgca acccaggcct tcgtgacctc 
216481 ctcaacgaaa ccattggcga aggttggcaa aaagacctca ccaaaatcaa agagttaact 
216541 ccttacgcga ccgacgccga cttccgcaag aagtggatcg atgtcaagca agaaaacaaa 
216601 gctcgcttgt cagacttggt ggtcgcggaa accggcgttc gctttgacac gtcattcatg 
216661 ttcgacgttc aagtcaaacg gattcacgaa tacaaacgtc aactcttgaa cgttttgcat 
216721 atcgtgcacc tgtacgatcg cattttgcgt ggtgaaaccg cgggcatggt cccacgttgc 
216781 gtcctgatcg gcggcaaagc agcccccggt taccacgtcg ccaaattgat cgtgaagctg
216841 atcaacgatg ttgcgaagaa ggtcaacaat caccccgcag ccaatgacct gctcaaggtg 
216901 gtcttcttcc cgaactatcg cgtttcgtcg atggaagtga tttgcccagc aaccgagttg 
216961 tctgagcaaa tttcaaccgc gggcaaagaa gcttccggca cgggcaacat gaaattcatg 
217021 atgaacggtg cgttgacgat tggcacgctc gatggtgcca acatcgaaat ccgagaaaac 
217081 gcgggtgccg agaacttctt cttgttcggt ttggacgctg ccgaagtaac cgaattgaag 
217141 aaggactatc gtcccaacga aatcatcgcc gctgatgagg acatcgttcg catcatgaac 
217201 ctgctggaaa gcggacactt caatcctgac aatccaggtt tgttcgattt gttgaccagc 
217261 ggcctacgca accctcagga tccttgggtc accatcgccg accttcgtgc gtacatcgac 
217321 gctcaagccg aagttggcaa ggcataccaa gacgtcgatc actggaatca gatgagcatc 
217381 ctaaataccg caggcagcgg ctggttcagc agcgaccgaa cgattcagca atacgccgac
217441 gacatctggg acgtgcgacc actgtcgtaa gccccaacct gattggtcca aacgaaaagg 
217501 gctcggtgag aaatcaccga gccctttttt gtttgttgag aacgaattgc ctagtactac 
217561 agcgctaact ggtgttgtag agggccgggc ttttcgctga gaatagcagc tggagatttt 
217621 gtttgctgca cggagagctt ggcatggatg tcgctgaact ggaacaactc gaagatcggt 
217681 taaatgctta cttggctcgc tttggcgatt gttttcgacg aagcgacacg agggctcatt 
217741 tgacgaccta tgtccgtggt caactttccg acctggacgc caagagtgtg gagccgattg 
217801 cgttgcaagc cggtacaccg gtgcgaacct tgcaggaatt tatcgcccag catcggtggg 
217861 acgaagatgg acttcgcagg aggctgatcc acatcgtccg tgatgagcat gtcaacaaga 
217921 acactgtcgc gatcattgac gaaaccagcg acgtcaagaa gggcgacaaa acgcctggcg 
217981 tgcaacgaca gtggtgcggc aaagtcggca agcaggagaa ctgtatcgtc acggttcatc 
218041 tggctgcggc gaacgaagac tttcactgca tggtcgatgg tgaactgttc ctccccgaga
218101 gctggagtaa cgaccgcgag cgttgtgccg ccgccggcat tcccgatgag atggtctatc 
218161 gccccaagtg gcagatcgcg ttggaattgc ttgatcgcag taaggaggag gggattgaat 
218221 atccttggct aaccgctgac gaaggctacg gcggtaaacc tggattcctg gaagctcttg 
218281 ccgaccgcga tcagaagttt gtgcttgaag tgccgcgaac gttttcggtt tgggagaagc 
218341 atcccgaagt gaccgagcag ccctatcgca agggcggccg cggccgaggt cgcaagacac 
218401 cccgcgtcaa gagcggggaa agttcgccgc gaagtgttga aacagtgttc tggcacggcg 
218461 aagcgatgaa agcgaaacgc tggaaacgct accgcgtcaa agacggcgag aaaggtccca 
218521 tcatctggga agccaagcgg gttcgcgtca cactcaaagg cagcgacgga ctaccggggc 
218581 tgtctctgtg gttggtggtc gcgagaaacg tgcttgacgg cgaactgaaa ttcttcgtca 
218641 gcaacgcgag cgagttcgct tcgatggcga tgctgctaca ggttgcgttt cagcgatggc 
218701 gagtggaacg ttgtttcgag gatcagaaac aagaggtcgg cttagactgt tacgaggggc 
218761 gccgatactt gggtctgaaa cgccacttga tcatcacgtc gttgagctat ctgtttcttt 
218821 cgcaaacctg ccagcaggag cgggaaaaaa aatccggagt ggacaattca gcaaattcgc 
218881 gacgcggtcg acgcaatcgt tgtcagctgg tcactccctc gtgagagtcg tcgcatgttg 
218941 cttgatcagg tcgcctcacg gatcaactac catcaatcac gcaacgtagc ggctcgcatc 
219001 agccatacgc aaacgaagct cgacagatac gccgaactcg gtatcgaccc cgaaagcatc 
219061 acacgatgcc gatggcccaa aacttagcgc tgtagtacta gcaggctgtt ggtttagacg 
219121 cataccgatt gacaatcatt agccgttggg cgttagcccc ggttggcgtc taatcaaccg 
219181 ccgctaacgc ggtgcggctc aataaatcaa cagcctgcta ggacaaacct acgctggccc 
219241 gagccccatc gcacgtcgcc aagcactgaa ttggccgatg tgaagcatca tgtggccgct 
219301 caggtagaaa ccgtgcagcg atccaaccgt gctgaacttt tcgcgaagat gttcagtcgg 
219361 gttcggcgtt gaaaacaccg cgtcgtcggc attcagcaat gcctcgatcg ctgcgccatg 
219421 cgacgagaag aacttctcca ccacttcttc cattggcgga tacagcgtgc cattgggatc 



219541 cgaggtgttg gctccaactt gctcaagcac ttggcatgga tacagcgaca aatgcccaag 
219601 gatgaatgcc ggatgattcg actcgataac agtatcgccg accacggcaa aacgagcgaa 
219661 ctgctcctgc gtcactcccg tcatcagtcg tttggcgtat ccgatgcaaa gctgaccaga 
219721 gtcagcgatc atttgtccga tcatgttgtt tctctaagag gtgcaaatca atccttctgt 
219781 tttagctacc ttccaaaccg aaggtaagcc agcgacaccg cccagacgcg gcatgcacac 
219841 ttggtggctc tctcgttgac ggtgttttgt gcgactcgaa gtgacgatct cgcgaggcac 
219901 ctaccaaacg acaaccccgt tgcgaaccca gacaagcatt tggccgggca cgccgttgat 
219961 caagcagcct gagattctgc ttggtgttgg tgcgatgact tccgccacca aagattctca 
220021 gcccaactgg cgtcgtgccg caaccaataa gcgcggaccc aatcgacggg accacgaccg 
220081 aatttgcgtt tcgttcccca gtaataatcg ccttgaggca ccacatcgat tggccgtgga 
220141 ttgccgtgga atactacgac cttcgctttc gctggacgct tgatctcggg aaaaagctga 
220201 ttgaatggat agtacttcag gcaatgccat ttgaagctcg cggtccattc atacggccaa 
220261 taacgaacat cttcgacatt ctcggtcaag aaatcttggt ccagtggaaa acgctcgaag 
220321 cattcttgtt gattctcgtt gaatcgatgg aacaactgag cttgttcacc aggaaaaaaa 
220381 cccaggatcg atccctgaac gccgcgatcg ggacgcatct tcaaaggagc caaattccat 
220441 agcgtcggat tccactctcg tagtgcatga aagcctcgtt cggcctccaa acgatcaaag 
220501 aacccatcaa ggttctgatt gaccatcaca tcgagatcta gatacagcgt tggctcgtct 
220561 gccggcaaca cacccggtgc aaggattgac aacttcggcc agcaaccttt ttgctgccgt 
220621 tgcataggaa tccccaaatc gggcatttgg actccttcga caccatccgt caaaccagcc 
220681 acattatcgg tgacgcaaac aaaacggtga ggacgattca agttggccgc aacggcgttc 
220741 ctcaacactg tgacatacga tgctgggaac gcgtccaacc accgggtaca gacaacggtc 
220801 gccatgatga tttcgccatc catgattgag ttgagttggg gggcggttcc ccttctaggt 
220861 ttccgtcgaa ccatactcaa ctagcaaatc ggaatccaga ccgaaatcta ccctctatgc 
220921 tgcacgtttc attccttctg ctgagggatc cattctttct gagacgatca tatcggcgga 
220981 acgaccgtca cggttcgctt cttcagaacg ccagtcttga aggacgttcg catttcgaat 
221041 catccagatc gcaaccgtca tcatcatcgt gtaaccaaaa acctcaaccg tttcctctgc 
221101 ggtaacccac ttcgatcgct cgaacaattc ggccagaacg aagcaagaag cgaacggaat 
221161 cgcgggccaa acgaacatcg gcgaaaacca caacttgcga tcctcaggaa acttcagcca 
221221 ggcgtatgca acacgcgaca accaaacgca aactgcaaca gcaatgacgc catgcttcca 
221281 ggctgctgaa attgcaaagc cggtcgcctc agcggtttga taggcgccat cggataggct 
221341 tgcttggctt acagctgatg ggatttctcg ggctgctccc gcaaccgcga cgcaaaccaa 
221401 agcaacagcg atactttgcc agctggaagc cttcactcga agtaacgccc aacaacctgc 
221461 gatcgccgtg caaaccagca ccaacaattg gacagactca agggccccat tttcgccggt 
221521 caaccactga ttcgataatg agtcagccag catgtagttc gctgcccaca atccaccgca 
221581 gcaaacggcc gttgccaaca aagaatctct catcactttc aggtatggcc accgcatcca 
221641 gtgcgattgc tctttggcga ttgaagtttg aacgtctttt gattcgcttc tcgatggcat 
221701 cgcttcggct acttcattgg aggttggatg gaaaacggcg accaaacgaa gcaaccgttc 
221761 gccaatggcg agtacggcat gttcggtgcg agagtcgggg ccaccataaa ttggaatttt 
221821 ccttgcaaac cgggacgcga aatcccgcaa tagaaatttt gaaacggccg ggtttgcagt 
221881 ccacatttga acgatgaatg cggccgaacc aactaccggg atcgccccaa caaacatcgc 
221941 atgccgatgc gggacatcat gagatcgatt tgctagcaaa ctgaaaagcg tgaccagagt 
222001 tcttgccgca ggcagaatca tgattggcaa aatagcaaac cagacactac cgccggtcag 
222061 tgccatcgcg attccaccaa ccttcaaagg ggtgatcagc gggtagaaaa acttgatcgc 
222121 caagtgcatt cccaaaccgc gcgaatagat cgctaactgg ggaccactga actgggaatc 
222181 aagtctcatt gcttcgtcgt cagtcaacca ttcgcgagat tgccaccgcc gcgtagcagc 
222241 tcgaaatgcc ttgcgtccaa aagcaatttg atagctgccg cggaacaaca aagctcgcat 
222301 ctggttgttt cggcgggcac gtcgagttgg atcgaccaaa tagcgatgca cacccgccgt 
222361 ggttgtatag cgaagcaagc ggtgaccaat gaaccctaaa ctcgacgcag gacgggtccg 
222421 attggaacat cgttgctgtt cacgaagagc tttcacctgc cggcgtcgat atttcatcca 
222481 gaaggcggag cgtacacgtt tgtccgttgc aaatcgacga acacgttccc gatgagatcg 
222541 gtttgcaagg cacgactgaa accatcggcc aattgcaccg ggcgggcaca cacccagcaa 
222601 ccacatcaat gtgaagaacc aaacgctggt ctccagagcg ggcacgtttt ccgaatcgac 
222661 caaacgttgt tgctgccatc gatcgataca ggttcgacga tagtgggctc gtcgttgagc 
222721 agaaaaccag gtccatgggc gacggaaagg agcgatctcc cccgctttcc attgctcgtc 
222781 cgcggcaacc aaagcttcca cgtctgccga aagttcagca gccagatttt gctcacctct 
222841 ctgcacaagc tgatcaacca aatcagcgca ggaagctcga agtttttgcg aatccaaatc 
222901 atcgaaggga aagacggccc ctcgacgcca gctttgccac aagtaccgcc aaaccagaat 
222961 cgccggtatc ccggactcca ggtcaatgat cgtatagcga agacttcttg gcgaatcggc 
223021 ttccgattca tccgaatgaa cgacgcgcag ccccgcatgc tctgacagca ataggtttgc 
223081 ggtcgaaacc atggctcccg gcgacacctg ccaaccggat ccaaccaacc caaaatcaaa 
223141 caactgccgt tccacccgcc tcatttcggc gaccaaacca gccatctcgt tttcgacggt 
223201 cgccggattc tgattggcac acgccgggat cggaccacga ccttcgacaa aactcgcggc
223261 gagaacatag gcttcgcgat cgcgatcgta gcgaacgtat gcgggcagcg caacatcaac 
223321 agaagaatcg cgttcatcgc ggatcgcgga tgcttctaac attcgcgaaa ccactcgccg 
223381 gcgatagaaa cacgccaaga tcgcgtgttg attcgattga taagcgaacg gggccgcaaa 
223441 ggccaatcgg tagatactgc gggtcaataa accaggggcg aagacctttt ctacgtagag 
223501 atgacgccgt ccgtcgttcc acgaagcctc caccaactcc gccgtggcag ctcgcccgtt 
223561 cccaagcgtg cgaatcaggc gcacgtcgtc cggggcctcg gactggcacc gatacggatc 
223621 cagaccggta tccacgtcgg aattctccgg aatttcgccg atctgacggg tgttcgataa 
223681 tctcgtcacg ggggaactca aatggacaga acgaatcgaa cacgaaaatt tcataaacgt 
223741 tcgatactat ctctcgaatc ccgccgtcaa ttgcaagaag ccagcatcag cgaccgagga 
223801 cgcacattct ggactcaatt tcggctctgc gccggcagtc cgtcgtcgct ggtgacaaag
223861 aacgcgtcaa agtcgccccc agaattagcg gttccctcca agtcgtactc ttcggacacc 
223921 aatgttccca accaagcttg gatccgattg gcaccgtcgg cgtgcgagtg gtgagttgca 
223981 acttcgctgg cgtaattgat taaatcgtaa actcgagctt tgctgggcag aatccggcga 



224101 agcgaacgga accgcccgat cagtccatcc tgcgtcagct gcttgtcccg acgtagagac 
224161 tccagcaatt cgttcagtcg aagacactcg tgcaccgacg cccaagaagt ctgacttgat 
224221 tcgaatcgct gcctgagtgc cgcatagcca tctccgttgt cgaacgactc gatcgctcgc 
224281 acgatgcaat gctccatgtt ctttccaacc ggcacatcgc tgcggaatgc tttggcgtgc 
224341 ccgatcatgc cgttgctgca gaccaaccgc agcatcgaca aaaacaatct gggatgcccg 
224401 tagccatcga ccggagtctc caaacaaaac cgatcgcgaa actgatcacc accaattgca 
224461 aattgacgac tgccgccgcg aggtgcatgc gtgctggtca caactccgtt gtggtaccga 
224521 atgtcttcgc ctccatttcc atccagcaag ccgcggatgt cgtcgtagcg gatgattggt 
224581 cggttgggat tggtcacagc caacaattga tcggatcgct tcgatgaagc actcgacgac 
224641 cgagcgatgc agtagcggaa cgtcacatcg tcccgttgcg actggatgcg tgaaaaaact 
224701 tcatccggac ggaagtagcg gaacacgttt tccgcgattc cgaatcgggt gaagaacgat 
224761 cgccagaacc gacgggtggc gacaaccggt tgcccatcga tttccagttc acgcagactg 
224821 atccgtccgc cgggagaacg atgcaccgtc ccgacttgga tgtcaccgac actggccagt 
224881 gcataactga aacggggaga ggtttgggat tcgcttggga gagatgctga agtcgacatg 
224941 attgatttcc gagtgtggga ttttgaaatt tcgattgaat cagtgagctg cgaagtccgg
225001 tgctaagcag gtcggcagga atgatcgggt agaacctcac gtaggcgaga ctctttgaga 
225061 ctcgcaaatg acagcgtctc ggagagacgc cgctacgtga tcaagcccga tgacttccgc 
225121 ttacgtgctt agcgagccaa ccggtgcata cgacgacgtg gccggtgagc agcgatgagt 
225181 ttcggcagac tcacattggc cttcgcgact cgctgacgtg ctgccatggc cattgcatcc 
225241 aactgacggt tgccagcgat gcttcgccaa ccgtattcag tccacagata ctgctcagcg 
225301 gggatcgccg ttcgcatcat tgggtgtcgt gtgtgccgac actggacaac ctgtttgatc
225361 ggttccgatt cgcatttgcg aagcactcgt ccagccattt gaatcgtggt tccctggcca 
225421 ctcggccggc agaacacggt tttcaacgtt ggacaatcaa aaccttcggt gaggatggcc 
225481 atgttgatca gaacgtcggt atgaccttcg atgaagtcgg ccaactggcg gtcgcgattt 
225541 gtttttgcag tcaccacatc gctgcgaata ccggcttggt ccaaacgtga cttcagttcg 
225601 tcacattccg tttgcttgtg aaagaacacc agcgacttgc cccatcgctc tggttcatca 
225661 atcaacaatt cagacaatga ctgcggcgag tactcttcaa tcgtgtagtg atggtacgca 
225721 ctcaaaaatc catccgcgat caagtgagcg atgccagcgt cgcgaatcac attttcgaaa 
225781 cacaatttga cgcggtcggt gcgatatggc gtcgcggaca atcccagcgt ccatcgtggt 
225841 tgaatttgcc cgtgcaagtt ggccatgctg gtcgctgcat cgtgttgtgc ctcatcgatc 
225901 accaacaaat ccacgtccgg cgggtccttg tcaaacatgc tgatcaattg catgtcgaga 
225961 ccaaacttac ggcgaatgtt ttcggcttga gcctgggcca acagattcct ccgcatcgcg 
226021 acccaaccaa ctcggatacc gaacgttcgc tgcatgatcg cggcgatggc cagccccatc 
226081 accgtcttac cgcttccggt gggactttca atcagaacgc tggatgggcg tgaaacgccc 
226141 tgccctgccc cgtcgatcgc ggaacaaaac tcgtgttcgc cacacccaaa cgcattgaga 
226201 gttcgccgga tgatacggtg ttgatacggg cgaggctgaa tcgcgaaacc gtccaaatcc 
226261 aactcggatg atggttcgga tgggatcgct ggtaagatcg cagtggacat cgtgttccct 
226321 ttctgcaggt gtcaaagttg aacacctgta gtgataggga aagcggtgac acgttcggtc 
226381 accgagcgaa tgctacgccc aaacttctat tcgcagaagc gaacgggcac tgatcggact 
226441 tgaggtgaat ccgccgacgc agcgagcaat cagccgcgtt ttggccttgg aattttggtg 
226501 cctttgctgg ctcccttttt cttcgcaccg cggcgaccgc gtcctcgacc gcccgatttg 
226561 cccgccgatt tctcgtcgac aatctccacc tcagaaagtg gtttgcctat gtgctccttc 
226621 agttgcagca cacggtctcg cagccgagcg gcacgctcga actcgagatc ttccgccgcg 
226681 gccaacatct cttgctcgag tttgtcgacg tactcgatcg tgatgtaaat cgcttcgccg 
226741 ctgtcctgag ccttgcgagt cgactctttg tggttggcag cctcggtgtc gatgcctttc 
226801 cggatcgact tgcgaaccgt cttgggcacg attccgtgct tggcgttgta ttccatttgg 
226861 atcacgcgac gacgttcggt ttcgtcgatc gccatctgca tggaatcggt caccttgtcg 
226921 gcgtacagaa tcacgcgtga gttcgcgttc cgagccgcac gaccgatcgt ttggatcaaa 
226981 ctcgtctcac tccgcaaaaa cccttctttg tcagcatcca agattgcaac caaagaaacc 
227041 tcgggaaggt ccaaaccttc tcgaagcagg ttcacaccga ccaaacaatc gaactggccg 
227101 gcccggagtt cctgcaacaa atccactcgt tcaaaagcat tcagttcgct gtgcagccat 
227161 cggcacttga caccttgttc ttggaaatag gtcgctaaat cttctgccaa acgtttggtg 
227221 agcgccgtga ccagcacacg ttcgtctcgc tcggcgcgga tccggacctg ttccaacaaa 
227281 tgcgtcactt gtccgcgagc cgaaacgatt tcgacttccg gatcgagcaa tcccgtaggc 
227341 cgaatgattt gctccaccac ttcgccgccg gtgcgttcca attcgtagtc gctgggcgtt 
227401 gcactgacaa agcaaatttg tccggtccgt tcttcccatt catcaaactt caatggccga 
227461 ttatccagtg cgcatggaag ccgaaaacca tgttcgacca atgtgatttt tcggcttcgg 
227521 tcacccgcgt acatcgcgcg gacttgaggc acggtcacgt gcgactcgtc gacaaacgtg 
227581 atgaagtctt tcggaaagaa gtcgtagagc gtgtccgggg tcgcgccagg ttccttgccc 
227641 gatagtgggc gagaatagtt ttcgatgcca ggacaatgtc cgacttcggc caacatttcc 
227701 aagtcgaact tggtccgtgc cgacaaacgc tgagcttcca gcaacttgcc ttgtgactgg 
227761 aagatctcca gttgctgcgt cagttcttcg cgaagcagtc gaatcgcgcg ttgaatgcgg 
227821 tcctctggca tcacgaagtg cttgcatgga tagatgtaga gatgctcgac cgttttaatc 
227881 gtctcgccgg aggttggttt gatcagcgag atcttctcga tctcatcgcc ccacatttca 
227941 atccgataag cgaactcttc gtagctgggc caaagctcga tcgaatcccc tcggacacga 
228001 aatttgcccc gttcgaactg gatgtcgttg cgttgatatt gaaggtcgac aaacttcaac 
228061 agcaagtgat ctcggcgggt ctgttcgcct tgatgaagat cgacgaccaa ttgccggtaa 
228121 tcatccgggg atcccaagcc gtagatgctg ctgaccgagg ccacgatgac gacatcacga 
228181 cggctgacca aagaactggt cgttgccaat cgcagccgat cgatttcctc gttaatggag 
228241 gaatctttct cgatgtaaac atctcgctgc gggatgtagg cctcgggttg gtagtaatcg 
228301 taatagctga cgaagtagtg gacggcgttg ttaggaaaga actctttgaa ttccccgtac
228361 aactgagcgg ccaatgtttt gttgtggctg agaatcaacg cgggtcgctg gaccgacgca 
228421 atcacgttgg ccatcgtgta cgttttcccc gtcccggtgg ctcccagcaa cgtttgagcg 
228481 gtcttgccgc tttggatgcc ttcgatcaat ttggcgatcg cggccggttg gtcccccgat 
228541 ggcgggaacg gttgatgcaa atcaaaagcg gccggcgcaa gtttggtgat gctcatgcga 



228661 gaatttcaac gcactgccat ggcgaagctg tatttctact attccacgat gaatgccggt 
228721 aaatccacgg tattgctgca gtccagctac aactatcgcg aacgtgggat gaacacgttg 
228781 attttgtctc ctgagatcga cacccggttc ggcagtggca aagtcgcctc gcgaattggg
228841 attgaatcgg agtccgtttc attcaacacg tccgacaatc tgctgaacct ggttcgcaac
228901 gagacacgca ttaatccgct ccactgcgtg ctggttgacg aggctcagtt tttgacccga
228961 acccaagtcc gacagctcag cgatgtttgc gacgacctgg acattcccgt gctggcctac 
229021 ggtttgcgaa cggatttcca aggcaatctg ttcgaaggta gcgaacactt gttggcgtgg
229081 gccgacaccc tgacagagct caaaacgatt tgccattgcg gccgcaaagc caccatggtg 
229141 ctgcgagtga gtgaatccgg ccaggtcatt cgtgacggtg agcaggttca aatcggaggc 
229201 aacgaacggt accaaacggt ttgccgcctt catttcaaag aagcgatcta ccaacgcgcg 
229261 gaagacgaac tgcctttgct ggattcgaac gaaccccgcc aatccgaacg ctgagaacgt 
229321 ctcatcctgg cggccacgac tgcggacgac acgtcctacc ccgcgcgagc ctcggtcgac 
229381 tattctgggg ccatgcaagt ctcacaagaa ttgatcgatc gcgttctgcg actcgttcac 
229441 gcggcggaat accgtcccag caaacccaaa caaatcgccg cccttctcga gctggacgcg 
229501 gatggttatc gcgaagtccg ccgggtgatc aaacaattgg tcctcgaagg ccggttgatt 
229561 tacggcggaa atcacttggt cgtcgcggca gccgccgttg gtggtcccac cgaccagatt 
229621 cgcgggacat ttcgacgcgc gatgggtggc ggctttggtt tcgtccgccc ctccagcggt 
229681 ggtggcaatg cggacgctga tgtccccgaa gatgtcttcg tgcccacggg gatgaccgcc 
229741 ggcgccctcg aaggcgacct cgttgccgtg acgatcgaac ccagtcgccg cggtggcatc
229801 gaaggcaaag tggttgaagt gctgcaacgg gcacgacgcc aattcaccgg cacgttcttt
229861 tcctcaccgc agcccgacca acctggatcc gacacgatcg aaggtcccgt cgtctacttg
229921 gacggtgtcc actacgaagc tccggtcagc gtcggcgacg tacgaggttt gccgctccaa 
229981 gatggcgaca agatctttgt cgagattgtc gatttcccgg atgaagaaag cggcggcgga
230041 gaagccgtca ttctcgaacg cttgggaagc agcaagaacc cggcgatcga caccctgacg
230101 atcatgcggc aatacgcgtt acctgatgag ttctcggaag acgtgttgga cgaagctcgt
230161 gagcaagccg atgcgttcga cgacgatgtc gtgccaacgg atcgtaaaga cctgacggac 
230221 atgctgacga tcacgatcga cccgttcgat gcacgcgact tcgacgatgc aatttcactg 
230281 cagcgcgaag atggccgatg gcgattgtgg gttcacatcg ccgatgtcag tcactttgtt 
230341 ccgcccggtg gaaagctgga cgtcgaagcg cgtcgtcgcg gaaccagcgt ctacctgccc
230401 gaccgcgtga tcccaatgat ccccgagatc atcagcaacc accttgcgtc actgcaaccc
230461 gaacgcatgc gtttggtgaa gacggtcgag atcgaaatgc tggatgattt gaccatcact 
230521 cacagcgaag ttcacaacgc tgccatccgc agtgacaaac gattcaacta cgaacaaatc
230581 gaccagttca tcgcgtcccc tgctgccttt caaaaggact ggggagattc catttgcgaa
230641 ctgctgacgc acatgcacac gctcgcgatg caaattcgca agcggcgttt caaagatggc 
230701 gcgctgtcga tggacatgcc agacatcaag ctcgaactgg atcgaacggg caaggtcaaa
230761 ggtgcttacc aaaccgagaa caccgaaagc caccagatca tcgaggagtt catgctgctc
230821 ggcaacgaag ccgttgccac ttggctggat gatcaggaac tcaatttcct gcaccgcatt
230881 cacgcaccgc ctgaacgacg caagctgcga cagctcacca gttttgtcaa agacctcgga 
230941 ttgggttttg acaacgtgga aagccgcttt gaaatccaag cggtcctgga caaggtcgcc
231001 ggcacgacgc tcgaaaacgc tgtcaatttc gctgtgctga aaagcatgag caaagccgtt
231061 tacgggccgc accgcgaagg acactacgcg ctcgacaaag aacattactg tcacttcacc 
231121 agccccatcc ggcggtaccc cgacctcagc gtccaccgct tggtgcagcg gctgatcgag 
231181 caaaaatcaa cgccagacga atctttcgcg gagttggtca aactcgggca cgaatgcagc 
231241 gacgccgaac gcaacgccgc tcaagccgaa cgtgaattga ttcaactgaa gttgcttcac
231301 ttcttgaaaa agaagcaagg cgaaacactt gaggcggtga tcagccgcgt ctttgccgat
231361 ggcattcatg ctcgctgctt aaaacttccc gtcgatggct tcataccggt cacggaactg 
231421 cccagcgatc aatatcgatt cgagcgtcgc gggcaaagtt tgacgggctt caaatcaggc
231481 aaccgcttcc gactgggtga ccacctgacg gtccgaattg gcaaggtcga cctgcaagac
231541 cgacagttgt atctcgaagt cgtcaaaaat cactcggccg ccaaaagtga tccgcgagga 
231601 ccttctggca gcaagtcaaa gaagtcacct ctgcggcatc agaagaagtc cgatcgacgc 
231661 gaaaagaaaa agcgacggcg gcgttagcag gctgttgatt tattgagccg caccgcgtta 
231721 gcagcggttg attagacgcc aaccggggct aacgcccaac ggctaatgat tgtcattcgg 
231781 tatgcgacta aatcaacagc ctgttaggtc cgtgatgctt ggtcaccatg cggccgaacg 
231841 cttgaattca cggtggggac caccgtgcca caggttttga atcttttcca tcgttgccat 
231901 gaacgttctc attctcaccg gtgccggaat ctctgcggaa tccggtattc ccaccttccg 
231961 cgacgctaac ggcttgtggg aagggcacgc cgttgaagaa gtcgccacac cccagggttt 
232021 cgctcgcaat cccaatttgg tgcacgagtt ctacaatcag cgccgacggg cgctactgaa 
232081 ccccgagatt caacccaacg cagcgcacgt cgcgctggcc gatttcgaac gcgaacatct 
232141 tgaaaatgga cgcggcgatt tcctgttggt gacgcagaac atcgacaacc tccaccaacg 
232201 tgctggcagc caaaacgtgc tggcgatgca cggtcaattg ctgcaagttc gctgcgtgta 
232261 ttccgaagag atcttcgact ggaccggtga cctgagcgtc gacacgccgc acccggaagc 
232321 tcccgacgac gattccatgc gaggctgctt gcgtcccaat gtcgtctggt ttggcgagat 
232381 gccgatcgga ctgacacaaa ttgagaaggc cgcgacgaaa gccgatctgt tcatcgccat
232441 cggaacatcg ggtgtcgtct atcccgccgc cggcatcgtg gctcaaacgc caccgcactg 
232501 ccgccgcatc gaagtcaacc tggatgacac gcccgcgtcg agtgcattcg atgaaaccat 
232561 tcgcggtgcg gccagtgtcg aaatcccgaa gctactgaat cacttcagcg cgatgtaaac 
232621 gtctgtgatc agatctttgg tttcggtcgt ttctggatcg ttgcgataca cctcacatcc 
232681 gatcgccttg gccatcttca gtttcttgta tcggatgtat tgaatcccgc ctgaccacgc 
232741 gttgccaaga tgcgcgtact caccgatgtg ccggaccaat aaaaacttgc cggccggaac
232801 cgtgtccgca acgcagccat ccggcaccgg cgtgcccgca tcggccaaat aaccggccgt 
232861 gtaatcgaac catcgtttgc gaagatccga ggtcgtgtga tagaggctgg cccattcgtg 
232921 aacgcgttcg tcgtccgcgt tgtagtgtga cttgactcgt tccaacgtcg cggccatgtg 
232981 acaaccaatg tcgtccatcg tgcaaccgcc acgctgaccg atgatcatgc gatcgggctc 
233041 gctgaccaca cctttgatct ccaacttcga aagcacttcg ccggtttcaa cgaactgttt 
233101 gaacatcagc aacccacgtt cgtagtccat gcccacgaac gtttccatca tcgatcgcat 



233221 accatccgtc atcaccggtt cgatttcgaa ctccaccttt gactgtgact tgaacggttt 
233281 gacgaacgcc aaatcggctt gcaccgatgc atttgctgac aaactcttcg gagcctgaag
233341 ttggcgatgg gtcatcgaac cttcgccgac caattcaccc ttccagtgat acgtcgcccc 
233401 caactcattc gacggatcgc tgacggtcac ttccgcatcg ggatcgactt gcaaccaagg
233461 cgaccagcga gtccaagtcg aatagtccct gaccgcgtca taaacatcgc gtgtatcagc 
233521 gtcgatggtt tgggaacgtt gaatatgaaa cgcgggcatg gtgaattcat ccctacaggg 
233581 tgactgacgt accgaggcgg atatcctagc aaactcgcct agtaacagca ttacgataga 
233641 agttttcact ttctcgcaaa cgcacgccaa caaccctgcc ccccccgatt gaacgcccct
233701 tatttgggaa gcgaaccaga gtgcgtttcc aattcaatct cgaccagcaa cggtcggtga 
233761 tccgaagcga ccggctcgtc cagaacctcc gattcgacga ttcgccagga gttggcgggg 
233821 tacatcgcaa catagtcgag ttgtcggttg ggcgatcggg cggggaaggt gtgcattgat 
233881 tcagttgtcg cgatccgcca tttcgcttgc agtgtttgca tggcaagtga ggccggtttc 
233941 gcattgaagt cacccgcaat gatgaccggg gtgcttccgc aggccaggag ccgattgagt 
234001 tcagtggcct gcttttcccg caaatcagca cgcgagtgat gcagatgcgt cgtcgcgaaa 
234061 cgcagccgag ttttgtgaat caaaatttct gccaccccga caatccggcg ttcccgaatg 
234121 ggatcaccgg gtaaccaatg cacttcgagc gattcgatgg ggtactttga tagcaccgct 
234181 tgcccatatt cgccgccgtc atagtcgatt tgcttcgcga attttccgtg cagactggtt 
234241 tggacggcaa gcgtttcgac ttgattgacc atcccgtttc gacgggtgtt ttgatccact 
234301 tcttggaccg ctaccaaatc aggatccacc gatcggatga cgttcgccaa acgatccaga 
234361 tcaattttcc cgtcggttcc ccgaccgtgg tgaatgttgt agctcagaac tctcaaccgg 
234421 atcgttgagt ttgtttcttg ttcttgagca agcacaagcg gcgagcgcaa catgaaggca 
234481 aacgatgcaa aagcaattgc gaggaagagc attttcctga cgaacaaaat caacatccaa 
234541 tgcacgttgt ttgattcaaa aagcaagccg tccgagacgg ctgcgacggt gcgtcgtccg 
234601 gtttggactc accaagactt ttcaggcgga ggttacttcg gaacgatacc gaaatccacc 
234661 gaccgctgcg gcgtcttcac tgccggcgtc ttgactgccg acgttgactt gcgttgcatc
234721 gggctgaaga ggagcaccaa cgaagatgac gctttgttct ctacagagca gtcgcgatgc 
234781 tcaatccaaa attggcgatc gcagaaaatc ccgcaccagc ccgaaaattg ggaccggtga 
234841 gagatcaatc ggagcattga agacgcgaga gcaattgaat aggcttcgat tcgcaggtca 
234901 gccaagttac actgaaggct ccacgcgacg gccgttgccc ccctccgctg gccagttagc 
234961 agggagctct gaaagccagt cgctcacatg cgagacgatg ccccgtaatc gcattgcgaa
235021 tgacgattcg cctcggatgg agcgactggc aatcaaacgt tagctgccgt caggattcct 
235081 gatgcgcaaa gttggcgcaa acggcttcgt ttttaacaag aaaaataggt tttggaatgg 
235141 ctcaggccaa cgctcaactc aagcaagctc gccaaagttt agggttgtcc cagttgcagc
235201 ttgcgatgcg ggcaggtgtc tcatctcgga ccgtccaatt cgccgaatcc ggtcagaacg 
235261 tttcgatcgg tacgatgcgg cgcatcgccg gtgcattggg aatgcaagcg gaccaattgg 
235321 tccgaatcga tcccggcacg ggccaagatg gtttcgcgga actcccttgg tccgtcgcag 
235381 acaagttccg aagcaatcgc ggcttcgacg aaggcagcat gtgccgtaac gaagccgacg 
235441 tcattgaggt tgtccgacaa ctgcgagaga acttcagcgt tcagattcaa aaatggggca 
235501 cgtcccacga ccagcaacaa gcgttgcaga agaactcagc gatcgatgag gtctactttc 
235561 ggtacgaaca acggtacgtc gatctatggc gcagaaaccc agagtgcatt cgcttggacc 
235621 gcttcgaaga cactgtcggt ggagtcagca ttgtgctgcc gctgacggcc gagtccttcc
235681 atgcattccg cgatggaaaa ttggcctggc tcgatatctc tgctgacgac ttagcggatc 
235741 aatcccagta cttgcttctc gattcggtca ccgaattcac caagcagtgt cgccgcccat 
235801 ggtaccaagt caccaaatcg ctcagcctca tcacgttcaa ccaagtcgca tcattggcgc
235861 agtcacccaa tcaatccgac ttcgagatgg tttcgttttc ggccagtccg ttgaacgaac 
235921 gcagattggg tacgatcggc ttcatcccag aaccaaccaa agaacccgaa ttcagttacc
235981 cgatctactg gttcggagaa gacccacgaa tactggccaa agaagaatac tccaactggg 
236041 ccacattcaa gcacttcgcg atgttgatca aatccgtcga caaggctggc cttcgccgtc 
236101 gaatgattcg aaacctactg tcgatggtca agcgactgca gcggcccgcc gagcgatcga 
236161 tttcgcgtca agctgcttga acgctgccgg attccacttt ggcttttcca tccaagacat 
236221 gccatcgtct gacatgaagt caaagtgttc aaccggcaaa caggcaactc cacgcggcaa 
236281 cgaagccatc gccggtaagc tcaaacaatt aatcgaaacc tcagcgttcg tggttcgtcc 
236341 gcgacgctgc tgcaacgcaa tggcaccgtc acgatcgctc aaatagccgg tgcaattcca 
236401 ccagtcattc ggttgccatt tctcgagcca tcgaagagca ccgtcaccat acaacgatgg
236461 gttgccgtgg attcgatagc cactgccgca tgattcgatg cgatcctggt tggccgttct 
236521 gatcccttgg aaccgaacgt gtttgttcaa tgaacaacca tcttcaaatg aatccaattc 
236581 caatcgtgga tcgatcaatt gatagatggg atgagaatcg gaaccacgat gcatcgacgc 
236641 catcagcgtg tgatccagct cagcgacccc gtaagcctgc aagaccgtga gccggcaacc 
236701 acgatttgcc aaccaagctt cgacacttct ttccaaagaa gccggcaagg ggtagccgcc
236761 tgttgcccag agcaactcgg ttggccaatc catttgagtt tccatggcct gcaataataa 
236821 gtcagctccc gccaaaggct gagcgatgaa caccacccgc cgattctctg ccgccgccaa
236881 acgcaaagca cgaatgcagg ttggtggatg cagataagtg atgacacgag tcatcgcttc
236941 catgccaccc aacgccgccg gataagcatc caccaccaat gactgagacg gatcgacaaa
237001 aaatggttga aggtccggga agatggtcgc cagacgtttc attttgtccg tcggtttctg 
237061 gctgtcgatg acgcgaaact taaaggggtc caacgaggtg gaaccagctg aataggaaag 
237121 gattttgccg ttcatgctac cgccgccaat ccaagtgagc cgacccgtgg tcgcgagcct
237181 gtttcgccaa acgaaccgca acacatctca acgagtgccg gctgatcgga ttgtttggca 
237241 agctctgcaa acatgtcctg gcaatacgaa cgccccctcg catccgccgc catcagcaac
237301 acgatgctct gcgttttcaa ttctggatgg ataaaatcaa agcccggcag gtaagtgaat
237361 cccatccgcc gatagaaagc tttcatacga acatcagcgt tcgcaattga gatccgcttc 
237421 ccgttggcca actgatgcgc ccatgaacct cgcaccagcg ttcgaagcgc cgcaatggga 
237481 gtgctcggcc cacgacgaat ccgcaatttg caagaagcaa ataattcgtc gccatgctgg 
237541 tccaacaaga ccgttgggta atggtcttcg caatccaacc ggccatttgc ggcggccaac 
237601 atcgtcaatg tgccaagcgg ctgaccttgc cacttcaaga tgtaatggtt gctgtattcg 
237661 tcgtagtggt agagggtttg ctcggatcca tcacgacgaa tttcttgaac cacatcgaac 



237781 aaaagttgca gactcctccc ctcagccgaa ggaagttcaa caacttgtgt cagtagcacc
237841 cctcggaagc attcagcaat gtgaaaatta gaaagtcttg atttgcgcat gcggtcgaac 
237901 gcaaaagcat atcgaatcag tgatcgccat aaagctcgcc gattgcgtga ttcgacaatc
237961 gaaatgctca tttgttttcg attcatcaca tcaaattgac gggatcgata tcgacgagga
238021 actcgatttc ttctttcgga tcgactttaa aatctgccaa ccctcgccgg atgacttcgc 
238081 cgacaaccgc cgcttcggtg gcttgcaaca acaagtggaa tcgatacttg ccactgattt
238141 tgacgatcgg tggtggtgcc ggtccaagaa tgcggacctc tgccttgagt agatcgcgtg 
238201 ctttctccaa gcggtcgaca atcgcatcgg ccaccgattc agttttgtct tccaggggac 
238261 cgcgaatgat gatccgagcc acactgccca gcggcgggta gttgaacttc ttgcggttga 
238321 ccatttcgtc ttcgacgaac ttcaaatagt cgtgcctcgc cgcggcttga atcgcggggt 
238381 gctccggtgt gaacgtttgc acgatcactc ggccaccgcg atcgccgcgt ccggttcgac
238441 cagcgacctg ggtcaccaac tggaacgttc gttcggcagc gcgaaagtca gggaagtgca
238501 acgccgagtc cgcgttgatc acgccgacca acaacacgtt gggaaagtcc aaccctttcg
238561 cgatcatctg cgtgccaagc aacacatcga tctcgcccgc acggaactcc gacagcaccc
238621 gttggtggct gccggcacgc ttcatcgtgt cactgtccat tcgagcgatg cgagcatccg
238681 ggaagcgtgc cttcgcttcc atctccaaac gctgcgtgcc aaggcctcca taacgcatcc 
238741 catcgaaccg gcacgcggga caccaaggcg gcgtcgggat cgtgtaatcg cagtaatggc 
238801 acatcgcctt gccgccatcg cggtggtgag tcagcggcat gtcacaatcc gggcaagcgc 
238861 acacggtgcc gcatgcggga cactggatcg tcgtcgcgta accacgacgg ttgagcaaca
238921 gaatcgcttg ccctttttct ttcagcgttt ccagcaccgc agcgtgcaac ggtcggctga
238981 tcgcgccgcc tttgccgcgt tcttccttga ctcgcaagtc caccagttgg acatcgggca 
239041 tcgggcgatt gccgacgcgc tccgacatcg tgaccagttc cgcatgcccg gtttgcgtcg
239101 catgccaagc ttccatcgac ggcgtcgccg aacccaacac caacggaatg ccaagcgcca 
239161 tcgcgcgagc atgggccacc ttgcgagcgt gataacgagg ctgcttgtcc tgcttgaatg
239221 aggtgtcgtg ttcctcgtcg atcacgatca gccccaggtt tggcagcggt gcaaacacgg
239281 cactgcgtgg cccgatgaca acttggacct caccccgacg aatccgctgc cagtgaaagt
239341 gacgctccga cgcggacatt tggctgtgca acaccgcgac gttctggaaa cgatcttcga
239401 atcgtccacg ggtctgtggt gtgaggctga tctcgggcac caacacgatc gccgatccgg
239461 cttgtttgac gacgtgttca atcgcctgga tgtagacctc cgttttgcca ctgccggtca
239521 cgccgtgcag caacaacgtc ctgccgcgtc cgctgtcgac ggccgaattg atacgagaca
239581 acgcgttctc ttgttgcgcc gtcaagtcat gcgtttttct ggtttcgcca tcgttggcct 
239641 gagcccgaat gcggatgttt tgactgagtt ctcgtcggac ctctggcacc aacaactctt
239701 tcttccgcaa ccgccgaatc gggtcttcgg tgcactcggc catgatcgcg atttccgccg
239761 cggtcatcgg tcgatcttga gcgatcaaga accgcatcgc cgactgttgc ttggatggca
239821 acttggcaat ttgctcttcg gtcagccccg gtgccggtcg gaagtaagtc gttttccgcg 
239881 tgcccgcgtt gtcgcgaacg ctggcgggga tcagcgtgtc aaagacttgg ccagcgggga
239941 cttggtagta atgagcgatg aacatcacca accgaaccag ggccgcatcg cacagcggtt 
240001 cgtcatcgat gacctcggac acgtcccgaa gttttttctg cgccgcgttg ccggttttga
240061 tcgacacgca ccaaccgggg gtcggctttt tacgatgccc cagcggcaca cccacacgca
240121 tcccgggacg caggacatcc agcaagtcat ccgggattcg gtagtcgtaa ggcccgtgcg
240181 gcgatcgagc gaacacgatc gacgcaagct gaacgtcttc gccaaccgtc aactcccacg
240241 ggggaggatc ggtttcaaac aattcgttct gggtcggttc ggacgaggac gagtcagcgg
240301 gcacgggtgc ggcggcaaaa ggaggaaagt tcagagtcgg atcgcccaga ttggacaacg
240361 atgaatcaga caacataacg tgtgagctcc ttttcttccc ggcgggcgca agataaccca 
240421 tccatcatcg cccaccccgt gacacgtctg gtcacacgcc ttaactgccc tgccctgccc 
240481 cgccccgaat cgccttctct ttctcggagc ttccaccccg agctactgac gcagactcct
240541 ccggagccac tcgtaccaat ccgcctcggt cggggtcgtc gtctcggaag ccaccgcgga
240601 gcggcaaaag acggtaaccg ggggtcgcga aacgcacacc cggaacgaag acgacctcac 
240661 caaactgccc cccacagggt caaagaacct gggcgtggaa acacgttgac cctcgttgtg
240721 tacgacaatt cgagtacgac aattcgcatg gcacaaaacc acgacctcca atcggggcag
240781 ctccaccatg ccatccgaat tccatcgagc cattacactg gctctctgaa agccatgcgt 
240841 tgacagaaca tcgccccagc ccaatgcatc cacggagccg ctcagtgccc acgattctga 
240901 gaaacattct ggcagtcgtc gcaggtctct tcctcggcag cgtgctcaac atggcgattg 
240961 tcaccatcgg tccgatcctg attccgctcc ccgacggcgt ggacatgtcg gacatggacc 
241021 aattcgccga gaacctcaaa cttctcaaac cagccaactt cttcgcacct tggctggctc 
241081 acgcgttcgg cacgctcgtc ggcgcattcg ttgccgccaa gatcgctgca agtcacaaga
241141 tgaagtttgc acttggcatt ggcgtgttct ttctactggg cggcatcacg atggccatga 
241201 cgttcggagg cccgctctgg ttcatcgttc tggacttggt cggagcgtac ttgccaatgg 
241261 gatacctggg tggaactctc gcgagagcaa cgcgaccgca acccacgtga cgtgagatac 
241321 atttcgacgg ccccatccgg ggcggctgga tggttgttga atcgtcctct cggggcttcg 
241381 accccgagct atgaacgcag gctcctccgg agccatcctg ctatttaacc gtgtggttcg 
241441 tgccgttcag ttcattcagg gcgcggttga tcacgcggac ggtgtacttg ccatcgacgt 
241501 agaactcttg ccttggctcg agtgcatctt gcatcgaatc gatgaccggc aacgctttct
241561 catcaatctc gtccagcacg atggcggctt gcagtcgctc ccactgggca ccgtccgcca 
241621 atgcctccgc cagcactgaa agggccttgc tcgtggcttc aggatcaccg gaatggcaaa 
241681 gcgcccgggc cgctgcgact cgtaccgtca cggattcgtc ggcaagccgt tcttgcagat
241741 ccatcaaata cgggagcttg tcgatctcgc cgcgagcgaa aacgttgccg atcccagtcg 
241801 cgccccaata acgcaccgcc gaatcgatgt catcgagtgc ggatcgcatt tctggcaacg 
241861 ctctaggacc ctcggatgcc gcgagagctg ctgcggcaac acgattcgca agctcactgt 
241921 cgtccgattg tctcaggacg gcatactgac ttcccagttc gctggcgcgt tcggcgagga
241981 tcggctctgg gatgagcccc aagtccttgg tgcgggtcac ccactgttcg tgcgccgcac 
242041 gcatgcgatg gagcacctcc gtgtggtcag aagacgacgc gagattatgg atctcatggg
242101 aatcattctc gaggtcataa agttcttcgg tgggtttcgt tccgcggaag aaaggcatgg 
242161 ctgcctctgg caacgtcccg gcctgttccg cctcgcgaat cgaccgcatc gtctgcccct
242221 tttcgggcgt gttcatgtat tgaaagtagg gcttcaatgg ttcgtagttg cggatgtact 



242341 caccgtacac gtagtcccgg tcggcagccg ctagcttgct caagggactg gtcagaaatg 
242401 gcttgccttg catgggctct ggaacatcca gcccggccag gttcaagacg gtgggaccaa 
242461 agtcgatcga gctcaccaag cggtcatcga cgaccccagc gacgttgtca gtggatcgga
242521 actgagccgg catgcggatg accaacggga tgtgcgttcc cgagtcgtag agccaacgct 
242581 tcgctcgcgg gagcccaacg ccatggtcgg accaaaagaa cacgatcgtg tcttcgtcca 
242641 atccgtcggc attgagttgg tcaagcaacc cgcccaccca atgatcgagc gctgtgatca 
242701 gttcgtagtt gcgtttccag tcttctcgag cggccggcgt gtctggatag tacggcggga 
242761 acgtcgagag tttggaagcg tcttgccgtt cacgttcgga gaggcctttg gtcacggact 
242821 tgtatttcgc atcgttctcg attccggact cgtggcagcc cgtgaagttg aagacagaga 
242881 aaaagggcgt gtcctggtcc ggtcgatccc gccagtgagc cttgctggat gattggtccc 
242941 atgaaccttt cggagtctcg aactgataat cctgcttgga gttgttggtg cagaagtatc 
243001 ccgattcacg caaatacgtt gagaacggtc gaatgctctc tgggagtttc gcttggcaac 
243061 gcatgtgctg cgtccccaaa gtcgtttggt acatcccggt gatgatcccg ctgcgacacg 
243121 gggcacaaac accggccgtg gtgaacgcgt tggagtatcg aatgccttca ctggcgagct 
243181 gatcgatgcg tggcgtgatg gcgtgcggat caccatagca accgatgtga ggactgatat 
243241 cttctgcgga gagccaaaga atgttggggc ggtcgccggc ttcaaccgtc tggaacaagg 
243301 acattgccgt caacaagagt gcgaccatca tcggcacacg tgcgtcttgg aatcgtcgaa 
243361 tggctgcctg ccacatggag ttggctcttt gtctcggagg gagggaatgc ggggttctgc 
243421 atcgtagcta cagcagcgcc ctcaagcgag cgagttgaag cggtcgaatc gcgaagaatc 
243481 atcccgcgaa ggaaacgctg acgaggctac gcaatccaag aacactgacg acgaccaccg 
243541 tgagcacgcc gagcgtgttg gcgatccagt ggttgcggtg cttgccgagc agcgccgcgt 
243601 tgttcatgac cgcgagcaag aagatggcta acagaggcaa caacaatccg ttcgcgatct 
243661 gagcaaacgt gatgatgtcc gtgggctttg aaccgctggc cgcgaacgag gtgccaaaaa
243721 caatcacggt ggtaaagaca gtccgtagcc gccaatcctt caagtcaatg ggccagccaa
243781 agcaaccggc ggcggcgtaa gcggcggcta aaggagcggt gatggtgctg gtcaatcctg 
243841 cagcgaacag ccccacaccg aacagccatc gtgcatggtt gccaagcaac ggttcaagct 
243901 gccgagcggc atcggcaagg ttcgtgaatc ccgtgttgct ctgaaagaaa gcggcggtcg 
243961 ccgttgccat cacagccgcg gtcaccaacc cgcccaaccc cacgctaagg atcgtgtcac 
244021 cgcggctgtg ctgaatcgcg tcgcggatgt tttcgtcact gacgggttcc ttcttatcgc
244081 tcgaccattt ctcagccgat gcggtggcgt gaaggaacaa gttgtacggc acgaccgtcg
244141 tgccgatgat cgcgaggact tctttcaatc cgccatcggg gatggtcggt tggatccatc 
244201 ccagcgcgat cgaccgccag tcgattggaa cactcagcgc ggtcaacaaa aacacgcagc 
244261 tcatcgtcac caccaacgca accaaaatcc gctgaagcga tcggtagtga ccgatcatca 
244321 ggatgcacca tgctgtcaag ccgatcacga tcgacacgag agtttgatgc tgcattcccg 
244381 tggcggctgc aacgcccacg gcggcaccgg cgatattgcc tgcttggtag gcagcgtttc 
244441 cgacgatgat cgccgagacg accaagacaa tggccaatcc ccgagccaac gcgttgggga 
244501 tggttggacg aatcgcttct cccaaaccgc gaccggtgac gattcccaat cgcgacgcca 
244561 tttcttgaaa gacgatcgtt gcgatcacag aaaaaccaat ggcccaaagc agcgtgtgac 
244621 caaagttggc acctgcggtt gtggctttgg tgacggtgcc cggaccgatg aacgctgccg 
244681 tgaccagcag tccgggaccg aaacgacgaa agaacttggg aagcttcatg cccgctaagg 
244741 tagcgaatga catcgacccc gtctcaatga actcgcatca ctcaagcagc ggttaccccg 
244801 tgatgccttc gaggtctgaa ctggttcata cttcggccac cacaatcgca aacaacgatg 
244861 attccaacgc gaaccaacgc gtttggaatt tttgcgcagc cgccctgatt ctccgaccgc 
244921 caatcaccca cccaagcgaa gtcctcatga ccactggatt gaaacccctc accgatgcgg 
244981 aggctcgaga ggtcgagttg gtttttcggc tgaccaaatc gcatctcggg ttcgtcccca 
245041 attcgatgcg aacgatggct cgccaacccg cgatcctgag tagcttcacc ttgatggtgg 
245101 gcaatatttt gggtcagccg tctgacgcga agtcgcccat ttggttgggc attcgattgg 
245161 tgatcaaaaa cgtgatttgg tcgcttcgga acatgcgatc aaaagatcgg ttgccgctgg 
245221 ctctcaagaa cctggtcgcc catgtcacca gcggtgctgc gggttgccgg tattgccagg 
245281 cccacacgat cggcgaagca cgcgatcaag gcgtgccgat cgagaaactg gaagccgttt 
245341 gggagtttga tcgcagcgac ttgttcgacg aagccgaaaa gtcggcgttg cggttcgctt
245401 tggcggccgg ttcgcatccc aacggcgtca ccgccgatca ctttgccgat cttcgaaagc
245461 actacaccga agaccagatt gtcgaacttg gggcaacaat cgctttgttc ggtttcctga 
245521 atcggtggaa cgacactttc gccacgacgt tggaacccga atccgccgcc tttgccaacc 
245581 agcatctgtc ggcgtctgga tgggaaatcg gcaagcacgg ctgagagcac gccctcgcac 
245641 ttcgaacgca ggcgaaacga tgggaagtgc tgattccatg gaacgaacct ccaaagcggg 
245701 ggcatgcgac tcgccgccat ggaagcactc aagagatttt tgcccctttc gtttacacta 
245761 cttcaatcgc aaggtcgtga ttgcggcgat gcgtccccga ctcgccgact ccctccccac
245821 ccaaccattc ggatttcttg aatgcgttcc ctccttggcc tgctactgac catcgctttc 
245881 gcggtcggtt ttgtagaaag agttgtttcg gccgaaccga gcgatacggc tcaaatcact 
245941 gttcccgcga ccaatcgtcc actctcgttc aatcgagaca tccgaccgat tctgacggac
246001 cgctgtttcg cttgtcacgg actcgatgca aacacggtcg aagctgggct gcgattggac
246061 gttcgcgaag cggcgttgga gggtggtgcg attgttcccg gggacgcgga atcgagcgag 
246121 atttttttgc ggatcacgtc cgatgatgac gacctcgtca tgcctccgtc ggaactgcat 
246181 aaacccctca gcgacgccga gatcgacttg gtacgtcgat ggatccaaga gggggcacca 
246241 tacgaacctc actgggccta cacccctttg ccccgcaatg gggtcccgtc ttccgtgaac 
246301 acaacatggg gagaaaactg gatcgatcag tttgtggcac gccgactgca agacgcctcg
246361 gtgaccccat cgccgcaagc ggacccgacc acattgattc gtcggttgtc atttgatttg 
246421 actggccttc caccgacccc cgaagaagtg gaagcgttca agcaagacaa cagcaacgaa 
246481 agctatgaac gactcatcga tcgcctgcta tcgtcaagtc gttttggcga acgcatggcg 
246541 atctattggc tggacttggt gcgatacgcc gacacggtgg gataccacgg cgatcaaaac
246601 gtttcacatt caccgtatcg tgactacgtg atcgacgcgt tcaacagcaa catgccgtac 
246661 gaccagttcg ttcgcgagca actcgccggt gacctgcttc ctaatcccac gatcgatcaa 
246721 ttggttgcct cgggatacaa ccgtttgaat caaacaacgg aagagggtgg cgcgcaaccc
246781 aaggaatacc tttcgatcta ctttgccgat cgtgttcgaa acgtgtccca ggttttcatg 



246901 gatttctatt ctctcggtgc gttcttcgct gacatcgaag aaaggggcaa gtattcacaa 
246961 cgtgaacggc ctccacagat cccagttccc aatgaagaac agcaggccca actcgatgcc 
247021 ttgcgactga aagtggcgga agccgaagcc aatctcgaat cgatcaccca aaagcaattg 
247081 gcgtttctgc ctgaatggga agctcaggcg attgaatcaa tcaaccaatc caaattccga 
247141 accgaagaac acacttgggt cgatgaccaa ttggcagcag gtggatccaa gtcgggcgaa
247201 tggaatcatg tgaccgccga tcaacatccg gtgcacagcg gagagacttc gcggcgacag 
247261 accggcaagg gagttgttca gcacctgttc actaacgcga aagaacccat cgaagtcaaa 
247321 gctggcacgg agttgtttgc gtgggtctac cttgatcctc agtcaccacc caaaacaatc
247381 atgttgcagt ggcacggcgg accagataat tcaagcggct gggaccaccg agcattctgg 
247441 ggagacgaca aggttgggtt tggtgccaag cctcaaagct ggcatggaca caaacgcatg 
247501 ggagatttgc ccgagactgg aaaatggatc cgcctgagtg tggcggccga cgacgtcgga 
247561 tttgcacctg gcaagacggt tcatggaatg gcgttcaccc agttcgatgg catcgtccac 
247621 tgggaccaag cgggctggaa tgaaacagac ggtttcccca gcgaaatccg agctgctctg 
247681 ctggctgatg tgaaagcgcg gacgaaaaat cagatcaaac aactgcgcga ctacttcgtc
247741 gagcatgaac cgttgatcac acagcaagcg aaagagttga agcgggctcg aacacaggtg
247801 tctgagtttg agaattccat tccaaccatg gtcatctcac gagccgttgc gccacgcgag
247861 atacgaatcc ttgaccgcgg aaattggatg gatgagtcag gggaaatcgt ccagcccgcc 
247921 atccccgaat tcttgggcac actccaaacc agcaatcgac gcgccacccg tttggacctg 
247981 gccaactggc tttgtgacag cgaaaacgtt ctgacagctc ggacgatggt gaaccgtctt 
248041 tggtacttga tgtttggacg agggatctgc tccagcgtcg atgacctggg tggacagggc 
248101 accttcccga gtcaccccga attgctggat cacttggccg tggaattcat cgaatccggt 
248161 tgggacattc agcatcttct aaaacaaatc gcgatgagcg ccacctatcg tcaagcctcc
248221 aatggttcgc ccgaattgaa tgcacttgat ccgttcaacg aattgttcgc gcggcagggg 
248281 cggtttcgag tcagtgccga aatggttcgc gacacggcgt tgatgatcag cggattgctg 
248341 gtcgaggaaa tcggtgggcc gagcgtgaaa ccgtatcagc ccgagggtta ctacagtcag 
248401 ctgaactttc cgagacgaaa ttacgtcccg gatcaaggcg acaaccaata tcgacgcggc
248461 gtttacacgc actggcaacg gactttcctg catccgatgc tgaaagcctt cgacgcaccc
248521 agccgcgaag aatgcaccgc gatccgggcg cgatccaaca ccccgttgca ggccctcaca 
248581 ctcctgaatg atccaacgtt catcgaagcg gcaacgatgt tcgccagccg aatcatgcgt 
248641 gagggtggca ccacggtcga tgaacgcatc gagtgggcgt atcgaaccgc ggtttcgagg
248701 ccgatcgatg cggtcgttgc ggaagagctt cgcaccatcc aaacggaaca cctccgacac
248761 tacgctcaag aacccgactt ggccaaggca ttggtttccg tcggagaaca gagttcgacg
248821 accgcattgg atccggccga atgggcggcc tggacgagcg tcgctcgagt catcctcaac 
248881 cttcacgaaa ccatcaaccg gtattgattc accatggcaa ttgtctccgc tatgaccagt 
248941 gaccatctcg cccggcgaac gttcttgaaa cgtgccggcg ttggatcggt tgcgctggcg
249001 tcgatgctga atgaacgcac gtcgctggca tcggaaggta ttcaccaccc acctcgcatc 
249061 aaacgcgtta tccatctctg catggcgggt ggcccaagcc acttggagac gtttgactac 
249121 aagcctgaac tggctcgtct cgacggtcag ccgatgccag agtcgctgac caagggccag 
249181 ccgattgcgc agttgcagaa caaggcccta cgagtcatgg ggcctcagca cgcgttcgcg 
249241 aaacacggca agagcggtct ggagatttca tccgtgttgc cgcatctgtc caagatggcg
249301 gatgacttgt gtgtggtgaa atccatgcac accgagcaga tcaaccatga tccggcccac 
249361 acctttttca acacgggcac cgcgatcagt ggtcgaccgt cgatggggtc ttgggttttg
249421 tatggactgg gcgccgaaac acaggacttg cctggtttca tcgttttgac cagcgaaggt 
249481 ggtggccaaa gtcaaccgat cagctcccga caatggcact caggttttct gcccagtcgc
249541 gtgcaagggg ttcaattgca tgcgtccgga aatccagtcc actacgtcgg caatcccgca 
249601 ggcgtcaatc actcgcaaca gcgagacatt gtcgatgccg tcgcgcggat caacatgcac 
249661 cgcaacgaaa cattgcaaga cccggaaatc gcaacacgca tcagcgcgta tgagatggct 
249721 ttccgaatgc aaacgtcggt tcctgagttg accgacatga ccgatgaacc cgaccacatc 
249781 gtaaagatgt atggctgcac tcctggtgat ggaaccttcg gcagcaactg cttgatggcg 
249841 cgtcgtttgg ctgaacgcgg cgtgcgtttc atccagctct accatcgcgg ttgggatcac 
249901 cacggcggtg tgaagcgtgg caccgaaaag acgggttcac tcgtcgacca aggctccgcc
249961 gcgctgattc aagacttgaa acagcgaggc atgctggacg acaccctgat cgtctggggc 
250021 ggtgaatttg gtcggacacc gatggcccaa ggcgatggac gtgaccacca catcaagggc 
250081 ttctcaattt gtttggccgg cggcggaatc aagggtggga ccagctacgg cgcgacggat 
250141 gagttcggct acaacgcggt gaaggatcct gtccacgtcc gtgatttcca cgcaacgatg 
250201 ttgcacatgc tcgggatcga ccacaaccgc ttcacgtaca agttccaagg cttggatttc 
250261 aaactgaccg gcgtcgaaga agcccatgtg gtcgacggta tcctgagtta gccgctacgg 
250321 cgccaagatg aactcaatga tcgagttttc cggaccaatg tcgacagtgc gactttgtgg 
250381 tgcgaggtcg cctgcttgca cgtcgatgcg gacggactca aaggtcggta aaagtggcac 
250441 cgaagcaatc ccagactgat tcgtcgctga tttccaaaag ccgttaacgc ttccggcaag 
250501 atccgttgcc gcgatgtccg ccgacacagg atcgccgttg gtatccatgc accgaactgt 
250561 cagtggcgta accggctgca gctcaaaagt aacttcgttg tcctccgttg acgcagtgag 
250621 agaacgaacc cattgctcca aaggaggatt ggacgtgggt tcggcgggat tcgcgtggag 
250681 tcgatcgtag gccgtttggt acccgaggta tcgccccgct gcaaacactt tgacgtcttc 
250741 gattggaacg accagctcgg cgtgaccttc gtcgtcggtc agtgtgtgcg gaaagtggcc 
250801 gtctaaatcc tcaggcccac gaattggcct tcgacgctcg tcacaccagt cgacgtacac 
250861 gctgggaagc ggccgcttcg tctcagcgtc cacgaccttc accttcacaa cccgatagtc 
250921 gcgtgagtca acggtgatgt catctttgga tcgccctcga acccaatcag attgcattcc 
250981 ccatcgacga tccgtcgcac caaaaacttc tggaaccaat tggaggcgtt cctcacgaag 
251041 gccacccagt tgctcaatgg tttcgtttgt ttgcatgacc acaccgtttt caacgaacgg 
251101 atgcacctcc ccagacccac gcgaattcga cagtggaatc aatggatcgc gtactccggc 
251161 gggtgacgtt gcgatgcgaa gcatcgtggt cgactgggac gcgaaggaga acaattcatt 
251221 tgttggagaa aaacaattga agactgccaa gcgattgggc ggcaagccac cgatcgttac 
251281 ctcgcctgct tcattggctt gcatcaatcc gggaagctca ctccccatca agttcatgtc 
251341 atagaaatcc aatgagttga agctttggct cgtctcacgt ccatcaacga gcaccgccag 



251461 gtagggttcc aacagaatcg ttttgtcatc gaatttcaaa tctccggggg gatgcacacg 
251521 tgttccaaca cagtgatccg gagatagaat caaggtttcc caggcggtcg attgattcca 
251581 tcctcgccat ggggttggga tggatctcaa atcatacgca ccgttctgat ctgtccacga 
251641 ataagcgaac gtggcttgct gataggattg aaaacctctc atccaagtcg tgttggtccg 
251701 ctgacgcaac agaaccaacg cgccggccac cggtgtttga tccttcgcga gcacttgacc 
251761 gcgaacatca atccgttcgc ctcgatgaat tttctgtcgg cgttcgattt catctgcaaa 
251821 tgaatcagag aggcgaatcc agggatgagc ctgtctgatc atagcgattc gttgcgacca 
251881 atttggttct tctgttggat cagtcgcgaa cgaaggtgcg aatgaaaaca aaagcaagag 
251941 cgttgccgcc gagatcgacc aaccgatgac tcgtcgacga ggattcgtgg gattgcgact 
252001 cttgtccaaa tccaggactt caaaaacacg ctcgcgaaca ctgtgttccg agacagcggc 
252061 cgacaggtgt gcgacggttg aatgagcggt caaaccgcgt gcgattcgaa tcaaattctc 
252121 ggcataattg tccgccgaaa agcccgcgtt caaaaccgca tcgtccgcag caaactcggc 
252181 cgtccgacga agttggcgac gagcatggtg tgccagcgga tgaaaccaat acaacacgac 
252241 caacagattg gcaaggaatt gccaccccac atcatgtcgg agcaaatgac aaagctcgtg
252301 ctgcaagatg gcggtttgtt catcggacga ccaacgctca aaatcgtgcg gcagcaagat
252361 ggttggcttc caacacggca ccacgcacgg aacctcaacg tctcgtgaat gaaaacatcg 
252421 agcccgactc agcgttgtct tcattggcaa gcgaatcaat tggaacatcg cctttggatc 

252481 aggaacctcc gtcgctgcgc ggaccatgga ccataccttg acgtagccta tcaacattcg
252541 gatcaacaga gcactggcaa tcgcgaacca aacccatccc acagccatca ccagattcga 
252601 gattgtttcc gcgatcgccc atggcctcac aaacatttcg ggcgatgatt ccaacgcggc 
252661 tgaattcagc gcagacacgg gcatggaccg atcttttccg acgaacgaac ggtcgacatt 
252721 cgtggagtct gaaacatcgc caccagccat cgctgtgctg aattcggaat tgagtccggt 
252781 ggcaactgac acgtcatccg cactgatcgc cgagggcaaa tttgtgccca agctcgatgg 
252841 tgcgaatccc caaagtgcca tcgaagacgg tcgaagcgtc aaagcaatcg gcagaatcaa 

252901 catcagaacg atgcccattc tccaaagatc attgacgacg gtcgccgaaa aatgcgatcg
252961 cttccgacca gcgatcattg cggggctcat caccaccaac gaaacgatac tggaaatcac 

253021 caaagcggcc caaatcaagt tcaagctgtt catgacgttg gcctttttgg ttttgcctga
253081 ctcttcttgg aaatttgcgt tcgcgcacgc tgcacctctt cgatcatctg gtcgacctcg 
253141 tcctcggtga gcggtcgctc gctcgcatcc aaaaacgcgg ccagagcttc ggcgggagaa 
253201 ccttggaaga agttctgcag gacctgcttg gccatggatc gtttggcagc aacctgtttt
253261 tgcgtgcagc ggtaaaccag tcgcggtccg tcttgttccg acaacaacca gccctggtcg
253321 accaaacgat tcaggatcgc tcgcaccgtc gaatagctcg gcggatcttc cagaccatca 
253381 cggacgcttg tcacgctggc gttccccaac cgatggacgg tctcatagat ttgtcgctga 
253441 cgtttcccaa ggcttatcat cgttggatcg ggcatcgtga ttctctacag gtgaataggc 
253501 tgtctccgag gcttggtgct acatttgtag caccatacat ttgagagtca aggactttcg 
253561 ggggacacat gccggcgatg gcggattttg atcgtgcatt tcggaggtgc tgatgctcgt 
253621 gacaaaccgg ttcggttccc gacgcttgac cgttttccac gaagcaaaac ccaggtcata 
253681 cgcgaccatc gcaacacaga ctcaacgtcc aggacgtttg gagtattcgc gtccgcaatg 
253741 gataattccc attcgcgggc gccagtcctg tacactaacc agccttgtct tgcatgccat 
253801 tcggcgtgcc acgttgccgc caacatgaag tgggtgcatg gttcccgatc aaatccaagg 
253861 ggccgcgaaa ggttcgtttc gcggtcgacg cattggtttg atgttcacga acatggtttg 

253921 catgctcctg gtttgcatgc ttttcgcttc ccatcgagcg tttgctcaag acacttcgaa
253981 tgctgatggg ctgaaaggtc tcgacctgga cgggcggtta attcggctgg gccaatccaa 
254041 cgagaccaag gccgtcgcgg tcgtcttctt gtccacgcat tgtccgatca gcaacggtta 
254101 tctgccgttg ctcaatggct acgcatcgga atacgggaca caaggcatcg aattgatagg 
254161 cgtgatttcc gatccgtcgg tgacacgttc ggacgcgaaa aagcacagcg ttgattatcg 
254221 agttgagttc ccggtgctgt ttgacggttc gggcgaattg cgtttggctc tctcgccaac 
254281 ccatacgccc caagcgatgt tgattcggcc gaccggcgag atgctttaca acggtgcgat 
254341 tgacgatcgg catgtccaag ttggtcgcaa gaaggatcac gccgctaatt cctatttgcg 
254401 agatgcgatc cgcgcagtca tcgctcgacg cgcggtggat gtgccgcgga ccaaacccat
254461 cgggtgtttg ctggaggatc caccaaacaa agcgatcgaa ggcgaagtga ccttcgcacg 
254521 cgatgtcgcg ccggtcattc aagcaaactg cgcggggtgt cattgccaaa acggttcggc
254581 tccgtttccg ctgttgactt acaacgacgt cagtggtcat gccaaccaaa tactagaggt 
254641 gacgcattcg cgtttcatgc ccccatggaa accggccgcc ggattcacgc ggttccaaga 
254701 tgaactgcga ttgaccaacc acgagatgtc tttgttgaac gtttgggttc gcgatggcaa 
254761 accggcgggc gacccgaagg atcttccggc ccctttgccg ccatcaaatg gttggccact 
254821 gggtgaaccg gatatgattt tggaaatgag cgagacgttc tcgattccat ccagtggtcc 
254881 cgacgttcgg cagtacttcg tcatcccaac tggtttgaaa gaacatcgct tggtcaacgc 

254941 gatcgacttt catccgggga cacctcaatc tgttcatcac gccagtttct ttttggacac
255001 ccaacgaaag gggcggcgac tggaccaatc ggatccggcg ccgggctact ccagctttgg 
255061 cgggccgcgc tttgaacccg aaggaacgct gagcagttgg ttcccgggaa tgagcccgcg 
255121 accattgccc gaaggaatgg gacgtttggt tcccaaaggc agcgacatcg tggcggaaat 
255181 tcactacgtc acgaccggca aaccgcatca agacaagtca aaaatcggcc tgtactttgc 
255241 gccgccgtcg gccagaaaaa tggtcaccga aatacaggtt ggcaacaaac gaattcgcat 
255301 cccgccgggg gcggagcatc acttgcaacg cgcgacgtac accttgcccg tcgacaccgt 
255361 gttgctggac acggtgccgc acatgcatct ctttggccgg gaaatgaaag tttggtcgaa 
255421 atcacctgac ggcgagacca agccattgtt gtggatcaaa gattgggatt tcaattggca 
255481 aggcaagtac tcgtttgccg aacctgtgcg attgaaaaag ggaacaatca ttcacgtgga 
255541 tgcttggtat gacaattcag cgaacaaccc gttgaacccg aactcgccac ccaaaccggt 
255601 ttcgtggggc gccgactcaa ccgacgaaat gttgatctgt cacttccaat gcacctgcga 
255661 aacgatgggg gaactgaacg agttgattcg gcaccaaaag aaacacatcg cggatcgaca
255721 ataatcgtct cggggcttcg ttgacgcatt gcgcgtgaga caacctgccc gcttcgcttt 
255781 tgcgaagtta tgcttgtctc gattcaagct agaaaccttc gaaggccgtt ccgtgtcagc 
255841 atctgaaacc acccctcaat cgaatcacgg catcggaatt ctgggtggat cgttcgaccc 
255901 ggttcatgtc gggcatctat ggatggccga atcggctctc gaacaattgc cgattgaaca 



256021 cgagcaccgt ctgcaaatgc tgcggttggc actcagcgga caatccgggc ttgtcatcga 
256081 cgactgggaa cttcggcaag acagcgtcag ctacaccctg ctgacgctgg aatatctgca 
256141 ggaacagttt cctgaccgcc cgctgtacct gatcatcgga gccgattcac tggcctcctt 
256201 cgaccgctgg cgagaacccg agcaaattct gaaacggtgc cacctggccg tgatcgctcg 
256261 cggcggggac cctccgccgg actactcgat cctggacgga atgaccgacg aaacgcagat 
256321 ccaaaggatt cgcgagtctc agatccagat gccccagatc gaaatcagca gcagcgacct 
256381 tcggaatcgc atcgcgaccg gacggagtat ccgattccgc gtgccacacc cggtcgcgac 
256441 gttgatcgac aatgaaaaga tgtaccgggt acgatagcaa agttgcctgc gattccccgc 
256501 gtgtcgaaac tgagatctgc tatgtcgacc aatccgatcc cactgctgtc cgaacaggaa 
256561 taccttgacc aagaacgaaa ggcttcgttc aagagcgagt acttcggcgg cgaaaccttt 
256621 gcaatgggcg gcgcgaagcg agcacacaac ttgatcgtga caaacctaat ccgggttcta 
256681 agcaacgcgc tgctcgatgg tccctgcgaa gtttacccct gcgacatgcg ggtccagatt 
256741 gaagccactg gtttgtacac ctaccccgat gtggtcgtgg cttgcgacga acccaaattt 
256801 cgcgacgacc aactggacac cctgcaaaat ccgactttgt tgatcgaagt cgcttcggaa 
256861 agcacagagg catacgaccg aggcaccaag tcgagacact accgacgcat tccatctttg 
256921 aaaggatacg tgctcgtcgc gcaagatcga ccagcagttg aatcattcac ccgtggagac 
256981 gaccacacat ggacgcttcg tgaagcgggg ggactggaga acgaattgcg gatcgacgcc 
257041 ttgaagctga acctgtcgat gacagaagtt tatcaccggg tccagttcga tcgccccgat 
257101 ccgttggtga aatgactcag agctcaatgc gcccctcgcc agcgagtctg ctctgaacgg 
257161 agcatccggg ttcaggttcc gcatcctgtg cggacactga tcgcccaaga aaagctttat 
257221 cgggcagaat aggcaaaagg gattgcgtgg aaacgccgat ctcggcacac tgtgtggccc 
257281 gcgatcgcga cctttttgcg ttcacgggac accattggcg agccggagac gttggttttc 
257341 cggccgaatc agtccattcg ccttcgatat tttcgaaaag cttcatgcct ccacgtccaa 
257401 tgagcacgcg tagccgtgcc cgcaaacgct cccgagtccg cagccgcacg cgtcgcaagg
257461 accccatctt cgtcgatggc catcgtccac gtccgatgta cgttgactac aaggacctcg 
257521 aattgctgtc caagatggtc aaccgtcaag gacgcatcat gggccgtcgc aaaagtggtt 
257581 gtaccgctgc cagccaacac gctgtcacgg ctgcgatcaa acgagcccgc ttcatggcct 
257641 tgttgccata cgtcggcgaa taagccaaat cggcaaatca gtgaacgatt ctgcctctga 
257701 cggagccccc acccagcggg ctccggtcta ccagacccaa cgccgcgtcg agtttcgaga 
257761 caccgatgcg gctggaatcg tccacttctc ggccttcttc ccgatgatgg aagcggccga 
257821 gcatgagttc ctgagatcgg tcggcatccc tgtgatgccg caccatgaat ccgaggaacg 
257881 cttgacttgg ccccgagtcg ccgccagttc cgactttcat gggcccgccc gattcgaaga 
257941 cattttgcaa atcgatttgc acgtcgaccg aatcggccag tccagcatca gctatcgctt 
258001 cgaactgacc tgcgaaggcc gccgcatcgc caccggcaaa atcaccgccg tgtgttgcga 
258061 cctgaaagcg ggcggcaaac tggtcaaaac caacgtgccc gatgacctgc gtcagcgatt 
258121 gtctggccct caaaacgact gaacaaacac atgcccgcaa ctttcgaagg cttcggcctg 
258181 aaactgctgt acccggacaa ctggaagctg gtcgaacgag ccgaagacga aggcgaccag 
258241 ggagccacgt tcgatctgcc cggcggcgga tttgtgtcgc tggaaacgtt gccgatcacc 
258301 cgagaagacg aagtcgtcct cggtgaaatc gaccaaatgc tccgaggcca atacgaagac 
258361 ctggaacgtg acaacgtcac actgccgggt gccacggaag gcgaacgcgc cgttgacctc
258421 cggttctact acttggacct cgtcgtgatg tcgcgcgtgg tgctgctgga cgcccctgcc 
258481 ccgactcgcg aacaaatgcc catcgcggga cgcatcttgg ctcaactcca agccgaaatg 
258541 tcggatttcg aagccgccga acaggtcttc aacgcgatcc tgcaacagat tcgaatggcc 
258601 gaacttcccg aagaagctta atcggcgaag acggaacgaa ggtgaacgcg accgcggcca 
258661 atccaatcgc cgcaaatcga ctcagaccaa gtcttgcttg cgttttttgc gtcggttctt 
258721 cccgcattcg cggcacacac cgctgaccag catccggtgc ccagaaacgc gaaaaccgtg 
258781 tttcgcggcc gcttcgtctc gcgtcgcaat cagttcctcg cttcggaact cgatcaattt 
258841 gcggcaacgg gtgcagtaca ggtgatcgtg gggcgggtag ccgtaatcca gttcgtaaac 
258901 cgtgcggcca tcgagctgaa agcaattcag caatccagcg tccacaaact cccgcagcga 
258961 ccgatacacc gtcgctgaac tgacgtagtt gggctcgccc ttgcgaggca acttctcgat
259021 cagctcatcg gcatcaaagt gggcgtgctg agagaagatc ttctcaacga gaaacttacg
259081 aggcttcgtc tgccgctggc ctttgctctg caaatactcc tcaaaacgct cctgaggggt 
259141 caaagccacc tcgagacgtt cgagagattc gggaaccgaa gacacgtcag acaaaacgga 
259201 ccgctgagaa agagagaaga aatgcttgca agccactata gcagccccgt ccagatttgt 
259261 cccgagcgga gccctgcgaa accgacgcag aatcaccagg actcgcgaaa aacgcggatc 
259321 aacggggccg atgtctcggt ctccgtccgc aggactgggt ggatcacgct gcaacgagac 
259381 aatcgacgga aatcgtcatt tccgctctcg acttttggat tcgcggtcga tatcttccgc 
259441 agccgattga aaaacagtga gcgtgcaatc acatttcagt gatgcgaacc gactcactgc 
259501 ggttcaatca gcacgacacc aggtctcgcc catcacgggc gcggcggaag tgtgttggca 
259561 ggacgtttcc tagcggaatc gcccactcca aacgcacaac gttgccatac cgcaaggtgc 
259621 cgcaacctcc attgcgatct ccccaaggtc aacacgttat tgcgcccagg acgcggctcg 
259681 ataacagcga agaagaagtc ggccagggtt ccatcggaac cctcagacga agccaccggg 
259741 cttcgaagtt ttgactgtat ttcgtttctc aactgacctt ggtccccaat tggaggcacc 
259801 ccagccgcat gatcgcatcg accattctgc gcgcacccaa cgcccgaacg gctcagccga 
259861 acctttcctt cgacctcgcc aaagagctcg ctgaaaaaca cggcacacct ttgcttgtcg 
259921 tatcacgatc caaagccatc gaaacctact gggcgatgaa gaacgccctg ccaggtgttg 
259981 atctttacta tgcggcgaaa gccaacccgg acctgcattt cctgtcgacg ctcaacggcg 
260041 aaaatgcatt catcgacgtc tgctcgcctg gcgaattgaa ggcagcactc gccgctggtt 
260101 tcacgccaga ccgcatgctg cacacgcacc catgcaagac cgatgcaaac ctttgggaat 
260161 gctatgaagc cggcgtgaac tggttcgtct tcgataaccc aatcgaagcg gaaaagattc 
260221 gccgactgac tcccgatgtg aacttgttgt tgcgtttggc aacgaccgga gcctccagcc 
260281 gaatcaacct gtcggcgaag ttcggttgcc ccatgcacga agcgctggaa ctattggcaa 
260341 cggcaaaagc caaaggcctg aaagtccgcg ggttctcgtt ccatgtcggc agtcagtgcc 
260401 tcaacccaca agactatgtc gaagtgctcc gcagcgttcg cgaagtctgg gacgaagcca 
260461 ccaaagccgg acaccacttg gaagtcctcg acatcggtgg cggtttcccc gctccgtatc 



260581 tcttcggcga cttgccgatt cgtttgatcg ccgaaccagg ccgcgggttg tgcaccgaaa 
260641 gcgtgaccct gatcactcgc gtgatcggga agagccgtcg atgggaattg ccttggttct 
260701 tcctcgacga tggcatctac ggatcgttct cgggcaagat cttcgaccac acggacttcc 
260761 cgttgctggt cgaaaacgat ggctcacgag agaccgttcc ttgcgtggtc gccggtccaa 
260821 cctgcgactc caccgatatc gtttcgcgtg atcagtggct tcctgatttg gaagtcggtg 
260881 aattggtctt ggttccatcg atgggtgctt acgctgccgc cagcgcatcg cctttcaatg 
260941 gattgccgat ggccaacagc gtcgcaatcg actgacggac gacacgccga tgggattgat 
261001 cgccgccgtc ctggcctatt tgcttctcac gatcgcgatc gggttgctcg cagcccgacg 
261061 cgtcggcaat gcgcaggact tcatggtcgc gggacgctcg cttccgcttt acatgaactt 
261121 cgcttgcgtc ttcgcaactt ggtttggtgc cgagaccgtg ctttcggttt cggccacctt 
261181 tgccgggcaa ggactgcgtg ccatcccggg cgacccgttt gggttttcga tctgcttggt 
261241 gctcgttgcg ttgttctttg cacgagcctt ctatcggatg gacttgttga cgatcggcga 
261301 tttctatcgc aaacgctacg gacgctcgat cgaggttttg acgtccgttg tgatttccgc 
261361 atcgtatctg ggctgggccg cagctcagct caccgcactt ggtttggtga tttcggtcct
261421 tggcaaaggc atcggctacg aaacattgac catcaacaac ggcatcgtga tcggtttcac
261481 catcgtcgcc ttttacacgg tgatgggcgg catgtggtcg gtggctttga ccgacatgat
261541 tcaaacgttc gtgatcatca tcggtctgtt ggtggtgtcg gtgtacatgg ctcatgcggc 
261601 cggcggtgta tccgttgtca tcgagtccgc tcgcgaatcg gggcgtctgc aggtctttcc 
261661 cgactggggg caatccggtc agtggtggat ctacattggc ggattcctga ccgcggccct 
261721 gggttcgatc ccgcaacagg atgtctttca gcgtgtcacc agcgcgaaag acgaacgcac 
261781 cgccatgacg ggaaccttgc tgggtgggat gttctactgc atgttcgcat ttgtcccgat 
261841 gttcatcgcg tacgcggcgg tcgtgatcga tccggatcac ttgcaacaat tcaacagcga 
261901 cgacctccgc gaagtccagc gaacacttcc gcacgcggtc atacagtcca ccccgttctg 
261961 ggtgcaaacg gtttttctag gtgctctggt ctcggcgatt ctatctaccg ccagtggcac 
262021 gttgctcgca ccgagcagcc tgatcgtcga aaacgtcatc cgtccgtttc ggtccgacct 
262081 cgacgacaaa aacatgttgc ggtggctgcg aatcgtcttg ctgatgtttg gtgcattggc 
262141 gttgcaccaa gccttgacca gcaacaacac gatgtacgaa atgatccagc aggcgtacag 
262201 cgtgccgctc gttggtgccc tggttccctt ggccgttggg ctctattgga aacgagcgac 
262261 cactcgggga gcgatggcgt cgatcgtgtc cggcgtggca acctggttgg cgttcgaata 
262321 catgttgccc gaattcctga tcccctcgca actgatgggg ctcgccgcca gctttctcgc 
262381 aatggtggtc gtttcgctgc tggacaaatt tgaaaacatt cggcctgatg ctgtgcagcg 
262441 ttgacttgct gatcgcgtgt gttcgccatt tttgcgaacg ggttcacgac tcgctttgtt 
262501 ccaaagcatc ggcgggaacg tcaggtacaa cctgacctac ggtttagggg acttcaccgc 
262561 tcgtgctgct gcacgatatc gagatggact tctcgcattg gcaaccagaa gtcgttcttc 
262621 atcttcgggt cgttggaacg cacgacgttc aaactgattt gcagtttttt gagtgccgtg 
262681 tcgaattggc ttttcgtgca cgccaatcgc tcgatcgcac gcttgcgaag cggcccggtg 
262741 taatccgctt ccaaacgaat cagctcaaag atttcttggc agaggacatc caattcgtgg 
262801 acgggctggt acgcctcggc ataatgcact tcacgaaagt gctgcatttc catcaacacc 
262861 gcgtccccgc cgcgcacctt cccgtagaaa acttccgcgg gataggtttg cgggatgcga 
262921 gtcttccaat cccagacggc cgtcaccttg gggctccaac cgcccgcctt gggtttgtct 
262981 tccgacaaat cggtgttgtc ccacaacgac gggtacggcg agttcttgct gccgaagacg 

263041 gtgcagacct tgctcttcag cacaaactta taagcctgtc caaacgtctt gatcatggga
263101 tcgcctctcg aatcaatcga cgcgctgcgt gatctcgatc aagtggtagc cgaactgagt 
263161 tttgacgggg ccgtgaacct tgcccagttc accgctgaat accacttcgt caaactcttt 
263221 aaccatttgg ccagggctga acgttcccaa cgctccaccg gacttgccgg acgggcagga 
263281 ggaaaactcc gcggcgatgg caccgaaatc ctgtcctttt tcgatctgat ctttcagatc 
263341 ctgacaggct tcttcggttt caacgagaat atgtcgtgca cttgcagtgg ccatgatgtc 
263401 tcttgggttg tgttccgtcg tggggtggta aggggcccgg cggaggtcga aagtcgctcg 
263461 ccagcttcgg ccaccatggt ggcttgcccg ctgcagttcg aaaacccctg ctgaagacgc 
263521 ctttcagggc gaaccgtttg tcggacgacg gaatcttcgc cgaagtgcgt tcgcaccagt 
263581 ttccacgacg ccaccgggcc taacggccat cggctcattg ctgcagcttg ataaagtggt 
263641 gtgccccgct gctgcttccc gcctgcttgc agcatggttt cgatatggga taggttcgaa 
263701 tgccgcgaga ccatccatcg gagcgaaaga gtggcgggct cgatgaaatc ccgtcccaag 
263761 cggattcaac gcagcctctt tcccgactcg cgaaaagatt ccttggttcc tgcccacgcc 
263821 gtctcacccg atcgacacca tgatttcccg acttgttttt caaccactcc tgatcagcct 
263881 tgtcacactc ctgacctcgc agggactctc ctccgcgaaa gagtggtccg tgccggtggc 
263941 gggcaacgcg ttccgcactc agcctgaacc cggcggtcgg tcgatgcgac gcggtggctc 
264001 cctacgactc cgcgattcga gcgaagtcca ctccgtctac tttcacgtcg atgacgctca 

264061 cacgctcgag ctaggactga aaggcaacgc gagcgatcac cccgcgacaa tcctcgcgac
264121 cgtcgaggaa accaagctct cctgtgatct gcaaccagga tccgccaaga cctacccgct 
264181 cggcgaaatt caggtcgaga atgcgggcta cgttcgggtc gacctgcaac tcgcgccggc 
264241 agccgggggc aacccttctg gatccacaca aatcacggat ctggtggtga catccgacga 
264301 cgccgacctg caactcgact atgtacaaac gaacgatggc aacatgttct attggggacg
264361 ccgaggtcca tcagtccacc tctcctaccg attaccacga ggcacggatc tgacctacgc 
264421 ctacagcgag atcatggtcc ccgtcggcga agaccaaatg ggcacctact tcatggccaa 
264481 cggttttggc gaaggctact tcggcatgca ggtcaacagc cccaccgaac gcaaagtcct 
264541 gttctccatc tggagtcctt tccgcacgga caaccccaac gagattcctg acgaagaccg
264601 agtcgagacg cttgcaaaag gcgacggtgt gatcgccaag gactttggca acgaaggatc
264661 cggcgggcaa agcttctttg tctacccatg ggaagccggc aagacatacc gtttcctgac 
264721 cgaagtcaaa cccgatggcg atggcaacac gctgtacaca tcttggttcg gtgataaggc
264781 gaagaacgaa tggcgtttgg tagcgacctt ccgacgcccc aagacggaca agcacctcac
264841 cggcttccac tccttcctcg aaaacttctc agcagaacaa ggccatctgg aacgatcggc 
264901 tcactacggc aaccaatggg tctgcgatgt gaacggcaag tggcacgaaa tcacgcgagc
264961 caaattcacg ggtgacaaca ctgctcgcgg tcgtcatcga ctggactatg ccggcggaag 
265021 ccaagacgaa cactttttct tgaagaactg tggcttcttc tcggagaccg ttgatctgga 



265141 gccacgctga tccatcgcaa cgccactctc gtagaggacg acgcaaagaa tcccacgcat 
265201 gggtcgtgcg accggactcg cggcctcgtc cactaccatt caaacaagaa tcactcatgg 
265261 atgaaatcgt ctatcacgac cgatacaccg gcgaagaccg catcgaaaaa gtgtatggcg 
265321 acaaggcatt gcgatggacg tacggcacca ttgccggacg tctttcgctt tcgctggtcg 
265381 tgaaacgaac gttgttctcg cactggtacg gttggcgaat ggaccaaccg aaaacacgcg 
265441 agaagatcgc tccgttcatc aaggagtatg aactggactc ggatgagttt gtccgcgacg 
265501 tggacgagtt tgccaatttc aacgagttct tttttcgaaa gctgaaaccg gaagctcgtc 
265561 cgattgatgc ggctgccgac tcggtggtct tccccgccga cggccgtcac ctttgcattc 
265621 ccgacctatc aaaatgcgaa gggctgtttg tcaaaggcga gatgtttgat ctgccaacgc 
265681 tcctgcaaga ttccgccctc gccgaccgct atgcgagcgg cagcctgctg ctttcgcggt 
265741 tgtgcccggt cgactatcac cgctttcact ttcccgccgc cggtgtcccc ggtgagagcc 
265801 gtctgatcaa cggtccactc tattccgtta atccaattgc cttgcgtcag aacattcaca 
265861 ttctgaccag caacaaacgc tgcctcacgc agctcgaaac ggaatctttc ggaacagtgc 
265921 tattgcttga aatcggcgcc acctgcgtgg gcagcatcca acagagttac tcgcccggtg 
265981 aaaccatctc caagggcgac gaaaaaggct acttccgatt cggtggttcc tcgaccatgg 
266041 ttttgttcga acccggccgc atccaattcg acgcggacct cattgaaaac tctcgccaac 
266101 atcgcgaact ctacgctcgc atgggcgacc atttggggaa tcgacctgcc aattccatct 
266161 gagccaaatt ttcaattttg atgtcccgat tcgtgttctc ggttcgtcac cctcacaaat 
266221 gacgtggcta tgttgccact gtcgaacccg aaaccttcgt ttgaaagaac accgatgcca 
266281 aaattcatgt ttgtttatcg cggcggctgc gacgagatgg aaaacgcctc tccggacgag 
266341 atgcaaaaag tcatgcaacg gtggatggac tggatccaag acgggatgaa agccggttgg 
266401 ttgcttgatg gtggcgacgg actgaaaccc gaaggcgcaa ccgtcaatcc ggatctgtcc
266461 gtcacggatg ggccgttcac cgagtcgaaa gaattggttg gtggctactc gatggttcaa 
266521 gccgagaatc tcgccgccgc agtggaactg gccaagacgt caccgatcat ccatcacaaa 
266581 ggtgccatcg aagttcgcga attggcgaat gtgggccagc ccgaagaagc atgaccgatt 
266641 caagcggcca tttggtcgag cacttttttc ggcacgagtc cgcccatttg attgcggtgc 
266701 tgactcgtgc ctttgggatc cgtcgaatca acttgatcga agacatggtt caggtcgcca 
266761 tgctagaggc gatgaacgcg tggaagcaag gcggagtgcc tgacaatcca actgcgtgga 
266821 tccatcgcgc caccaagaac cgaatcctgg atgcattgcg ccgcgaaaag acgttggcac 
266881 aagcgatctc tttggcggga ccatcgctgg acgtccaaga ggacaccgtg gatcactggc
266941 taagcgaaga acaattgccc gacagcttgc tgcggatgat gtttgtttgc tgtcacccaa
267001 ctctgaaccg tgagacgcag atcgcgctga cgctgaaaac actttgtggc tttgggtacg 
267061 ccgaaatcgc tgccggattg ttgctgccac tcgagacgat caaaaaacga atcgctcgtg 
267121 ccaaacgttc actctgcgaa gccaatgtcg cggtggaatt gcccagccct gacgagattg 
267181 cacagcgccg cggcgtcgtt cacgatgtgc tgtatctgat gttcaatgag ggacacaaca 
267241 cctcacacgg atcccagccc atccgagatg acatttgcga agaagcagca agactgtgtc 
267301 atctgctgag cgaaagcgaa catggctcgc gcgccacgaa agctctgctg gcactgatgc 
267361 tgtttcaagc agcccgcttg gaatcacgaa ccgacgaaca aggcaacgtg attctcctgg
267421 aggaccaaga ccgttcgaaa tgggatcggc gactgatcac catcgcgcaa catgagctgg 
267481 ctggctcgaa gattgattcg ccgactacgt accactggga agcagccatc gcgatgcagc 
267541 actgcatcgc cccaagcgta gaagccaccg actggtcaac gatcgttcgt ttttacaatc 
267601 gcctgcttga gatccaccgg tcgccgattt acgaactcaa ccgagccatc gcgagggctc 
267661 aagcaggcca gatcaacgag gcactgaaag agctgcaaac gctccagcaa tccggtgcaa 
267721 tgcagaccta tttgttgctg gattgtgcaa tcgccagagt gcatgaactg ggcaaccaca 
267781 cctcacaggc aattgacgcc tatctgatcg cgctgtccaa agccaaagct gatcacgaga 
267841 aacaactggt cgaaacgaaa ctgcgaaagc tacaacaggc gtagaacgaa aaagccctcg 
267901 caagagaact tgcgagggct attcgttgat tcgtttggat caagtgaccc gctggatcaa 
267961 ttggctggac gacttgactg aactacttgt cagtgaagtc caagaaccac caaccatcgt 
268021 cccattcgag cgagacttta cgccatccca aaggaacttg tgggtagggg tagaaaggtc 
268081 cgatgtatgg gaaagccgat gggctgtact gttgtgggta cgtcagtgca gcgtagtttg 
268141 gatacgctgc gtagcctggc catgcgtagt tcggcaggtt tggcgaatcg tagcgaggtg 
268201 ctccgacggg tgcggcagga gccattggaa cgggttgtcc catgacaggt gcgtgagcca
268261 ttggtgcagg tgctgctggc acgccggctc catgagtcac catgctaccg ggcacgacca 
268321 tttcgccgtt gatcatttgt ccgccagctt gtccgtacga aacgggacga gcttggactt 
268381 gaccacggtt agctggtggc aaaggctgca atccaccggt gcggtgagaa gcaggtgttg 
268441 caaccaaacc ggtgttcgaa ccagccttca cgctcatcag atcgcgaacg ccgcgtgctc 
268501 caggagcgtg acgagcgatt tcggcgatca gttcacgttg cgatgccgaa gcagcttcgc 
268561 cggtgatttc gacgatgccg tcgtatgcgt tgacgtcgac gccgaagtct ttcaagtgac 
268621 cgaccgattg ggctttgccc aaagcagcgg cgaccgattg cgtgattgct tcgtcgctgt 
268681 tggcttcgcc gccagcgttt tcccaagcag ccacggtgcg agtttgcgag tcagcggctg 
268741 cttcgccggc gactggtgaa accgtaccag gaacaacttc ttgagcttcg gtttccaaag 
268801 cacctttgaa gtcaaacgct tcagctggct caacagctgc gattgcagcg gctttgggtt 
268861 tgtaaatctt tgcttcgaga acttcgagtt cgttaacaac attgacaacg ccgtcgacgt 
268921 ctttcaaagc ggccaagacc aagtcttgtt gctccgagcc atcgacggtg ccgcggaaca 
268981 ataccacgcc gtcgttgact ttcatgtcga ggttgaagtt cttcagctgg cctgcatcac 
269041 ggctgacttt gagccgttgc atgacttgtt gtgcgatctc gcgatcaccg ccaaaggcct 
269101 gagcaggccc cagcgtggca atcgcggcca tcgccagtcc aaaatatttg cgtcgcatgg 
269161 aaatgcctcc ctacagtcct ttgtgaggtg tcattcaccg accgccgtgc tcgtggcaaa 
269221 aaaggaacgg ccgacagccg gtccgatcgg tgctatggac tgtttcggtt gtaacgaggc 
269281 ctgaacggca gtttttttcc gattttgtgg ataaatcgac gcgcaccacg atcgacggcg 
269341 aaagagaaac cgcggcgcaa acgggctcaa ttcaaattca accggccgag aaaaatcgct 
269401 tgttcgtccc gccactgcca atcggaatct ccatcctcgc gccgcacttg gggcatccgc 
269461 cgcgatacgc ggttcgctgg tcgttgcggt aaatgcgacc ataggtttgg caacatcgga 
269521 actgaatgcc taagaacggc cgtgggtttc ccggcgttcc gtcacgtccc gaagccgaat 
269581 cggacgcgtt ggactgatcg cgtacgtcgt gcacaacatc gagatgtttg ccaacgtttg 



269701 agaggtgtca gttgtctttt gccagaacaa 
269761 aaacccgctc ggaatcgcct ggtccacaaa 
269821 cggttcgccc gaggattctg actcaggttc 
269881 cttgcgagcc cgattcaatc ggctgcgaac 
269941 catttcctca taggaagatt cgttcatctc 
270001 gctgagttgc cgcagagcca attgcaccaa 
270061 atcgacggcc tcgccgccac aaacgacttc 
270121 cgagaggcga accttctggc ggcgctttcg 
270181 caaccaagtg aagaactggc tattttgctg 
270241 gaaggcctct tgagtcactt cttcggcatc 
270301 gtgatgcagc cggtcttggt gacggatcac 
270361 cagcgccgct tcgatcaacg ccccttcgtc 
270421 gtttcggttc gagctggctt cgtccacttc 
270481 tcagtttaac tcaaacgagt cgccctgttg 
270541 agcctctcga acgtcctcat cgaaatgatc 
270601 catcggagga aacgattccc cgttttcatc 
270661 aaacgaaaca atcaacccaa cctgagatgc 
270721 caaaccccgc cacagcaaca attgcacgct 
270781 tcccagtgaa tctcgtgatt gggcgaattc 
270841 cgcgagcaat atcacgccca tcaccgcttg 
270901 cagaatcatc aaatcaatca acctcttcgt 
270961 cgccgatgga caaaaacttt ttgccaacac 
271021 tgttagcaaa cgcaatccga tttgaaccaa 
271081 ccacccgctc ggcggccaag tgccaatgcc 
271141 cggaacgacc atcgcttgcg gaaacgcaaa 
271201 taatgtcaat cgtgagggca acgcttttcc 
271261 ggttgccaac gccaccacgt gatacaggat
271321 tgtccacatc caattcggcc gcaccgaatc 
271381 ctcagtcgca ccaaccaaag cgaatgtgat 
271441 tgacgaaatc ggaagccggc aatctcggca 
271501 cggcacattg tcacgagccc gaagccgatt 
271561 cacgttcaaa cgttgaggca atcgccaaac 
271621 cccaaccacc aagaaaaacg taaaaataac 
271681 gaccacgcga aaatgcgggt gtgcccccga 
271741 cgccgatgcg agcagatacc cgatcagcaa 
271801 tcgtcgcgac aaaattgcat ggcccttggc
271861 aactgacgac tcccatccaa tccccatcgc 
271921 atcaattcgc caacgaccac ccgccccggc 
271981 tcgcccaacg gcggctgcac tggaatttcg 
272041 aatttgaaaa accgctgctg atcttcgaac 
272101 accggatcca ccggttcatc atcgaaggca 
272161 tgccggtggt gtactacccg tacgttgaac 
272221 accgactcgc caaacacgct tgcacggtcg 
272281 cgcacatgat cgacgcggtg aaggacaaac 
272341 atggagttct cccactgcga gcggctgaga 
272401 gattcatgca gaaaaacatt ctgcccgcgc 
272461 atcgaaacag caaccgccct gccctgccct 
272521 gaaagaaatt gctcccgtcc aagatcctgt 
272581 tgctcggcca agatggcttg tcggaaattc 
272641 cgacgcccgg cggaacggtg gaagccaccc 
272701 tgtctcgcta caacgacgac cgcaaccatc 
272761 ctcatctgca cttcggtcac atttcggcgc 
272821 agggttggtc cgcagaccaa acgtccaaac 
272881 tcggcgaaaa cgcggaagcc ttgctcgatc 
272941 attggtcgtt ccaaaatcca gacacgtatg 
273001 agaagactct gaatgaaacg cgtgatgacg 
273061 ttgagaatgc cgagacgcac gacgaaattt 
273121 ccggcgagat gcacaactac atgcgaatgc 
273181 aaacgcccga agaagctctt gagttcatga 
273241 gtcgcgatcc caactcatac tgcggcatcc 
273301 ggggtcccaa gcgtcccgtc ttcggcagcg
273361 aaaagctcaa gctcaccaag tacctccagc 
273421 agagcgaaaa tcaagtggca gaggctttca 
273481 gcctgagtca aattttcacc gctcaaaggg 
273541 agaaacgatc cgcccgtaac cgactgaggt 
273601 caagcgatgg cccctcagca agcacgtggc 
273661 gagattcgtt tgccgacgga gggcctccgt 
273721 ctgacacctt acgagcacgc gaaacgggta 
273781 gtcgctccgc gactggtcgc ggaggcacaa 
273841 gtttcaaccc gaggtttaaa tcgttccgcc 
273901 ggttgagaca cacgaccaac tgacgtcgct
273961 ggtccttggg ttgaaaccca aggctggccg
274021 aggcacaacc gcgacgctgg ttttcatcgg 
274081 ttcatggcct tgacctcctt tgcgaggtcg 
274141 gccctcagtc ggcgtatcca ccaccatcgc 



gctgaaatat tggcctcgtg ttgggtcgaa 
ccacttcgtc aaccaaccgg ctcaggagct 
tcgcaactgt tcgatcgcca atcgcaattg 
ggtaccgatg gacagttcca aaatctcagc 
tcgcaagaca agaatgctgc ggtgctcatc 
cgaaacgtca tcctggcgaa tcatgttcgc 
ttcgccgatt tcctcgcgct gctggtccag 
cgacaaagcg atgttgaacg cgactcgata 
gaacgtgtcc agtttaacga atgcccgaat 
atggaccgaa ccggtgacgt gaatcatggc 
caagccttca aaggcggacg cgtcacctga 
agtcggcgcg agattcgccg cggtgcgttc 
ccctcaactt gatgatcgtt cgctgggtag 
caaagcatct gggtctttcc tggatgcctg 
cgagactccg ccgttgccac tttcatcgcc 
caaccacaac ggaatcgtta gaactccgaa 
ttgctcgctg ggccacactt gagacgccac 
cgtggcaacc ggcaaactga cagcaaaacg 
acacatacca agaatcttcg cgaaaaccgc 
ccagcccacc acgatcgcgg gcaccgcgat 
tgatgatcca aggggatcca ttttcatgtc 
acgagcgaaa aagcccgccg ccacgagcga 
agtcaccgac cattccgcac cagtcgtcat 
cgccggcatc gattcaccaa cgagttgcca 
catcccgcca atcagcgtca ctgccgcgaa 
atcaaagcgt atcaggccca tcgcccaagc 
caacgtcaac aaatcagcgt caacaatcgg 
accgaaatat ggcaagctcc aacgcgtcaa 
cccgaccaac gtttcgacga tcgggtagcg 
tcgcccgctc aaccacaacc aaccaaagat 
gcgacacctc ggacagaacg aatggccgtt
gaccacatta agaaagctgc cgacgctcgc 
cacgtcattg gctcgcggca tcaacaaatc 
gatccacagc gcagacaacc acacataagc 
gcaaacggcg agaattcgat acggtgagcg 
gtgtccgcac actctactag ccaagccccg 
tcatgacttc cacgctccct ccccagcgtc 
atgaggggga ttacgtcgtt tattggctga 
cgttgcaaca tgcgattgac cgggcactcg 
ccctgcgagt tcgctaccaa tgggccagcg 
tgcgagacaa cgctgcggag gcggaatcat 
ccaaaccggg tgtcggctcg ccactgcttc 
tcactgatga atacccgtgt ttctttctgc 
tgcccgcccg tttggaattg atcgattcca 
agacttacac ggtcgcacac agttaccggc 
tcgagcagtt gccactcgcc aacccactgg 
gccccgatgg cgaatcgatg gcagaacgcc 
ccaaatggcc tgccgctgat ttagaaaagc 
caatcgatca ttcggttcgt ccgtcccaag 
gtcgctggca agaatttttg aacaagaaac 
ccgatgaaca cgccacgacg ggcctgagcc 
acgaaatggt ttcttcgatt ttggaacacg 
ccaacggcaa gaaccatagc ttttggaacg 
aactgctgac ctggcgagag atcggattca 
actcgctgga atcgttgccg gattgggcgc 
agcgtcccca tctttacacg ctggcggagt 
ggaacgcggc gcagcgtgag ttggtggaaa 
tatggggcaa gaagattttg cattggaccc 
ttcacctgaa caacaagtat gggctggatg 
tttgggtgct cggccgcacc gaccgagcct 
tccgctacat gaccagcgac agcacccgca 
ggttcgccgg atcgaacaag aagtagggac 
gcctgtgatt gcgcggaact caggttggaa 
atcggaccga atgaagtgaa ttccggcaga 
cctcagtcgg cgtatccacc accatcgctg 
aacgagtgag ccaacggctc tggcgtcgct 
cggcgacatg gaaaggaagc cgataaacgg 
tcgcctggcg acgggcggca aacaactgac 
ccgcggagcg gtgaaatctg tcagcctcgg 
catttgagaa gccgcggagt ggcgacaggt 
ccgcgactgg tgagatggtt atgcgctgat 
gatgccgtcg ctccgcgact gaagacgagg 
ctgacgccaa tgccgacaac ctgccggcac 
ttcgttgcca gccccccgta gccgactgag 
tgcaagcgat ggcccctcag caagcacgtg 



274261 gtcggcgaca tggaaaggaa aacaccccaa attggcctcc tcgtttcggt tagattaggt 
274321 tcgctcttcc agggcaatct ctttgaattc tccatcatgt cgcgcatcca cccgagactc 
274381 ctgaagcatc atgaaagtat cccttgcggt cttcttggca ctcaccctga ccatctccat
274441 ccaagcagaa gaacttcctg acgtcgccgt cgcgggggaa gacgcgttgg tgtcgtcgca 
274501 acaaatcttt gatcttgctg acaagcccac caaagagagt catgcgtcca ccattgtcga 
274561 gacaccaacc gggctggtcg cggcttggtt cgcgggcact cgcgagcgtg accctgacgt 
274621 cggcattcgt gtttctcgcc atgagaacgg ccagtggact gagtcggtcg aggtcgtcag
274681 cggtgtccag tcatcgacgc tccgctatcc cacctggaat ccggtgctgt ttcagcccag 
274741 cgaaggcccg ttgatgctgt tctacaaagt cggtccgaac ccacgggaat ggtggggcat 
274801 gctcacgact tcccaggacg gtggcaaaac gtggtcctgg ccaaccaaac ttggcgaagc 
274861 ccacacgatc ggacacctcc ttggaccagt caagaacaaa ccggttgagc tcgccgatgg 
274921 aacgatcttg tgcccttcca gcaccgagat cgagtacgcg gatggatcct cccactggcg 
274981 agttcacttc gaggtcacca aggatctcgg caagacttgg gaggtcatcg ggccaatcca
275041 gaccggcgaa acgttccatg cgattcagcc aagcatcttg acctaccccg acgaccgtct 
275101 gcagatcctc tgccgcagca aagagaaacg cattgttgaa agctggtcca aagacggcgg 
275161 aaaaacctgg agcacgctca ccgcgaccga cttgcccaat cccaattccg gcaccgatgc 
275221 ggtgacactt caagacggac ggcaagtctt gatctacaac cactccgaag gccttgtgag 
275281 acggaaagac gctcccgacc tgaagcctcg ccggattctc aacttggcga tttcttccga 
275341 cggcaaaact tggaaacctg tgctgacgct ggaacatgag acgggccccc atccgaaaga 
275401 ttcagatcgc cgccgtcact tcggtgagta cagctacccg gccatcatcc aaacctccga 
275461 cgggatgctc aacatggtct acacgtacaa ccgcgaaggc gtcaaacacg cggtcgtcga 
275521 cccaagcaag ctctgagcac acccgtcttt cgatctcgcg aatgattcac caagtgacgt 
275581 gcgccagatg aggaagaact gctgcgcagc gacaatctcg caacaatcgc ctgccaacca 
275641 accgtagccg actgaggccc tcagtcggca aaaccactat caatcgctgc aagcgattgg 
275701 cccccagcaa gcacgtgata gcaagtgagc agacgtttgg ggcgtcgcaa atgctcgttt 
275761 gccgacggag ggcctccgtc ggctacttga agcctccgtc ggttacttgg tcgctcacga 
275821 gatctcgctc tcgtggcctg cggcggccaa ggttttgaat taagttttgg ctcttcctcc 
275881 gaagtaccct cacgctttcc ttcccagttc gccaccgtgc ttctcccgaa gtactctcgc 
275941 atcgccttga tcgcgttcgc attgttcgca agcgtcagcg gcgaggccgt acgcacgcag 
276001 tcaactgcac cgtcaggtca aacgggactg ccgtcgtcgc tgcaattcgc atcggagggc 
276061 acacgtttta cgacgcgttg ctgaacaacg ctgcttggcg attccattga ccaagatcgc 
276121 agcccctcac cagttccaac gaaggaggtg ctcgcgatct ttgatcaaca aataattcgc
276181 ccccaacacg ggagccgccc aggtcggtga atccgacaca gtgtaagtcg ctttggcttg 
276241 gtattcttcg gcggccgcgt cgatgacacg cagttcaccg ctgtccgtca gcgtgataat 
276301 gtggccggga atcgccaaga acgtggcgtg gtcgcccaag cggggacggc cttgccaagg
276361 gacgctgcca tcgtcagcgt tgatgcagaa caatcgccca ctgtcgtagt gagacaaacc 
276421 gaacagacga ccgtcattca tcaccgcggt cgacatgttc attgccaagt cttccagcac
276481 ccaggcttct tctacgatcc agccatcgtc tgtcttcgtt ggacgcaagg aacggattcc 
276541 gcggttctcg gcaccgatga tgatcgtgtc gcgatgctgg accggcgtcg gggaattctg 
276601 gtccgggtca acatgcttca acgcatacga ccacagccat tttccggtct caacatcgac 
276661 accggcaacg tcgtcacgat tccaatcgat gacctggcga accccgtgca actccaccac 
276721 cacgggtgaa gagtacgaga tcggcttgtc gccctgcgac caaatcgttt cgcccgaggc
276781 aacatcgaag gcgtacagag caccggtttc atcgttgccc aaatggatca gcagacgatt 
276841 cccatccacg acgggtgagc aactggcgcc ccactttgga tgattgatcg aaaatcgctt
276901 tcgctcatca cgctgccaga gttgatcgcc cgattccgcg tcccaggccg tcacgacgcc 
276961 gttgatgctg gccgtgaaga ggcgtccgtc cgccatcacc ggattggctt tgggccctgc 
277021 cccgtgccgc tccccgccgc caccgatttc aaagggcacc ggccaggatc gcttccaaat 
277081 ctcgcctccc gacgcgaggt caaagcatgt gatcacttcg tcctcgcctt gacgagcatg 
277141 ttggtacaca cggtcgccgc tgaccaaagg cgtcccatag ccttcgccaa cattgatctt 
277201 ccaagccaat gacagttgct cgggccacga ctcaggtggc gaaaaatcgg ccacccaacc 
277261 gttgcgatgc gggcccaacc aaccggtcca ggcatccccc gcagtcgccg aaggaatgga 
277321 gttcgcaaca ggcaagagcg tgcaggccag ccacacgaaa acgcatcggg gaccgttgat 
277381 catggaatca ccaaaattcg aaagcgagcc aggacaggct gatgctctct cgagggtcag 
277441 aaacaaattg tcatcacatt gaaagggatg aacattcaaa cgaccaagcc gccctgtcta
277501 caagccaccc tgtctttggg agccaaactc tcgacctgga tgatcctgaa acttcaggac 
277561 gtccaagttc tgtccagggc gtttgaaaac gcgggttggg tggagagtga acgaccgggg 
277621 cgtgatggat tgaaaaagtg aaattgacaa ttgcaaaatg caaaattgat tcgctgggca 
277681 gtcgcgtttt cggggaagga cggacgtccg attccggcaa aagtcattgg ccccggtgga 
277741 cttagaagtc atgccgtttt tgggtgccgt attgccaagc gtctctcaag gtgcaatcca 
277801 ggaataaaat tggcacccgc aatcagagcg tcaatggcga cggcttgtcg actcaacagt 
277861 ctctcggggc ttccgccccg agctaagcac gacggcccca tccggggcga aacccagacg 
277921 ctagccgcca tttatttttc gaacgtccga agggaggatg aagctaacct gcacgacctc 
277981 tttcaacggg caaggtttac cagaggttgc gagacccgca tttcaaaatg ccgtccgaac 
278041 tcctgaaatg gtttcttgcg gatgctctcg ccagacgccc ggcaacgccg gcgaatcact 
278101 cgcgaacggg cattcggacg acttggccgc tgatggcggg caaagcttcg atgtcggtgg 
278161 tcgctcgtga cgcggcacct tcgctggctt cgtgagtcat gatcaccaac ggcacggtgt 
278221 tctccgccgc gttggcttcg tctttcgact tcggttcgtg ctggatgacc gacgcgatgg 
278281 aaatcgaatg gcttgagagc acacccgcga tcgctgccaa tgttcctggg tcgttggcca 
278341 ctcgcaatcg caagtagtac cgacctcgca gccgatcgac atctcgctgg gccgtccgtg 
278401 gaggacgctc cgccgaaaag tactccagcg tttcaaacgt gattggcgtg cgtcccacgg 
278461 cggtgtcgat gacatcggcg acgacggcag acgcagttgg catctggccc gcacccaacc 
278521 cgtgatagaa aaccggtccg accgcatcgc ccaccacacg aatggcgttg aaggcgtcac 
278581 ggacttcggc catgggcgtg ccgatggcga ccaaggtcgg tgcaacggac aattccagtt 
278641 cgccatcgac caagcgagcg gtggcgagca atttgattcg atatccaagc tggcgagcgt 
278701 aaaccagatc ggctggatcc agcccatcga tgccttggcg aggaatgtct cgccaatcca 



278821 cgacatccat cgcaggatcg gcttcggcgt accccaattc ctgagcccgt ttgacggtct 
278881 cttcgtagcc ggcgcccttt tcgtccattt gcgagacgat gaaattgctg gttccgttga 
278941 ggatgccctc caacgattca atttgattgg ccgacaaaca ctggctgatg ttggcaatga 
279001 tcgggatgcc accggccacg gccgcttcga acgcgatgct gcgtccgagt tcgcgggctc 
279061 gggtgaacaa ttcggcaccg tgttcggcca acaatgcttt gttcgcggtg acaatgtctt 
279121 tgccagcttc catcgcccgc agcatgactt cgcgagccgg tgacaacccg cccatcaatt 
279181 ggatgacaac gtcgatttcg gggtcattca ggacgtcatc gatcgaatcg gtcaggactc 
279241 cttcgggcag atcaattccg cgaggtttgc tgagatcgcg gacgacagcc ttggacaacc 
279301 aaatggtttt gcccgcgtga cgagcggttc gatcgccgtg atcgatcaac aaacgagcca 
279361 caccggcacc gacggtcccc attccgatga tggcaacgtt ggtccggtcc gatgattttt 
279421 gcttggtgga tgtagcgttc atgtggggga tggtaatcag agcccacgat cattgaacac 
279481 gggggtcatc cagattcctc gacgaagtcc acggtaccgg acggagattc cggtcggtcg 
279541 ggttcgttcg gcccccaaaa aagcatccgc catcagaatg aggcggattt gctggcaatc 
279601 aaaagaaaac ggcggtgcaa cgtgcccaaa cgctgccccg ccgttctcat tttcaatcgc 
279661 aagaaacatg cagccgggtc tcggtgtcgc aatcccacac cacgacacgt gcccccgtct 
279721 gtccgtttcg gccatagcgt cgggtcctcc acttggttga ccacgccggc tagtcgtaat 
279781 gttcggcata accgagcccc gagcgaatgc aaattcctct agaaagacga cttcatcggt 
279841 ccaccgtggt cgaccttgcc ggttaaggca attcggctcg ccacattcca gatttattca 
279901 ctttcgatct tgtgaccgat ccgattgctt tgagaaactt gagcaagatt cgaaactttc 
279961 cgtctcgaat attcccgacc aatcgcttcc gccgctgcaa ggatgctccg atggcaatcg 
280021 gccccctgac cgacaccacc acgttgtcga tcgatcgcct gtacgacctc tatcacgcca 
280081 ttgctgaacg ggaccatgcg tttcgtctgc aagctcaata cggcagcacg ccgcctccca 
280141 aggggcattg cgaattccgt ccgttgggac gccaaacttt cgtgcaacgc gtgctgcact 
280201 acgactcgct tcctgccgag gtgggcgccg cgttccgcgc ccggttgtca cgccaagcgg 
280261 aagcttacgg cgtggatcct ttgagcaaga cactcaacaa aactaacgct gcctgatgag 
280321 cgacagaggc caaacgcttc ggcaaatttt gcgtgccgcg cactgtcgca gcacgcatca 
280381 tcacttcgcc ttggacgcaa ctgaattggt gcaaaccgat tcgggacagc gtttggtttc 

280441 gcaattgcta cggcaccaca ctcgctattt ggcgggtgca aaagatcccg acgtgcgttt 
280501 tcgcgatttc cagaatcacg tggtgcatgt cgacgacggc tactggggcg gcgcaccacg 
280561 cgtcgctcac cagtggtacg accgcttgca gcgttacctg caccaatcac gttttgacga 
280621 tgcggcacat gctgccggcg tactcagcca ttacgtgacc gatcccatcc agccgctgca 
280681 cacggcccag acgccgatgg aaaaggtttt gcaccgaccg attgaatggt gcatcactca 
280741 aaattacttg gatctgctcg ctcagtggcg acaggatccc actcgaatcg ttttccaact 
280801 ttccgaccaa ccgggatggc tcggcgaagc gatcttgcaa tccgctcgtt tggcgcacag 
280861 tcactaccaa acactgttgg acaactttga tttggcggcg tcggaacatg atcccaaaca 
280921 aggtctcaac gaagtttccc gtcgcatcgt ttcacaattg gtcggactgt ctgtgaccgg 
280981 ctgggctcgc atcctcgagc gtgctgctgc ggacgctgag cagcgaagcg gccgccccat 
281041 tcccacggcc ggcctctcgc taccgatggt attttccgcg atccgcgtgc ccgatcgcat 
281101 gtggctgaaa cggattacct acaaaatcga acgagaaaag gtgcagcggt tggtcgatga 
281161 atttcgacga accggcgatg tggagcaaaa cttgcccagc gaagttcgcg tcttgcaacg 
281221 cgtgcgaacc atttaccatc gcgaaaaaga ataccgtcag gcaaaggcca aactggccgc 
281281 cgcccgcgaa gcgttggtgg cgtccgaagc agccgaacgc caatccgaat cgcacgatgc 
281341 aaacgtgctg ccgtttgtca gcaccgaaga caacgttcga ttgcatgtgg ctgacccgtt 
281401 ggtcgacgca ccatcaatcg gcaaaaagac cgctgctcga ttcgcaggca tcggcatcca 
281461 tactgtgggt gactttttga aagccgacga gaaaaccatg acgcggcgat tggccacccg 
281521 ctggatcacg gtcgacacga ttcatgcttg gcgttgccaa acgatgttga tgtgtcaact 
281581 gccatcgatg ctggcccgcg aagttcaact gttggtcggg gcgggttaca cgacgaccga 
281641 tgcgctcgcg aaaaccgatg tgacgacgtt gcaatccgcg atcgagactt attcggcgac 
281701 ctactcgggt cgtcgatact tacgcggggc gaacccaccg acattggacc gattggaaat 
281761 catcatcggc gacgctgctc acgcaatgat caaactcaaa cgcgataacg cgatgcacca 
281821 cagcgactcg gaatcaaccg aatcgtccag tgcagctcaa tcccaacttc aatcagccgg 
281881 ccagccgcag cagtcgcctc agcgacgtgc tgcctgagcg acccgaccga tacgccaact 
281941 cattcggcgg gtgtgacgtc cacttcgatc atgccatcat cggtcatcgg gaccggctga 
282001 gcgatcgctg acgtcgccgg tgctggttgc acaacgtcgt aggacgcggc tcccatcgtt 
282061 gactgaaccg ccgcaggatc aaactgcccc gatgtcaacg gacgcggtgc ggaaaccttc 
282121 tcgggatgac gacgcggctc aaggaatcgt cctagctcat acgcatcgcg ttcggctcct 
282181 gggcaccaag ctccctcggt caggtcgact tggttgtacg ccagcaacga tcccttttct 
282241 cgatggaaat cgcgaatcgc gaggttgtag ttgctgatcg attgatagaa ctccgattcg 
282301 ctggtgacca cttgccgttg tgcttgcaac aggaagttga tgttgtcggt cccgtcacga 
282361 tagcggcgac gcaaaacctc aacctgacgc aaatcggcca ggtaacggtt gtagttggtt 
282421 tcaaccaatt cgaatgtcaa gtcgatggca cgtgccgcgt cggacaaatc gtggctgatc 
282481 cgcaattcgg tttccgccaa aacggctcgt tcacgttgca agttcagttt ggcgtgagcg 
282541 actgcggaac tcgcggcacg caatccaatt ggtgccaaaa actcgacgcc agcggtccat 
282601 tcttgatagt ttccgccggt gatcgaatcg accaagttgt cgttgagacc ttgatcgccg 
282661 ggtccgatca aattgtcacc caaacctcgc caacgatact ggccaacaaa gtccacctgc 
282721 ggacgacgat tcaaacgtga cgccagcaat tccatttcac ggcgtttgac cgtgaacttt 
282781 tggcgacgca cttcgacgcg gcggtcgaga gcttgcccca aagcactttg ccaatcgaac 
282841 accactttcg cgttgagtgg atcgctggtc ggacgcaaca gtgatccatc cgtcgctggc 
282901 aaaccgatca ggtatcgcaa cgcttgttcg cgagcgtaca aaccggtttc gccggccaga 
282961 gccgatttga cttgagcttc gaattggtag tactgcgaac gtgcttgagc ttcttcgtcg 
283021 cgccggcctg taccgacttc caatcgcacc tgttggtact ggaacgtttg caacgccgac 
283081 tcacgacctt tcgcattcgc ttccagcaaa cgataagagg tcgtcaggtc ccagtaggct 
283141 tgctcgacat cggaaaccaa ctggatcaca gcggcttcga aatcagccaa ggcaacgtct 
283201 tcgttgactc gagcaatcag aacaccgttg tactggccga tggctccgct ggccccagcc 
283261 gatccggcga tccggttgta tgtcgttccg gcacctcgca tcaaaggctg acgatactcg 



283381 tatccgacat tgctacgcaa agcgaactgg gcaccggtcg cggtgcgttt ggacagttcg 
283441 ttggcgaacg ttgcttgggt ggcttgaaat gtggtcggtg tgaaagccgc accaagccca 
283501 ccggttgcga tgttctgagg ttgatcgagc ttgttccaga acaacgattg gctgtactga 
283561 gcgtcgaagt tggccagagc tgcttcgaca ccccccaacg gatccgcgtg agccaagcct 
283621 ggcgtgaaga ccgtcgtcgc ggattgagga gcgctcacca ccgcgccacc aatcgttcgc 
283681 agcacgggac tggatcgcaa tgcttgctgg accgcttgct ccagcgtcaa ctcgacgctg 
283741 ggcagattgg ccgggtcttc cagcgtattg ggctgaaccg cagacacggc agccgagatg 
283801 actggcgtgg cacattggtt gacctcgggg tactcgatcc gcattgccga atggtcatga 
283861 tacgacgttt tggtgtcgtg aggaccgtcc gggattttct gcgcccaatg gcatccgctg 
283921 gtcagcgtgg atgccgccac cgcgacaccc agccaagagg ccaagcgacg acgacgggtt 
283981 tgtttgcgcg aagagagaga cgtttgccga tccatggcaa tctcgattcc atatcaggat 
284041 ggttccctct gctcacacca acatccttgt cggtgcccag tcgcagaggc ggatagtttc 
284101 gttgtcaaat cactcagcac taaggcagag cgatgattcc gcccgcaatt gcggcggctt 
284161 ctacaacaac agcacgccgt tcaagcggcg gacgctgcat cacgcaagga gccgatgagc 
284221 cggcccggta gtaacccttg ctataaacga tgtcgcacgg ttcctgaatc gttgccaagt 
284281 aaggcaacat tggaatcgtt gtgaagaacc gagcaccagc ggcgatcggt tgcatgcatc 
284341 cccatcgttc gtgaccgtgt cgttccaaca ttcggtcttc gaagtacaac ggatgcgaat 
284401 aggtgtttgg tgcagcccaa ggcagtaccg cccccgctgc caccattccg cgagcgaatg 
284461 gttcttccgg caaagcgatc gcagcaacgt cttcggtgga atcaaaatca tccggcagac 
284521 gaccgttgcc aatcgtcgat gtatccaaag aaaccgctcg gagcgaaggc aacacaatgc 
284581 ggccaccggt cacattgggc agtcgttcgc tgatgaacgg ctgcgagaac aacccgtctt 
284641 catccttttc ttcttccttg gcgtccgcga taatttcatc ccgcagcgac attcggtctt 
284701 tgccgttgac ggcttccgca cgatcgcttt ccgcaccggt gcgtagcaaa tctcgcaaac 
284761 gctgacgacg agcctcatcg gcattcgctg gtgcagtatc gcttggcacg tcgctcgctg 
284821 cggctgaatc ggcgaccaca tcgtcgtcaa tcatcgcaaa accggcagcg gcgaagaagc 
284881 cactgggtag cgacgccacg tgtgcatctg gagtggtcaa cgccatcggt tcgtcagcgg 
284941 tgccaacagc cacggtcgag gccaagcatc ctgacaatac ccacgcgaac gttcgccgac 
285001 ttggccggat cccccggacc caacgcacca tttcgtaaaa tcgtttgacc gagaactctg 
285061 cgaacattct cgatgtgtta cgacgacgga cggggccatc accgaaccac caatcccatg 
285121 tagattccag cattgcgatc gtcttccttg acgggtcatg caaaaagtga aagctcgttc 
285181 aaacaacgac cttcccgtgc atccttcggg tcgtgataaa caatcactcc aaaacagttg 
285241 atccaccttg cgcgtttgtt acgccggatc gacacacatc cagcacaacc cgcacaaacg 
285301 gacccgaacg attgccagag acaattaggg acctcgacga cccccggctc tcgccctacc 
285361 gaaaccttcg acaccgtgaa actgacgacg gcttcatcgc cgaaggcagt ttggtggttg 
285421 gacgactgct gcaaagtgac ctcggcgttc aatccgtgct gatccacacc ggtcgcgaat 
285481 cgaagtattt gccactgatt cctgacagcg ttcccgtgta cttgatcgat cgagaactcg 
285541 gcaaagagtt ggccggttac gatttccatc gcggtgtgct ggcacatggc ttcagtcccg 
285601 ccatcgaacc actggagaaa tttcgcgact caccccagtc ctcgctcgca ttggcactgg 
285661 tggggctgag tgatcccgaa aacgtgggca gtctgcttcg ctccgccgcg gcattgggtg 
285721 ttcgcgacat ccttctgggt ccgcacacga tttcacccat gacccgtcgc gtcattcgag 
285781 tcagcatggc atcggtgttt cgtcatcgct tttatcgttt cgacaatcca gtgggacaac 
285841 tgcgagattt ggcaaacgcg ggcatcgcat cgatcgcgac gcttttggat gataccgcga 
285901 ttccattgat cgatctttct aacgctatcc agtccaaacg acgaattctg atggttggca 
285961 acgaggccaa tggcttgccc cgcgagattg ccgaagccgc cacccaccgc agcacattgc 
286021 caatgcgaaa cgacaccgac agtctcaatg tgggggtcgc atcagccatc ttcctctacg 
286081 agctgactcg attgcaggac gctgatcatg gaaacggctg atgtgcatga cttgctgtcg 
286141 gcgctgattc atcatgcaaa acagcggccc gacgaaattg ccttgatcga ttcccaaacg 
286201 ggcgccgccg ctttcacttg gagcgaactg acttgccgcg tcgatgcgac cgcgatcatg 
286261 ttgcggtcga aattctcaga agcgaccacg acgaaacgct tgacctatcg atgcaccaac 
286321 cgccccgagg atgtggtgct gtctctggcc tgcgttgctg ctggattcac agagatcccg 
286381 atcgacgcgt tcttgcccgc cccacaacaa gacgtcttga tcaaacgatc caaagcgttg 
286441 cactgggctc acgaaatcca tgcgaattcg attcactcga gcaaggtggc ggatgcgatc 
286501 tcgaacttag aaacggcagc tcgcgacgtt gacattcact cgccatcgtt ggttctttgg 
286561 acgagcggca cgacctccga accccgaggc gtgatgctgt cccagcacaa tctgaccacc 
286621 aatgcgaagg ccaaactgct ggcggtccct caacacacct cggacctgcg actgtcgttg 
286681 ctgtccatcg ctcacgctta cgcgcgaacc agcgacatgg gaacgtggtt gctctcgggg 
286741 tgtcgctgga gtctcggccg aggccggtcc acgttgcgtt cgctccccga aaacctctgc 
286801 ccgactttga tcaacgctgt gccggtattg atcaatgaca tcttgaatcg aattgaatcc 
286861 gggcaatcca gtttgcaatc gctgcgtttg ctcggatgcg gcggcgtcgc aatgtctctc 
286921 gggcaattcg agcgatgcaa gcaaaacggt atcggtgtca ttcagggcta cggctgcacc 
286981 gaaacgtcac cggtgatctg cagtgcatca cctgacaacg cgactcccaa tcgagtcggt 
287041 cccttggtcg caggctggga atccaaggtc gaacacgggc gtctgttcgt ccgaggccct 
287101 ggcgtgatga tcggctacct cgacgaggaa gccgcgacgc agcagaaggt ctcgccgggc 
287161 ggttggcttg atacaggcga cttggttgag attcacgacg acggtcaatt tcaaatcctc 
287221 ggtcgcgccg acgatgtgat cgtgatggac aacggcttca aagtcttccc cgccaccatc 
287281 gaacggcaac tactgcagct agagggaatc gaacaagccg tcctgctcca tcaccaaggc 
287341 cagctttgtc tgctgctctc acacagccaa ccccaagtta ccgccgacgc tccaaccggc 
287401 cgcgatccaa ttgaatcatg cttggccgat agccttccgc cgggaacaag cgtgaagcga 
287461 ctcaaactgt ccgaaccatt gtccatcaag tcaggagaac tcaccgccaa gggaacccct 
287521 cgccgaccaa tcatccgtca gcgccgactc tctagccccc cggatagtcg cccaaaccac 
287581 tctcgcgatc cataacggcc tgttatccgg aaacaattcg tagcaagacc gttcgccatc 
287641 tgtcgacggg aagcatctac cgacattggt cctcaaaccg aagcctgaag aagctcgttc 
287701 aaattcggga tgcacggaca acaagcaagt gctgctggtt tcctaggcac gttcgcgact 
287761 ctcgatacgg catcctccct gaacaaacga actcggctcg atcgtttgta tcaagcctca 
287821 aaaaaaccgc tctcgtctgc gggggaccgc aactcccata gacccagcaa cgggactctg 



287941 atttggcttt cgagttgcga gagggatgaa atgagaatca ggccgtcgag attcgctcga 
288001 tcttgaagca tttgcacgaa cctgtggacg ccagattcca aagactgttc atagcaaatt 
288061 cttgactcat tcaatgaaac caactgaagg actctcatgc ccaactactg tcgcgatgtt 
288121 aacaagaatc ggaaagcatt cacgcttgtt gaactcctcg tggtaatcgc aataattggc 
288181 gtattggtgg gattattgct gccagccgtg caatccgctc gcgaagcagc acgacgcatg 
288241 agttgctcca acaacatgaa gcagcttggt cttgcaatgc acaactatca cgacgccttt 
288301 cgaagctttc cctatggtca cctcaatact ggactttccg tgacccacaa acgggattcc 
288361 tggtaccaaa acatcctccc ttttgttgaa caacaggcat acgccgaaaa atatcgaaac 
288421 gctgccacga tctacagtgt ctacgaagcg gaatggattc atcgcatgcc cctggaactc 
288481 gcaggggtcg ccgtccccac gttcatgtgt ccgtccgaac cctccggccc agctctcggt 
288541 ggcggcgggt ccgacaatgg tttccaaggt agttacggtg tttccgctgg tggaggcact 
288601 ttgcctcaag atcttctagg agggatcact ccggatgaga tcaacatcat taaaacggat 
288661 ccgggaggga tgttcggtga gacgaggacc tataagttca gagattgtct cgatggaacg 
288721 tctaacaccc tactcgccag cgagacgatc attcgcggcg cgagcggcgg aagttgggga 
288781 ggcatgggag gctattgggg tggttctcct cacggatcct tcggtttctc ttcagcagag 
288841 cccccaaaca catccgttcc cgatcgtgtg tattcctgca aaagcacaaa tttcccgaaa 
288901 tcaccttgcg aaaacgggaa cgctgacggt ctcacgggcc gatggaactt cgcccgcagc 
288961 catcacaccg gtggtgtatt aacggctttc tgtgatggat ccgttcggtt tgttaccgat 
289021 tcgatcgaac gtcagatctg gagagactta ggcaaccgtg ccgacggtca cgtcctgggt 
289081 gagttttagg cgggcataag ggagtcttca gatatgtgag ccgcctagcg ttcgccaagg 
289141 tctctacacg aaaacggggc taacgcccaa acggctcgca ttcttgcccc caatcattcc 
289201 agccaaacgg tttgagatac gtctacctga cgaacaggag cttcgttctc aaatcgtgaa 
289261 accttcccac gccgtacaca tcaaattttg ttgagattca cgtcaatgta caagaatgca 
289321 gtttgttcta tcgggctctt tatcacgccg ttgatactaa ctggttgccc agcgccgagc 
289381 ggtctcgagt tgtatccggt gactggtact gtgacattcg agggcaagcc gatcgatacc 
289441 ggccgaattc aattccgcac tgcctcggaa gagaggcgaa gtttcagtgc tgcaatcgag 
289501 aatggcaatt acgagatgga aaccctcacc ggtccgatga cagttgaagt ccgggcctcc 
289561 cggttgattg aaggcaagtt tgacaagtcc aaccccgacg aactcacccc agcgggcgaa 
289621 atgtacattc ctcaaaaata caattctcgc actgaactaa ccgcggacgt gccagcgggc 
289681 ggcgacacaa tcgacttcaa cttgcttggt tcctaaacag cccgctaatt cagttgtgct 
289741 ctgccaatca cggcagcaga accataccca acaaatgcaa gcaagttccg ccttgagatc 
289801 acttgcttgc gtatgttgaa attacgttgt ttgtcattcg ggggtgacga cagcgtaaat 
289861 ccagtaggga ttgtgttact gagcaacggc ccaagccagg cccgggaata tcaagccgga 
289921 tcgtaagctt cgctgcctag tcccctgccc cgtggcatgt acgcgcacat cacgatgccg 
289981 aggaagtaca ggaataacag cggaactgcc agagccacca tgctagtcac gtcggccggt 
290041 gtgacaatca tcgatatgac gaagattacc aacaccgcga ctcgccaact gttgatgtag 
290101 tcctgcgttt gaatcaaatc aatgcgttgc aggaacagca tcaccaatgg caattggaag 
290161 gcgacgccga atcccaatgg caacatcaac acaaagttga cgtaataggt cagccgaggc 
290221 tccaccgcga cgtccatcga tccattgaat tgcaacaaga acgtcagcac gtagtgcagc 
290281 accaacccaa acgcgagcac gacgcccgag acaaacagca ccacgctgaa cggtaggtaa 
290341 acgtagacat aacgtcgttc gtgactatgc aaaccagcgg ccacaaacga ccacaggtga 
290401 tagaagatca tgggggaggc aagcaccgcg ccaacaatca aacccgcttt cacccaaatc 
290461 atgaacggct cttcgatctt caatgaactc agcccgcgtt cactgcgacg gagctgaatc 
290521 atgggcacca attcatcggg gttggggatc gtccccatcg atttcatcaa ttccgcggtg 
290581 gtgatcctct ctggaatgtc cagcgcagcc gactcgcctg cttcggcttc gacatcggga 
290641 ctgttctttt catccggttc agattcgacg gccgtgccag tggcctccgg cggttccacc 
290701 atgtctccgg tcgccactgg gacgctcccc gatggcagcg tgtacaccac gtccgcgacc 
290761 agagaattgg acgtcaagaa ttcgtagaac cgactgacct gcggatcttc cttgcggtcg 
290821 ggtagcccca tctctttcag gtcgcgatcg gcgttgtatt cgatgatcgc ctgtttgagc 
290881 ggctcctgga tgaatcgcac cactcggttg gcgaacatca atccaaccgc caatccgatc 
290941 aacaaccaaa tgattgcttt gaccagagag ccacggagtt cttcgagatg ctccccgaac 
291001 gtcatcgtcg aattgtcgaa caaatcgtct ttgggtcgcg tcagtgcttc cacgagtttc 
291061 aatccaccag gggagggagg gtaccattgt gctgaactac tctgcaccga actactcgat 
291121 gccaactact cggcgtcgag ccattctaac tgaccattcc aatctcgcac ggtgccattc 
291181 atgaccgatc caacttcctc ttccaatccc gcgaccgccg tttccgtcga ttcggtttcg 
291241 cctcagacac tcgtgggaaa acgagttctg tcgtttgtgg gcgaaattta cgaggacttg 
291301 gagctttggt accccaaact acgcctgatc gaggcgggag cagaattttt tgtcgcgggc 
291361 ccaaaagcgg gcgaaaaata cgacggcaaa ctgggctacc cctgtgtaag cgaccttgcc 
291421 atcgacgctt gtgaggctga ttcctttgac gggttgctgg ttcccggcgg gttcatgccg 
291481 gacaaactga ggcgagaccc gaaggtattg caactcgtcc gtgactttga cgcggctaaa 
291541 aagccgatcg ccgcgatttg tcacggcggc tggattccca tctcagcagg cgtctaccgc 
291601 ggcgttcgcg tgaccggatc cccgggcatc aaggacgacc tcgtgaacgc gggagcaatc 
291661 ttcgaagacg cttcggtcgt ggtcgacggg caccacgtca gcagtcgccg ccctgacgac 
291721 ctgcccgatt tctgccgcca cttcatcgcc ctgctcgcgt aatcccaacc tgtagctcgg 
291781 tgctccccca ctgagacgag catggaccag ccgtcttcaa ccgtttgcgt gcgagggggg 
291841 acttgcctca aacggtcgca cgaacgacgc gagcctgtga gacgatggat cgctgcttgg 
291901 cggctgagcg cggcgagacg gaacaattag ggacaatggt tctactctgg tgcaatcacc 
291961 gaaccccaac atctcacggc ggggaccgcc gtgctacagg tttttggtgg ctgtgacatt 
292021 ttgtgaaact gaagcgtcgt gattccttgg cggagttgac ttgacatcga ttgtcgcgga 
292081 ttcgtagggt cggctcgatg gatgattcca aggttctttc cgatttggtt ggacggcgtt 
292141 cccgcgtgta ttttcgttcc cacgaacgaa tgactctctc ggcaccgatg cggtgttggt 
292201 gggcaacctg cgcgctggca attgtgatgc ttggttcagg atgctcgctg tttcgaaacg 
292261 gtgtttggtc cgatggatcg tccggcgaat tgaattcggg gtcgaatcac acacaagttt 
292321 caaagctcac ccacaagagc cgaaagacga tttcgttgca ggcggatttt cgtcacgtcg 
292381 gcacccaaga cctggacgaa tcgctgtggc agcacgttga tgaaaccgct tttccgccaa 



292501 ccttgcccga aacggcagaa gagaatgttg cctcgggcga tgacccaatc aatcgcttgc 
292561 tttcaaccgc cggggtgttg ggacgcagtc catcgggtat cgaaaccatt ccgttgcgtc 
292621 catcgcagcg acacgaactt ccgatctcca cggtgctgga tgggtctcat gtcgtactgg 
292681 ctcaagagaa cggcaagctc accggccgct cattggattc gccccagatg tcgctgtcgt 
292741 tgacaccaac gttgggcccc ggacctgggc aagtcacctt ggagatccgt cccgaaatcc 
292801 agcatggttc ggtccagcaa cgtttcatca gcagcgaagc ggcgacacgt ttggcgaccg 
292861 gacgctccac ttgggaactg ccggaaatga atctttcgtg ggccgccaat ccgcatctga 
292921 cgttgctgat cgctccggtg catcaaccag acgaaaacga accaacgttt ggattggcca 
292981 ggcaaatgct tcgcgacgcc gatcacatgc aagacgacca acatgtgatg ttgctgctgc 
293041 gaatcaacga tttgccgtct tcctgatctc cggaaaaatt ctgctgggtg gattcgctgc 
293101 cgtgcaaacg aggcggctag ctcatgtctt tcacacgagc gagattttcg cgatccacgc 
293161 atgcccccgc gaaggtttgc caccatttgc taaacttctt ttttcgatga ctgctgatcg 
293221 catctgtacc ggtggaactt cgttcatgaa agacgctcac tttggctgac actccctctc 
293281 ccgatcccgc gccgaccgtc aacgcaccac cggtctcgcc acccggccaa gcgctgtcga 
293341 tggacgaagt gaccgtgcgg caatccaccc cgctagacgc ccaggccatt cacgctttga 
293401 tgcgtccgtt tgtttcgcag catctgctgc tgtcgcggac cgaagccgaa atcatcgagt 
293461 tgacccgcca cgggttcgtc gcgatggtcg gtccacgctg catcggcttt tccgccatcg 
293521 aggtctacag tcccaagctg gccgaactgc aatgcttggc cgtgcacccc gaagcccaac 
293581 gtttgggact gggacgcaaa ttggtcgggc actgcatcga aagagcccgc acgctcggtg 
293641 tgatggaaat cctcgcgatc agctcctccg aagacttcct caaatcatgc ggcttcgatt 
293701 actcgttgcc cgaccagaag aaagccttgt tctgccaact ccgtcctcgc aacttcgatg 
293761 accactgaaa tgcaaccgct cgaatttttc aagcaatcca ttctgacgcc cagcccttcg 
293821 ggttacgaag aaccgatcca aaaactgatc ggcgaatacc tgaaaccgca cagcgacgaa 
293881 gtctcgatcg acgtgcacgg aaacctgacc gctcgggtcg gcgaagcggg tggccccaag 
293941 ctgatgttgg ctggtcactg tgaccaaatc ggcatgctga tttcacacat cgacgatcaa 
294001 ggattcttgt acgcccaaac gatcgggggt tgggacccac agcaactgat cggtcaatcc 
294061 atgacggtgt ggaccgacga cggtcccgtc tccgcggtca tcagtcgcaa gccaattcac 
294121 ttgttgtctc aacaggaacg cggcgaggtc gtcaagctgg aacaaatgtg gctggacatc 
294181 ggagccaaag atggtgacga agccaaatca aaagtccgca tcggcgactg cgtgacgttg 
294241 aaccttgcct accgagaact gctgggcgac atggtcagcg gtccgggcat ggataacaaa 
294301 acgggcatgt ggacggtcat tgaaaccgcc cgtcgctgtg cctcgtccga tcaagcgttg 
294361 cagtgcgaac tgcacagcgt cgcgactgtc caagaggaga tcggtctgcg tggagccaag 
294421 accgctgccg gtcgcatcaa cccagatgtg gccatcgcgg tggatgtcac gcatgcatca 
294481 gactgcccga cgatcgacaa acaacaacaa ggcgatatca agatcggtgg cggcccggtg 
294541 atcttccgcg gccccaacat caacgccaaa gtcgcaaagc gactgatgca attggcggac 
294601 gacaacgaca tcgcgtatca acctgcggcg ctcggccgcg ccgcgccgaa tgattccaac 
294661 gtgttgcagg tttctggttc aggcgttgcc acaggacttg tcgcgattcc caaccgctac 
294721 atgcactcag ccgtcgaaac gatcagcctc ggtgacatcg aagcgattgc gaaactgctg 
294781 acgctgttcg ctcaatcgct gacaccagaa tgcgatttca tccccggttg atcgagctcg 
294841 gctgaggaag ctgtgacgat tcgggttggt ggattgcagt gagagagaga gatctcacgg 
294901 cggagaccgc cgtgctacag gttgacgatc atcccagcct gtagctcggt ggtcccccac 
294961 cgagtcgagt gtcaaacgct tcaacttttg tgccaccgtt ctaattcggg aacggcctca 
295021 atccaagaag catctcggct ctctcggaca gcggtttgtt tgccagctca caatcggtgt 
295081 gctcagaaag ttgttcacaa acaagcctgt agaccgcttg ccgatcctgg ctggtgtcca 
295141 gttcaacgca ccgattctgg cttttgaaga actcgacggt cggcaacgtt tcctttttga 
295201 aggtatcaaa ccgcgacttc aagctcacca agttgtcgtc ttgtcgtccg gtgtacttgg 
295261 ctcgagccag aacgcgtttc tccagaacgt catacggaca ttcaaagaac agcatcttgg 
295321 gcaactcggt gtctttgcca aacacgtcgt accacgcttc caagtttgac tccgaccgtg 
295381 ggaacccatc gagcaagaaa ttccggttgc cggtttcgcg agtgatcttc tccatcgcat 
295441 cccgaagcaa cttcaccacg atcgtactgg gaactaaatt tcctgccgca atgatctctt 
295501 caatggccgc cgcgtgcggt ccattggctt cgcgttcagc tcgcaataaa tcgccggtcg 
295561 acaaatggac ccagcccagt tgggattcgg ccaactcgca catcgttccc ttccccgcac 
295621 cgggaccgcc aagcacaaac accacgttgg gttcggggca tggcaatcgt ggatcggagt 
295681 gatgaatcac aaccttgggt gaaggagcca cgcctttctt gtagacatgc caacgtcgat 
295741 ccgtgggagg aagctcgtct tcgcaggtga tgtcataagc gagatggcca tccagccgac 
295801 aaatcctcca tgagcgatac gtattggaat acgacagcgc aggacttctc gctgatttcc 
295861 catccgcacg atcccaagcc atcttgccgt tccaataacc tggactggaa acggtgcccg 
295921 attccgactc cgcaggcggc gcagtagagg gaacaaagag cccatccact tcgggaagtc 
295981 ccgcaccgat cacctcgatg tagaaggtat tgggagtcag ctcgttcgat tcagtcatca 
296041 ggaggagaac cttctacatt cgttgggcga cgcggtcagt ttcaggcgag caaaaggaat 
296101 gctctcagcc cagcttttcg gtcatcgttt cgcgagcttg ctggagtgct tcaggcaact 
296161 tgctggcgtc tttgccgccg gcttgagcca tatccggccg gccaccgccg ctgccaccga 
296221 ccaccttggc cgccgcaccg acccagtcac cggctttcaa accacgatca accagatctc 
296281 ggctcagtcc accgaccaac atgactttgt cgttcatcga cgaagccaac agcactgcgg 
296341 ttggcgtgtc gctcttcttg cgaatttgat cgatccaacc tcgcatgatg ttggggttgg 
296401 caccgggcgt ttcagcaacg atcagcaacg tgtcgccgac cttggttcca tctgcgatca 
296461 agtcatccgc ggaaatctta ccgcccgcgg tcacctgttt gagttgctcg accagcttcg 
296521 atcgatcggc caacagcgaa tccaagcgac tgagcacatc ggtgatcgca acgttcaaac 
296581 ggcgagtcaa accgcgaacc gccgcacgca cggcgttgta atcgctgatg tcggtcgttt 
296641 ccgctttcgc agccttggca tcaaagacaa attcagcggg atagtccgca gctttgcctg 
296701 acgacaattc tttcttcagc cggcgaactt cttcgatcaa cgccaccgtg gccgcagcag 
296761 ccaccgaagc atcgcaattc agtttgcctg ccacctcgtt cagcaacgct tgcgtttgct 
296821 cgcggtgttc tttggcacgc tcgcccgtca atgcttcgat ccggcgagtt cccgtggaaa 
296881 cgctttcttc gaccacgacc tcaaacgaac cgacttgttt ggtgttggtc aggtgagttc 
296941 caccgcacag ttcacgactg aacgttccca tcgaaaccat gcgacatggg tcgggatact 



297061 agcgaatttc atcgcctttg ccgaccatgc caagcacgtc ttgttcgatc ttgaccaacg 
297121 tctcgtcgtc gatggctttg gggttggtga aatcaaaccg caagcgatcc ggttcaacct 
297181 tgctgccttg ttgttcggcg tgccgtccaa catgagtgtg taatgcgtgg tgcaaaatgt 
297241 gagtcgcact atgggctcgt gccaacgccg tgcggttttc gacatcaacc tttgccgtgc 
297301 atgtttcgcc ttcgttgatc ttcccacgaa tcaatcgacc gtgatgaacg atcatcgacg 
297361 cgtgccgctg agtgtcgatg acctcgaact caaagttgtc gttggaaatc acaccgatgt 
297421 cgccaacttg gccaccggat tcaccgtaga acggcgaatg atccagaacc aatcgcagga 
297481 cagcatcttc ggggcgatca agatgactga gcagttgccc gtcgtcgcct ttgcctttgc 
297541 catcgccggt gatgatgcct ttgacgaccg cggtggcttc cgtttgttcg taaccaacaa 
297601 atggggtttc tcgcagagcc tctttcaacg tttccagtgg tccagtttgg aacagcacac 
297661 gttgtccgcc gtcgctttcg tcagcgtgtt tgtccatcgc ttcccgaaaa ccagaccaat 
297721 cgaaggtgaa gttctgttcg gcggcgaggg tttgaaccaa ctcgggcggc acgccatagg 
297781 tggtcagcaa atcagcggct tctgcaccgg gcaccatcac ggatgcttcg tcgttcatct 
297841 cttcgaacaa acgatggatg cgtttcatcc caccgtcgat cgttgagaag aaagccttct 
297901 cttccgattc aatcgcttcg ctgacacgct gagtcgtttg gccgagctct ggataagcgg 
297961 ccttggaggc gtccgcaact gcttcgacca acttgtacag gaagggctca cgcagattca 
298021 tttgatagcc gtccagcacc gcgcgacgaa tcaaacggcg gatgaccgac cgagcgtctt 
298081 tggggccggg atacacattt tcatgaaccg cgaaaacgct ggctcgcgcg tggtcggtga 
298141 tccggcgcaa gcgtcggcca ttgtcacttt cgtattcgta cttcacgccg cagacttccg 
298201 acgcggcttc gacgatcggg aacagactgt cgatgtggaa gttcgttggc actccctgca 
298261 ggacgctggc ggttcgttcc agtcccatgc cggtgtcgat gttcttgctg ggcaacggat 
298321 gcaagttgtc cgggggcgtg ccgacgcgat tgaactgggt gaacaccaag ttccaaattt 
298381 cgacgtcgct gccgtcttcc agttggtaat agatctcgct gcagggaccg cagactccgt 
298441 ccgggccttc gctgggtgcc gacgcgggcc agaagttttc gtcctcgtcc attcgcgaaa 
298501 tgcgttgggt cggcaatccg atcttgtcgt gccaaatgcc aaacgcttcg tcgtcatctt 
298561 tgtagaccgt gaccgtcaga cgctcgccgg ggatgcccag ccactttttg tcggtcagga 
298621 attcccacgc ccagtggatg gcttcttctt tgaaataatc gccgaacgaa aagttgccca 
298681 gcatttcgaa gaacgtgtgg tggaacgcgg tccggcccac attgtcgatg tcaccggttc 
298741 gcaagcattt ctggcacgtg gtcgcgcgag tgaaatccag tttgaccttt ccaaggaagt 
298801 ggtctttgaa ctggttcatc cccgcgggag tgaacaaaac cgatggatcc caagccggaa 
298861 cgagcacgtc gctgggttgg cggacgcaac ctttggtctc aaagaacgcc aagtattttt 
298921 cgcgcagttc gtcagttttc atcaggatgt gcgggtggat tcaacaagga aaagatcggg 
298981 gattgagcca cgggggctcg cgttccatcg tcggcatgat atcagtcgat gcccaacggg 
299041 tgacaggtgc cgaatttcgg cccacaatgc cggccgccag acggctgctg cgatgctttt 
299101 ctggccagtc gaaacgctcc gccgcgaaac taccaactca acttcaccac tggatatcga 
299161 cttcatgact cagcgtattg aacgacttgt gcaatccatc gccgattcgg accaaccgat 
299221 ggacgcgatt ttgatctgca gcgaagtcaa cgtgcgatat ctcagcggat tcaccggcga 
299281 cagcacttgg ttgctggtcc gtcccgacgg caaagccacc ttgctgtccg atcgtcgcta 
299341 cgaaacacaa attgccgaag aatgcccggc tctagaaagt gccattcggc cacccagcca 
299401 aactttggtg gcgctgctgg ccgagtattt ggctgattcg tcgttgaaaa cgatcggatt 
299461 cgaagccgac cacgtgcaag tttccacgat gcatcagtgg aaagaacaaa tcgaatccgt 
299521 ggaatggaca cagacgtccg gtttggtcga aacgctgcgt tccatcaaag acgccgacga 
299581 attggcgacc attcggcgag cgatttcgat tgccgagcga agttttctca gcgtcaccaa 
299641 caaactgacc ccgcggatga ccgaactgca aatcgctcac gaattggaag cgaccatgcg 
299701 aagcctgggg gcgtccggcg ttgcttttga tgtgatcgca ggtgccgagc ccagcggtgc 
299761 cctgccccac taccaccctc gcaacattgc cttggcggat tgccggactc tgctcatcga 
299821 ttggggggcc cgtgtcgacg ggtattgcag cgatttgact cgaacgctgc acaaagcgga 
299881 cgtgcgttca gcgaccgccg atcgatttga agccgcctac caggcggttc tggaatctca 
299941 agaagccgcg atttcagcca ttcgagatgg cgtggaagcc atcgaagtcg accgagccgc 

300001 tcgccaagtc cttcagaatg ccggcctggg cgacgccttc aaacacgggc ttggacacag 
300061 ttttggcttg gaaattcacg aggatccacg catgggaccg atgtccaccg atgtgctccg 
300121 cgaagggatg gtcctcaccg tggaacccgg ggtgtatttc gagggggaat ttggaattcg 
300181 catcgaagat gacatcctcg tgacagctga tggatttgaa cgactgagca cccttccgaa 
300241 gggtctcgat gattgtcgcc ttgttgtgta aaacgttgcg aacttactct tcaatcagaa 
300301 aaggttgatc gggtccgttc tggattcgaa aaacattgga ctttgcaggt atcggacttc 
300361 gtatgaaatc ggaaacaccc gacggcgggg atgtgtttga catcgatcgg attcgccaga 
300421 tcgtcgagtt gatggaacaa cacgaactga acgaagtgga cctgcagcag ggcgaagatc 
300481 gcatcaaact gacgcgtggt ggaacggctc cggccatgat tcccgctccc gttgcggcac 
300541 ctcctgtcgc ggccgctccc gctgcacctt ccgccgctcc ggcggcgtcc ggcggcgatg 
300601 ctcccgctga cacggcgggt accatcacaa tcaattcgcc gatggtgggc acgttctatt 
300661 ccaaggccaa cccagaatca ccgccgtttg ttcaggttgg tgatgtcgtt ggcgaagaca 
300721 ccgtggtgtg cattgtcgaa gccatgaaag tcttcaacga gattcccgcc gagtgccgtg 
300781 gaaagatcgt tgaagtgctc gttggcgatc aacaagccgt cgatttcggc aaacctttgt 
300841 tccgcgtcca ggtggacggc tgatccatgt tcgaacgtat tttgatcgcc aaccgaggcg 
300901 agattgcgct gcgtgtgatt cgcgcctgcc gtgaaatggg catcgagtcg gtcgccgtct 
300961 acagcgaagc cgacaaagat tccatgcacg tcaaactggc ggaccatgcc tactgcgtcg 
301021 gaaccaatcg cagtgcggac agctatctga aaatcgatcg catcatcgcc acggcagaag 
301081 ttgccggcgt cgatgccatc caccccggat acggcttcct ggcggaaaac gctcaattca 
301141 acgagcaatg ccgaagcagc gggtttgagt tcatcgggcc gtcgccaaca gcgatggaca 
301201 aattgggcga caaaaacacc gctcgttcca tggccatcgc tcaaaacgtc ccggtcgttc 
301261 ccggcagtga cggattgatt gcggacttcg atcgagccat cgacacagcc aaagaaattg 
301321 gctttcccgt tttgatcaaa gccaccgccg gtggtggcgg caaaggcatg cgggtcgccg 
301381 agaccgagga tgtcctcaag agccaactcg aagcagctcg caacgaagcc atcgcggcct 
301441 tcggcaacgg cggcgtgtac ctggaaaaat tcatccagca acctcgtcac atcgaagtgc 
301501 aagtcattgc ggacacgcac ggcaatgtct gccacctttt tgaacgcgat tgcagcgtcc 



301621 gtgaagcaat ctgcgacgcc gccgtgcgaa tgatcgccgg tgccgactac tccggtgcgg 
301681 gcacggtcga gttcatcgtc gacaaggact acaacttcta cttcatcgaa gtcaacgctc 
301741 ggatccaagt cgaacatcca gtctccgaaa tggtggccgg agtcgacttg atccaagaac 
301801 aaattcgagt tgccgcggga ttgccactgt cgttccgtca agacgaattg acatgcaccg 
301861 gtcacgcgat cgaatgtcga ctcaacgccg aagatcccac caagaacttc caaccgtgtc 
301921 ccggcaaaat tgaatcgatg tttgtgcccg gcggcatggg cgttcgcttt gattcacacg 
301981 tgaagtccgg ctacgtggtt cctccgtact acgattcgat gatcggcaaa ctgatcgtcc 
302041 atcgtcccac gcgagaacaa gccatcgcga cgatgctacg agccttgaaa gaactgcaga 
302101 tcgaagggat caccacaacg gtcggattcc ataactggat cctgcagaac gaagcgtttg 
302161 tgaacggcac cgtcgacacc aagttcgtcg accggaacta caaaggccaa acgtcttagt 
302221 gttgcgtcac tgctaggaaa tgtggtcctg gccgtagcgg aagtcgccaa gactttcggc 
302281 aatttaagat gcagatcgaa actcttgacg agttccgcta cccaaaacgt cgattcatgt 
302341 ggcataggct tccagcctgt gtttccctgc atcacaacag actggaagcc tatgtcactc 
302401 tgtattccgt tggtccccaa ggaataacca tgcgagtctt tgtgactggc ggcacaggct 
302461 tgctgggcaa cacgatcctt cgccaactgt cggacgccgg tcatgacctg atcgcgctcg 
302521 tcagaggcga accggatccc caagttttcg aagggatcaa cacccaattc gcccacggcg 
302581 atttgctcga tcaaacagcg atcgaaaacg ccatgtccga ttgcgatgcc gtcattcatt 
302641 ccgccggcat gatccacctt ggttggaaac ggctcgacga cagcatggcg gtcaatcgcg 
302701 acggcaccca aaccattgtc gatgcatgcc tgcatcacga ttgcaaactc gttcacgttg 
302761 gcaccgtcaa cacactgggt attgccgacc gcaacggcgt cgcgaacgaa gacaccccgc 
302821 tggatcacgc gggcggccaa gtcccatgca gctacgttct tagcaaacgg gccggaaacg 
302881 aagccgtcat gcaaggtgtg aaagatggct tgaaagccgt cttggttcac ccaggattca 
302941 tgctcggacc gtgggactgg aaaccgagta gcggtcggat gcttctcgaa gtcagcaagg 
303001 cctggcgccc tctttctccc tcaggaggca acagcacctg cgattcccga gacgtggccg 
303061 ccgccaccat ccgcgccgcg gaaaagttgc tcgccaatga aatcccgtcc ggccggcaat 
303121 acattttggc cggccacaac atgcggtacc tcgaactttg gaaggcgatg accgaggcga 
303181 tgggagcccg ccagccaatc atgcgtgccg gaccagctca gcgatggatc gcgggcgttg 
303241 caggcgacat cgtcggcaaa ttgctgcccc atgaagtcga cttcaacagc gcagcggtcg 
303301 gcatgtcgag ccaataccac tactacgaca gctcgcgagc ccaaagcgaa ctcgactaca 
303361 agatccgccc cttcgaagaa agcctcaccg acgcggccaa ctggatccgc cagcatcacc 
303421 gatgaggccc ggcaaaaaca cactctccac tgcaaacacc aggattttcg ccatcttctg 
303481 aaattaggcg taatatttgg ctcggacatc cgtgagactc tttgaacagt cggtccacct 
303541 acctggaact gcgttttgga ttttgattcc accgccgccg aaccaaccac tcgcgaaagc 
303601 ttacttttgc gactgcgtga ttccaaggac gaattcgcct gggccgagtt ctccgcaatc 
303661 tatgagccgg tgatctaccg cttggtcaag cggcgtggcg tgcaggacgc ggatgctcgt 
303721 gaaattgtgc aagagatcct gatgtcggtc gctggtgcga tcgagcggtt tgacgtgacc 
303781 ggccaaggat cctttcgcgg ttggctcagc cgaatcacgc ggaacgcaac catcgaccga 
303841 ctgagatccg ttcaatctcg ccgcgaactg accggcggca gcagcatcgt ccgaatcatc 
303901 gacgccgctg cgtccaacga acgaggcctc gacggcttcc aacaggatgc ggaattgtcc 
303961 cacgagttcg atcgggatca tcggcaacag ctgtttcgct gggccgccag caaagtgcga 
304021 tcgcgaacgg gcgaaaagaa ctggatcgcg ttctggagaa ctgccgtgga ggaccaatcc 
304081 atcgcggccg tcgcctccga actgggcatt accgagggat cggtctatgt cgccaggtgc 
304141 cgaatcctca aacggattcg tgaacttgtc caccaacgtt tgagtgattg atcgcatcat 
304201 gattgccaag caagacccac actttgaaat ggacaagttg cgtgaattgc tcgatgagcg 
304261 attaccggtt cccgaagcgg atcgcatcga gtatcacctc agcaattgcg aacgctgccg 
304321 atcgcaattg cgagcaatcg ccggcgagcc gatgtggtgg gacgaaaccg tcgaagtcct 
304381 cagcaagagc acattcatcg accaaagcag ccggcctact agccaaccgc caagcgatcc 
304441 agcccagccc cagcgatcct tcgacgcgtc gttggactgg atccgtccgc tgctgcaacc 
304501 actgcgggat caacactcag acgccgtcga cgacacgcgt gcaattggca ccttggatca 
304561 atactcgatt cacagtgtga tcggacaagg cggtatgggc gtcgtgctac gaggcatgga 
304621 tccggaactg aatcgaccgg ttgcaatcaa agttctctcc ccacaccttg ctggcgttgg 
304681 tgcggcgcgg acacgcttca tgcgagaggc ccaagctgcc gcggcgattg tgcaccctgc 
304741 gattgtgccg atttacagcg tcgtcccgag cgctcgattg ccctacctgg tgatgccctg
304801 catcgacggc ggaaacctgc aacaacgcct ggatcgcgaa ggcccgctgg aactgaccga
304861 ggtcgtcaga atcggcttgc aaatcgccga aggattgtcc gccgctcata aaaacagcgt 
304921 gctccaccga gacatcaaac ccgcaaatat tttgatcgaa gagggcaacg gtcgtgtctt
304981 gatcagcgac tttggactcg cccgggcact cgatgacgcc acgctgacca acagcggaat 
305041 gatcgccggc acaccccagt acatgagccc agaacaagct cggggcgaat ccctggacgc 
305101 ccgcagcgac ctgttcagct tgggcagtct gctctacgcc ttggcgtcgg gacgcgcacc 
305161 cttccgcgcc gagtcaccgc tgggaatcct gcgaaagatc accgagacac gagccaaacc 
305221 cattcaagag atcaacgaac ggatgccccg ttggttcgac aatttggtcg ctcgcttcat 
305281 ggaacctgac ctcacgctca ggattgaatc cgcagacgag gcggtctcgc tgcttcgtga
305341 agtcaacgcc catgtccgaa acccaactgg caatcgactc ccgaactcgc ttgaagaacg
305401 tcgtccgcga cgttcccttc gatggacgct cgcagccgcc ggagtgctga cggcggtggc
305461 cagcgctgca acttggaatc accgcaagca ggaaaactct cacacggaac ggtcgctgcc 
305521 aacggtcatc gaacaatcgc gacggttgac gacgatggca ccaagcgtcc ctgcgccccc
305581 cgccccaacc accgacgact gggacgcagt tcacttcaac gcagaactgc attcgattca 
305641 gttcatgttg gagcaactgg caaaccaact cgaaggcacc gacccaacag tcaatcccga
305701 cctcaccgat ggagatcaaa aatgaaggat gtagctcaaa tgaattcggt tcccactccg
305761 gcattgccca agcgacgaag ccgcgtttgg aagtctgcat tcatcgcaac attgacgatc
305821 cccacggccg ccgtcttggt ctggtcgtgg acctccctgc cctcaacatg ggcccaaacc 
305881 cgggatccgc gatccatcga ccaaccgcga cgcttgaact cttatcccca acctaacaat 
305941 ctaccgtacc aggagtcgcg cgcgcaaatt gcaagcgaac tgagcctctc acatcaggca 
306001 tatcccaaca gcgatcacca atcgcaacca ggattcaaca accaatccaa gacgacgctg
306061 cgaatcgaca aacaagcaca acccgcccaa gaagtcgatg taccagtcac gcgtcgaacc 



306181 caactgaccg gccgcctgat gaccacagag gattccaaac acaacgaaag aatcaagcaa
306241 ctggtcaacc aactgcgggg atcaacaagt agcaacgaga tggacgccaa gctggctgaa
306301 tttcgcagcg aactggaaca ggagttcaag cggatgcacg agcagcaagg ctcggagatc 
306361 gcagcgattg agaaacgcct ccaatcgctc aaagagatcc atcagcaacg tgctgaaaac 
306421 caagacaaaa tcattcagcg acgaatcgac caattgctcg gccgcagcga tccgctggac
306481 tggaactaca caccacccaa cactccattt gccccgctgt cgccgtattc ctcaaatcaa 
306541 tacggcagca gcccgatcag cgataacttc cacccaaagt cgccttacgc atacccagaa
306601 agcaactccc ctcgttcaac ccgttcatgg caaccaagcc ccaaccggcc cccgagtcct 
306661 tcgccgcaaa aaccagcgtc cagaccaatc cctccatctt accccggcaa actcatggaa 
306721 gcctacgagt ctgcattgaa agcaagcgac gcagcagtgc ttgagcagtc aaacaaggaa 
306781 cgactcaacg aatacactcg actgcatgaa gaaggcgcgc tcactgcctc cgaattgcga 
306841 agagcaacac tggactacgc atccgcccaa aagcaaaccg ccctgctgaa gctgcaactt 
306901 caatcaatcc agcgtgactt ggctcgctat ttaaatgcag ccgaactgga gctagctgcc 
306961 ctgaacgagc agcaaaaaac aaatccttca gccaactcca agtctgcaca acttgaacgc 
307021 caacgcgccg aagccgcgat cacagatgca aagaccacac tcgaacagtt cgaggctgcg
307081 atgaaattga tccccgtcga cacggattcg gcggaaccat ccgatgcacc aaagaatcca
307141 aacgaaagtg cagatcagcc agaatctgaa ctggacccga caaccaacgc ccccaaagca
307201 agcgaagaca caactctgaa cctagaaaca gaagaagtcg aaacgacgcc aaatgagccc
307261 gttccgagct tttgaacgtc gcttggttca ggttgaaaaa caggctactc cagggtgatc 
307321 acatccttgg tcactttcat gatcatcacc gctcgcttgc catcatccgc gtgagtgggc 
307381 aatccagttc gtgagtcagc ggccccgacc aggaaccact ttccaacggg tgccctgacg 
307441 gtggtcgcga tggtcattcc caattctggc tttgttcgga actcaacctg agcctgaact 
307501 tcacccgact ccgtcgtgtt cagaaaccga acctggcact gcaattgacc gtatcgcgag 
307561 cctccatcca cacgagcgtc cattccgacc gtgacagatg ccatggcact gcccgcggga 
307621 accaatcggc caaatccgac tgttgacaag gcctcgcgcg cgggccccgc atattcaatc 
307681 cggttcgcat cgtcggattc gatcaaccac aaagcctgaa aagaaacatc cttgatcgga
307741 ccagctggga cagccgaaac cggctgctgc gcaaccactt cttgagcttc aatcgcggaa
307801 aacattgcca ccggaatggc ggcaagtaca agcggagcaa aacgcgagcg gacgcatgag 
307861 aagaatcgat cgagcatttg agcgaaccta agcaatgggg ggatgttcca tgaacaaact 
307921 ttggacggat gcattccccc aaagttccat cccaatctag ctcactgggc gtttccgctg 
307981 acctggtttt gctgatcggc tcgctttcga aacgcgatcc agcacaggat caccaacgag
308041 atcgccaagc agatcccatt gaggctggtg aaaactctca ccacaacctc cggcaaatca
308101 aaccgcaaca agaacgttcg cgtcgataag tagttcaatg ctggaaacgc catcaacgca
308161 cacaacaacc accgcagttt gggcgaaaca tttgaccagc catccagtct cgccatccaa
308221 gccccggcaa ccggagcaat catgagcagc gaatcgtacg actgatgata caaactgacc 
308281 aacaacgcgg tcagcaaaac gccaccggtc acgccagcaa tcccgtcgtc ctcgccacgc
308341 cgggctcgca caaacagaac ccacatcggg acaaccaaca gcgccgccat cgccaaaatg
308401 tgtggcagat ctcctgggtc ttctccccgc catttcgcga tgatggcgaa gagatcaagt
308461 cgcgtccatg aatacacggg cgactcatcg tccatcgaac ggtgaacttc ctgcgtgtcg
308521 acgatttgct gctgcaggat cgaaactcct gcaacgatgt cgccgtcacc ttcgtggtgt 
308581 gcgatccaag ccagcagcaa cgcggcgccg gcaatgctga gcacggcacc gatgatcaac
308641 gctcgccaat tcccgcgagc caacatcagg aacccaagcg gcagaatata ggttggcttt
308701 gccgagacca ccaacaacgc aaacgccgct cgaaccggtc ggcgatccgc ccaaacgacg
308761 gataaaaacg ttgccaacac caactgaaag gtgaagtagc cgttgaacaa tgtgatgtgc
308821 ccaccccggc tgtagaccat cgccatggca atgcaaagaa ccgcgtcgta tcgttttgga 
308881 tacccagctt cacgagccac caacacagcg atcgcagcca gcagaaacaa ctgcaggatc
308941 acgttcaaca cttcgccaac atgcaatggc aacaacgcca gcggcgcgtg catcaacaga
309001 atcaccggcg agaagaatgg aatctgtctc gcgactggat acttcgccgc ataagcatcg
309061 ctgtacggac tttcccccgc caacaccgct cgcgtgggaa agtagattcc gttgtgaaaa
309121 tcacacaaac cctgattctc gggatcaaac ggaccaggcg tttgatagtc cacgacaatg 
309181 cgcgagaacg tcgccaccaa acccgccagc aacagcatgg cagcaatgat cgcgacggtt 
309241 cgattcccga cggtccaacc ttcccgcggg gcagccgaca tcaggcggtc tgtgtttcca
309301 tcagaggacg gccagtcatt tcaggtggaa cttccaaacc catcaaagcc aaaacggtcg
309361 gtgcgatgtc agccaaacgc ccgccttcgc gaagcggttt accaacgaac tccggatcca
309421 ccacaatcaa cggaacctga taagtcgtgt gcgccgtatg aggtccgccg gtttcgggat
309481 cgatcatttg ttcgcagttg ccgtgatcag cagtcacaac cagtgatccg cctgccgcca 
309541 acgtcgcgtc cacgactcgg ccaacacaag catccacttt ttcaaccgct ttgatcgcag
309601 cttccaaaac ccctgtgtgc ccaaccatgt caccgttggc atagttgacg atgatcatgt
309661 cacacttgcc cgacttgatt tgcttgagca ccttttccgt gatgtcctcc gccgacattt 
309721 caggcttttg atcgtaagtg gaaacgtcac gcggcgatgg tgccattccc cattcttctt 
309781 cctcgaatgg attgtcgcga tagtcgttga agaagaacgt cacgtgtggg tacttttcag 
309841 tttccgcaca acggaattgg tgcagcccct tggaagcgac gtactcgcca agaatgtttg
309901 gcatcttggc aggtttctca aagatcactt tgaccggcaa accagtctcg tatcccgtca 
309961 tcgttgcgaa atacagattg tcgatcttct taccgcgatc aaacccaccg ccagggatgt
310021 ccttccaagc cgcatcatcg aatgtgaacg ccttggtaat ttcgcgagtc cggtcgcctc 
310081 ggtaattcat gaacaacact gcatcgccgg gctgaacagt gatgggtgat ccaccttcag 
310141 gaacgatgga cgatgccgtg atgaactcgt ctccatcgcg actgctttca gtcggattgg 
310201 cgtagtagct ttcgattgca gcactggcac tcgggaaggt ttgccccgtg ccctgagtca 
310261 tcaagttgta agccgcctcg actcgttccc agcgaaggtc tcggtccatc gcgtaaaagc 
310321 gtccgatgac ggacgcgacc acgccgacgc cggaagcttc gcagtggtcc gtcaattgtg
310381 aaacaaactt cagtccgcca cgaggcgaag tgtcacggcc atcggtgatc gcatgaatcg
310441 ctaattgatc gccagtcaaa ccgtttcgtt tcgccagatc gatcaccgcg atcccatgct 
310501 gcaaatcact gtgcactcgg ccgtcggaca tcaaccccaa caaatgcaac cgcccgccac 
310561 tctttttgac gtgatcgatc gcgccggtga cgactggatt gctaaagaat gaatcgtccc 
310621 gaatcgctcg agtgattcgc atgacctctt ggtcaacgat ccgtccagca ccaatgtttt 



310741 tatgaatcag cacactggga tactcctgag tcagcttgtc atccacgggc gtgttggcca 

310801 ggtgaacagc attggactcg ttccattttg gatcagggtt ttggccccaa ccatcacgaa 

310861 cgatcaaaac aacgggacga cgacgggtgg cagtcatgaa agtcaaaacc aaagaatgtg 

310921 cgtaacagga acgatcccag tcgagtgttc acaccaggga tcggagcccg tccggcagga 

310981 accgacacgg gcaggatcaa ccaatcgcaa aagatgaatg catcaagctt ggcgacggca 

311041 aattagccgt cgacgttgga cgaaccagtg actttgtcac tgtacatcgc cagtttgccg 

311101 agcacttgat cgatggcatg tgacgagacg ttgtttgctt ccaacgcatc ggtcaaatgg 

311161 ccacagaatt tggagaaatg aactccggtg attccacgtc cggcatgaac cttcgtcagg 

311221 tccgcaccgg tgtactggat atctccggaa gtgatcgaag cgatgaactc ggtttgcatc 

311281 cgagccagct tttcgatcgg gactccatca aagaagtgag tcaactctgg atccgcaaga 

311341 acccgaacgt acatctcgtc gacaacacga cggacgccat ccattcctcc gagctggtcc 

311401 agtaagtcgg attcagattg actcatgatg gaatatctcc agtcagatta cataggggtc 

311461 ggttcgtaag gaatgtcttc gatcaaatcc atgggcaatg gagccacggt catgccggct 

311521 tccttgcaga ttccatcgat cgcttcctgg gacttctgca ccaagaattc tcgagtctgt 

311581 cccttgtact gagacgtcga tgccgttggg ctcatcgggc cgctgaaatt ggtttcgttc 

311641 aggtggctga cgaccttcac gtgatccaac gtcatgccat tcatttcagg aagcagacga 

311701 tcggtcgact tcacagttgc caagtcagcc tcttcaggaa cagagccaag gcgaagagca 

311761 acaatctttt cgcccgagat ttcgctgagc tgatccattg caccgtcgcc gacaacccaa 

311821 tcccaggtat cgatgacata gccgacgttg tcgttcggca ccgcatcgac caacgcacgg 

311881 aaggtttcga cgttgttgac gaaggtgaat tccttgcctt ccaaaagctc agcaccggca 

311941 cggaaaccaa tcccaagttt gatgtcgcgt tgccccaaca cgtctgcgat ttgtcccaaa 

312001 cggttgcgtt ggacatcgaa gaactcgttg aatggcaagc gattcgttgc agcaggaaca 

312061 cgaatcaacg cacgctcggc acccaattgc tgagcgattt cgccgagcgg gtgcagttga 

312121 gcaaccgctg cggtgaaagt gtcgtcatcc gagtccagat cgattgccaa ttggaacgaa 

312181 ccaatcttga tttccgacgc cttcagatat ttcgaagcgt cttcaaacga agtccgttgc 

312241 gaacgacgaa gcatgtcgtg catgtcgacg tccatgccac ggaaggcata agtcagtgcc 

312301 agttcgatca gctcgctttg acgaccgttg atgccgagcg cggaaggaca gaagttttta

312361 aacatggtga ttccttgatt gatttgcttg gattgttttg gatggtttta cggttgggac

312421 agcgaatgcg gcgaccaacg ccgcaatcaa tatcaacgac gacggccgcg agaatttcca 

312481 ccgcctgatt gaccgcgacc tccgcgtgga ggctgagtct tacggccctt caaattgctg 

312541 tcgacaatct gattcacgat ttcgtcccaa gtcggaacat ctggatcttt cttgcgagaa

312601 tccttgcgtt cgccgcgacg tccacgcgac gaagatcgct cgccatcgcg ggctgagccg 

312661 ccggaacgct ctgaactact cgaacgctct gaaccgccat ctcgtgtgga accgcccgag 

312721 cgttctcggt cattgtcccg atcacgattg cgaccaccgc gagaccgcga acgacctcgg 

312781 gctcgagttt ctttctctcc gtcggtttct tcgcgttcgg tatccaccga atcgtcatca 

312841 tcggcaaacg cgaatcgtgg caactcgtca tcaaacgagt cgtcatcttt ttcggattcc 

312901 gaaccgcgac caccgcgagc cttcacctcg gaagaatcac gctcttcgcg tccgcgccca 

312961 ccgcgtcgac gaccaccacg acgcgaacgt cgacggccgg cgcgaggttc gccatcatct 

313021 gaatcttctt gtgatcgctt ggagtctcga tcagcaacgg gctcatcgac cgagccatcg 

313081 tcgacgaaca agtccgccgc gaaggaatcg tcgctggatt ccagatcgga atcgtcccaa 

313141 gcctttgtac gagtttcttc acgtccgccg gagcgacttc gatcaccacg tccagacttt 

313201 gatcgatctg aaccagatcg atctgaatcc gttcgatctg ccttggggcg ttcttcagtg 

313261 ccgcgatcgg aatccgaacg ttcgccacgt cgtccaccgc gacgtcgacg acggccgcga 

313321 gtttcggatg aatcggaatc agattcttcg cggtcaccgc gaggcgattc cgagcgagct

313381 tctacgacga cgtccgaatc ggacattttt tgtcgttgac cacgacggcc accacgacct

313441 ccgcgagacg acgatgattc cgctggcttc gattcgccgg actcgtcctg tgaatcgtcg 

313501 gcatcacggc cacgtcggct gcgtccgcca cgtcctcggc cacccgcgtt ccgatctcga

313561 tcggactttg cggcgggagc tgaatcgcct tcgtcgacgg aatcggcatc ccaaccggtg 

313621 atttccaccg tgtctgatgc gatgagatcg gactcatcat cgaaggcttc tacatcggaa 

313681 gcatcctcgt cgtcgaaatc atcattgccg agcaaaccgc tagcgaatcc gccaccgcca 

313741 ccgttaccct ttttacgtgg aggctcgccc aaatcgggac cgccacggtg cttgcgatca

313801 cgcttggaag cgggagactc atctccacca gtcttcagct gcgattctcg cttggagatt

313861 ttgttttcag ccacgccgaa tccagatggg acgtcatctg agtcatcgtc atcggcaacg

313921 ccaaaaccag gcaacttcgg cggtggtttg acggcggcga gtgcctgcaa cacgtcatcg

313981 gcaccttcgt catcgctcga gaaagcggag gctttcgcgg gtgcggtttc cgacgtctct

314041 tttggcttgg gcgctgcgtc agattccttt tctttggatt cgcaggattc cgcggagggc
314101 tcaggttcag cagcaggttt ttcaggttcg actttggcag cggggcgatc cggcggcttg 
314161 ggtgtcccca acaaactcgc gaggatattc caattgtctg gcatgtggtt tccaatgcga 
314221 cgccaaccag cgaatcgatg gagccaatca aaaatggctc ggaatgcgtt agctaggcgt 
314281 cgacaaaaag tgtgcaatga tctgaactgg cttcaatcgc ggcgtggact ctgcctgcgt 
314341 gcaggaaacc ggttcgtggg ccgaccgaca tcgattgatc gggcttcgct ccccatctct 
314401 gacacccttt tctctacaag agtgtccctg tggtacccga attttctcga 
LOCUS (LOC) : AR268952 GenBank (R)
GenBank ACC. NO. (GBN) : AR268952
GenBank VERSION (VER) : AR268952.1 GI:29699689
CAS REGISTRY NO. (RN) : 504700-37-2
SEQUENCE LENGTH (SQL) : 27
MOLECULE TYPE (CI) : DNA; linear
DIVISION CODE (CI) : Patent
DATE (DATE) : 10 Apr 2003
DEFINITION (DEF) : Sequence 6 from patent US 6500643.
SOURCE : Unknown.
    Unclassified
NUCLEIC ACID COUNT (NA) : 5 a 6 c 5 g 10 t 1 others
REFERENCE : 1 (bases 1 to 27)
AUTHOR (AU) : Wu,D.-H.; Gu,Y.; Millard,W.J.; He,Y.-J.
TITLE (TI) : ***Human*** ***high*** ***affinity*** ***choline*** ***transporter***
JOURNAL (SO) : Patent: US 6500643-A 6 31-DEC-2002;

FEATURES (FEAT) :
Feature Key Location Qualifier
source 1..27 /organism="unknown"

SEQUENCE (SEQ) :

1 gggtcattgt aarttatctt cagtccc
COPYRIGHT 2004 on STN
LOCUS (LOC) : AR268951 GenBank (R)
GenBank ACC. NO. (GBN) : AR268951
GenBank VERSION (VER) : AR268951.1 GI:29699688
CAS REGISTRY NO. (RN) : 504700-36-1
SEQUENCE LENGTH (SQL) : 25
MOLECULE TYPE (CI) : DNA; linear
DIVISION CODE (CI) : Patent
DATE (DATE) : 10 Apr 2003
DEFINITION (DEF) : Sequence 5 from patent US 6500643.
SOURCE : Unknown.
ORGANISM (ORGN) : Unknown.
    Unclassified
NUCLEIC ACID COUNT (NA) : 5 a 7 c 8 g 5 t
REFERENCE : 1 (bases 1 to 25)
AUTHOR (AU) : Wu,D.-H.; Gu,Y.; Millard,W.J.; He,Y.-J.
TITLE (TI) : ***Human*** ***high*** ***affinity*** ***choline*** ***transporter***
JOURNAL (SO) : Patent: US 6500643-A 5 31-DEC-2002;

FEATURES (FEAT) :
Feature Key Location Qualifier
source 1..25 /organism="unknown"

SEQUENCE (SEQ) :

1 gccatggctt cccatgtgga aggac
LOCUS (LOC) : AR268950 GenBank (R)
GenBank ACC. NO. (GBN) : AR268950
GenBank VERSION (VER) : AR268950.1 GI:29699687
CAS REGISTRY NO. (RN) : 504700-35-0
SEQUENCE LENGTH (SQL) : 1893
MOLECULE TYPE (CI) : DNA; linear
DIVISION CODE (CI) : Patent
DATE (DATE) : 10 Apr 2003
DEFINITION (DEF) : Sequence 3 from patent US 6500643.
SOURCE : Unknown.
ORGANISM (ORGN) : Unknown.
    Unclassified
NUCLEIC ACID COUNT (NA) : 430 a 570 c 516 g 377 t
REFERENCE : 1 (bases 1 to 1893)
AUTHOR (AU) : Wu,D.-H.; Gu,Y.; Millard,W.J.; He,Y.-J.
TITLE (TI) : ***Human*** ***high*** ***affinity*** ***choline*** ***transporter***
JOURNAL (SO) : Patent: US 6500643-A 3 31-DEC-2002;

FEATURES (FEAT) :
Feature Key Location Qualifier
source 1..1893 /organism="unknown"

SEQUENCE (SEQ) :

1 atggcagcaa aaactcccag cagtgaggag tctgggctgc ccaaactgcc cgtgcccccg 
61 ctgcagcaga ccctggccac gtacctgcag tgcatgcgac acttggtgtc tgaggagcag 
121 ttcaggaaga gccaggccat tgtgcagcag tttggggccc ctggtggcct cggcgagacc 



241 ctgaatgaca tgtatctcaa caaccgcctg gccctgcctg tcaactccag ccctgccgtg 
301 atctttgctc ggcagcactt ccctggcacc gatgaccagc tgaggtttgc agccagcctc
361 atctctggtg tactcagcta caaggccctg ctggacagcc actccattcc cactgactgt 
421 gccaaaggcc agctgtcagg gcagcccctt tgcatgaagc aatactatgg gctcttctcc 
481 tcctaccggc tccccggcca tacccaggac acgttggtgg ctcagaacag cagcatcatg 
541 ccggagcctg agcacgtcat cgtagcctgc tgcaatcagt tctttgtctt ggatgttgtc 
601 attaatttcc gccgtctcag tgagggggat ctgttcactc agttgagaaa gatagtcaaa 
661 atggcttcca acgaggacga gcgtttgcct ccaattggcc tgctgacgtc tgacgggagg 
721 agcgagtggg ccgaggccag gacggtcctc gtgaaagact ccaccaaccg ggactcgctg 
781 gacatgattg agcgctgcat ctgccttgta tgcctggacg cgccaggagg gctggagctc 
841 agcgacaccc acagggcact ccagctcctt cacggcggag gctacagcaa gaacggggcc 
901 aatcgctggt acgacaagtc cctgcagttt gtggtgggcc gagacggcac ctgcggtgtg 
961 gtgtgcgaac actccccatt cgatggcatc gtcctggtgc agtgcactga gcatctgctc 
1021 aagcacatga cgcagagcag caggaagctg atccgagcag actccgtcag cgagctcccc 
1081 gccccccgga ggctgcggtg gaaatgctcc ccggaaattc aaggccactt agcctcctcg
1141 gcagaaaaac ttcaacgaat agtaaagaac cttgacttca ttgtctataa gtttgacaac 
1201 tatgggaaaa cattcattaa gaagcagaaa tccagccctg atgccttcat ccaggtggcc
1261 ctccagctgg ccttctacag gctccaccga agactggtgc ccacctacga gagcgcgtcc 
1321 atccgccgat tccaggaggg acgcctggac aacatcagat cggccactcc agaggcactg 
1381 gcttttgtga gagccgtgac tgaccacaag gctgctgtgc cagcttctga gaagcttctg 
1441 ctcctgaagg atgccatccg tgcccagact gcatacacag tcatggccat aacagggatg 
1501 gccattgaca accacctgct ggcactgcgg gacgtggccc gggccatgtg caaggagctg 
1561 cccgagatgt tcatggatga aacctacctg atgagcaacc ggtttgtcct ctccactagc 
1621 caggtgccca caaccatgga aatgttctgc tgctatggtc ctgtggtccc aaatgggtat 
1681 ggtgcctgct acaaccccca gccagagacc atccttttct gcatctctag ctttcacagc 
1741 tgcaaagaga cttcttctag caagtttgca aaagctgtgg aagaaagcct cattgacatg 
1801 agagacctct gcagtctgct gccgcctact gagagcaagc cattggcaac aaaggaaaaa 
1861 gccacgaggc ccagccaggg acaccaacct tga 
LOCUS (LOC) : AR268949 GenBank (R)
GenBank ACC. NO. (GBN) : AR268949
GenBank VERSION (VER) : AR268949.1 GI:29699686
CAS REGISTRY NO. (RN) : 504700-34-9
SEQUENCE LENGTH (SQL) : 1743
MOLECULE TYPE (CI) : DNA; linear
DIVISION CODE (CI) : Patent
DATE (DATE) : 10 Apr 2003
DEFINITION (DEF) : Sequence 1 from patent US 6500643.
SOURCE : Unknown.
ORGANISM (ORGN) : Unknown.
    Unclassified
NUCLEIC ACID COUNT (NA) : 411 a 395 c 405 g 532 t
REFERENCE : 1 (bases 1 to 1743)
AUTHOR (AU) : Wu,D.-H.; Gu,Y.; Millard,W.J.; He,Y.-J.
TITLE (TI) : ***Human*** ***high*** ***affinity*** ***choline*** ***transporter***
JOURNAL (SO) : Patent: US 6500643-A 1 31-DEC-2002;

FEATURES (FEAT) :
Feature Key Location Qualifier
source 1..1743 /organism="unknown"

SEQUENCE (SEQ) : 

1 atggctttcc atgtggaagg actgatagct atcatcgtgt tctaccttct aattttgctg 
61 gttggaatat gggctgcctg gagaaccaaa aacagtggca gcgcagaaga gcgcagcgaa 
121 gccatcatag ttggtggccg agatattggt ttattggttg gtggatttac catgacagct 
181 acctgggtcg gaggagggta tatcaatggc acagctgaag cagtttatgt accaggttat 
241 ggcctagctt gggctcaggc accaattgga tattctctta gtctgatttt aggtggcctg 

301 ttctttgcaa aacctatgcg ttcaaagggg tatgtgacca tgttagaccc gtttcagcaa
361 atctatggaa aacgcatggg cggactcctg tttattcctg cactgatggg agaaatgttc 
421 tgggctgcag caattttctc tgctttggga gccaccatca gcgtgatcat cgatgtggat 

481 atgcacattt ctgtcatcat ctctgcactc attgccactc tgtacacact ggtgggaggg
541 ctctattctg tggcctacac tgatgtcgtt cagctctttt gcatttttgt agggctgtgg 
601 atcagcgtcc cctttgcatt gtcacatcct gcagtcgcag acatcgggtt cactgctgtg 
661 catgccaaat accaaaagcc gtggctggga actgttgact catctgaagt ctactcttgg 
721 cttgatagtt ttctgttgtt gatgctgggt ggaatcccat ggcaagcata ctttcagagg 
781 gttctctctt cttcctcagc cacctatgct caagtgctgt ccttcctggc agctttcggg 
841 tgcctggtga tggccatccc agccatactc attggggcca ttggagcctc cacagactgg 
901 aaccagactg catatgggct tccagatccc aagactacag aagaggcaga catgatttta 
961 ccaattgttc tgcagtatct ctgccctgtg tatatttctt tctttggtct tggtgcagtt 

1021 tctgctgctg ttatgtcatc agcagattct tccatcttgt cagcaagttc catgtttgca 
1081 cggaacatct accagctttc cttcagacaa aatgcttcgg acaaagaaat cgtttgggtt 



1201 actgtgtatg ggctctggta cctcagttct gaccttgttt acatcgttat cttcccccag
1261 ctgctttgtg tactctttgt taagggaacc aacacctatg gggccgtggc aggttatgtt
1321 tctggcctct tcctgagaat aactggaggg gagccatatc tgtatcttca gcccttgatc
1381 ttctaccctg gctattaccc tgatgataat ggtatatata atcagaaatt tccatttaaa
1441 acacttgcca tggttacatc attcttaacc aacatttgca tctcctatct agccaagtat
1501 ctatttgaaa gtggaacctt gccacctaaa ttagatgtat ttgatgctgt tgttgcaaga
1561 cacagtgaag aaaacatgga taagacaatt cttgtcaaaa atgaaaatat taaattagat
1621 gaacttgcac ttgtgaagcc acgacagagc atgaccctca gctcaacttt caccaataaa
1681 gaggccttcc ttgatgttga ttccagtcca gaagggtctg ggactgaaga taatttacaa
1741 tga
GENBANK.RTM. COPYRIGHT 2004 on STN
LOCUS (LOC) : E49871 GenBank (R)
GenBank ACC. NO. (GBN) : E49871
GenBank VERSION (VER) : E49871.1 GI:22554902
CAS REGISTRY NO. (RN) : 450667-38-6
SEQUENCE LENGTH (SQL) : 1743
MOLECULE TYPE (CI) : DNA; linear
DIVISION CODE (CI) : Patent
DATE (DATE) : 27 Aug 2002
DEFINITION (DEF) : ***High*** ***affinity*** ***choline*** ***transporter***
SOURCE : Homo sapiens.
ORGANISM (ORGN) : Homo sapiens
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
    Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
    Hominidae; Homo
NUCLEIC ACID COUNT (NA) : 412 a 393 c 406 g 532 t
COMMENT :
    OS Homo sapiens (***human***)
    PN JP 2001136976-A/3
    PD 22-MAY-2001
    PF 27-DEC-1999 JP 1999368991
    PI TATSUYA HAGA,TAKASHI OKUDA
    PC C12N15/09,A01K67/027,A61K38/00,C07K14/47,C07K16/18,C07K19/00,
    PC C12N5/10,
    PC C12P21/02,C12P21/08,C12Q1/00,C12N15/00,A61K37/02,C12N5/00
    CC
    FH Key Location/Qualifiers
    FT CDS (1)..(1743).
REFERENCE : 1 (bases 1 to 1743)
AUTHOR (AU) : Haga,T.; Okuda,T.
TITLE (TI) : ***High*** - ***affinity*** ***choline*** ***transporter***
JOURNAL (SO) : Patent: JP 2001136976-A 3 22-MAY-2001; SCIENCE & TECH AGENCY

FEATURES (FEAT) :
Feature Key Location Qualifier
===============+=======
source 1..1743 /organism="Homo sapiens"
    /db-xref="taxon:9606"

SEQUENCE (SEQ) :

1 atggctttcc atgtggaagg actgatagct atcatcgtgt tctaccttct aattttgctg
61 gttggaatat gggctgcctg gagaaccaaa aacagtggca gcgcagaaga gcgcagcgaa
121 gccatcatag ttggtggccg agatattggt ttattggttg gtggatttac catgacagct
181 acctgggtcg gaggagggta tatcaatggc acagctgaag cagtttatgt accaggttat
241 ggcctagctt gggctcaggc accaattgga tattctctta gtctgatttt aggtggcctg
301 ttctttgcaa aacctatgcg ttcaaagggg tatgtgacca tgttagaccc gtttcagcaa
361 atctatggaa aacgcatggg cggactcctg tttattcctg cactgatggg agaaatgttc
421 tgggctgcag caattttctc tgctttggga gccaccatca gcgtgatcat cgatgtggat
481 atgcacattt ctgtcatcat ctctgcactc attgccactc tgtacacact ggtgggaggg
541 ctctattctg tggcctacac tgatgtcgtt cagctctttt gcatttttgt agggctgtgg
601 atcagcgtcc cctttgcatt gtcacatcct gcagtcgcag acatcgggtt cactgctgtg
661 catgccaaat accaaaagcc gtggctggga actgttgact catctgaagt ctactcttgg
721 cttgatagtt ttctgttgtt gatgctgggt ggaatcccat ggcaagcata ctttcagagg
781 gttctctctt cttcctcagc cacctatgct caagtgctgt ccttcctggc agctttcggg
841 tgcctggtga tggccatccc agccatactc attggggcca ttggagcatc aacagactgg
901 aaccagactg catatgggct tccagatccc aagactacag aagaggcaga catgatttta
961 ccaattgttc tgcagtatct ctgccctgtg tatatttctt tctttggtct tggtgcagtt
1021 tctgctgctg ttatgtcatc agcagattct tccatcttgt cagcaagttc catgtttgca
1081 cggaacatct accagctttc cttcagacaa aatgcttcgg acaaagaaat cgtttgggtt
1141 atgcgaatca cagtgtttgt gtttggagca tctgcaacag ccatggcctt gctgacgaaa
1201 actgtgtatg ggctctggta cctcagttct gaccttgttt acatcgttat cttcccccag



1321 tctggcctct tcctgagaat aactggaggg gagccatatc tgtatcttca gcccttgatc
1381 ttctaccctg gctattaccc tgatgataat ggtatatata atcagaaatt tccatttaaa
1441 acacttgcca tggttacatc attcttaacc aacatttgca tctcctatct agccaagtat
1501 ctatttgaaa gtggaacctt gccacctaaa ttagatgtat ttgatgctgt tgttgcaaga
1561 cacagtgaag aaaacatgga taagacaatt cttgtcaaaa atgaaaatat taaattagat
1621 gaacttgcac ttgtgaagcc acgacagagc atgaccctca gctcaacttt caccaataaa
1681 gaggccttcc ttgatgttga ttccagtcca gaagggtctg ggactgaaga taatttacag
1741 tga
GENBANK.RTM. COPYRIGHT 2004 on STN
LOCUS (LOC) : AB084377 GenBank (R)
GenBank ACC. NO. (GBN) : AB084377
GenBank VERSION (VER) : AB084377.1 GI:22506641
CAS REGISTRY NO. (RN) : 450515-00-1
SEQUENCE LENGTH (SQL) : 291
MOLECULE TYPE (CI) : mRNA; linear
DIVISION CODE (CI) : Primates
DATE (DATE) : 21 Nov 2002
DEFINITION (DEF) : Homo sapiens mRNA for ***high*** - ***affinity***
    ***choline*** ***transporter*** CHT1, 5'UTR.
SOURCE : Homo sapiens (***human***)
ORGANISM (ORGN) : Homo sapiens
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
    Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
    Hominidae; Homo
NUCLEIC ACID COUNT (NA) : 51 a 123 c 74 g 43 t
REFERENCE : 1
AUTHOR (AU) : Okuda,T.; Okamura,M.; Kaitsuka,C.; Haga,T.; Gurwitz,D.
TITLE (TI) : Single Nucleotide Polymorphism of the ***Human***
    ***High*** ***Affinity*** ***Choline***
    ***Transporter*** Alters Transport Rate
JOURNAL (SO) : J. Biol. Chem., 277 (47), 45315-45322 (2002)
OTHER SOURCE (OS) : CA 138:252025
REFERENCE : 2 (bases 1 to 291)
AUTHOR (AU) : Okuda,T.
TITLE (TI) : Direct Submission
JOURNAL (SO) : Submitted (26-APR-2002) Takashi Okuda, University of
    Tokyo, Faculty of Medicine, Department of
    Neurochemistry; 7-3-1 Hongo, Bunkyo-ku, Tokyo 1130033,
    Japan (E-mail: okuda@m.u-tokyo.ac.jp,
    Tel: 81-3-5841-3560, Fax: 81-3-3814-8154)

FEATURES (FEAT) :
Feature Key Location Qualifier
===============+==
source 1..291 /organism="Homo sapiens"
    /db-xref="taxon:9606"
    /tissue-type="spinal cord"
gene 1..291 /gene="CHT1"
5'UTR 1..291 /gene="CHT1"
    /note="high-affinity choline
    transporter CHT1"

SEQUENCE (SEQ) :

1 gattcccacc tcgccctgcg ccgcgcgccc tccgccggcg ccaacacctg tcaactctgc
61 gcgctcccag gttcttggag acgccgagtg aggagccgcc ctgaccgccc ggccccgctc
121 cgcactgcac cccgacccac cccgcaccct ccctttctgc accctcgcac ccacacccct
181 cgcggttcca gggggcggcg gccccgggcg gagcgctttc gcgtgcagcc accactccag
241 aagacttaat gaagtagcca gctgcagaag aatctggatc attagataaa a
GENBANK.RTM. COPYRIGHT 2004 on STN
LOCUS (LOC) : BD012719 GenBank (R)
GenBank ACC. NO. (GBN) : BD012719
GenBank VERSION (VER) : BD012719.1 GI:22092908
CAS REGISTRY NO. (RN) : 445761-48-8
SEQUENCE LENGTH (SQL) : 1743
MOLECULE TYPE (CI) : DNA; linear
DIVISION CODE (CI) : Patent
DATE (DATE) : 2 Aug 2002
DEFINITION (DEF) : ***High*** ***affinity*** ***choline*** ***transporter***
SOURCE : Homo sapiens.
ORGANISM (ORGN) : Homo sapiens
    Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
    Hominidae; Homo
NUCLEIC ACID COUNT (NA) : 412 a 393 c 406 g 532 t
COMMENT :
    OS Homo sapiens (***human***)
    PN WO 0116315-A/3
    PD 08-MAR-2001
    PF 18-AUG-2000 WO 2000JP005545
    PR 27-AUG-1999 JP 99P 240642, 27-DEC-1999 JP 99P 368991
    PI TATSUYA HAGA,TAKASHI OKUDA
    PC C12N15/12,C07K14/47,C12Q1/68,C07K19/00,C07K16/18,C12N5/10,
    PC A61K38/17,
    PC A61K45/00,A61P25/28,G01N33/53,A01K67/027
    CC
    FH Key Location/Qualifiers
    FT CDS (1)..(1743).
REFERENCE : 1 (bases 1 to 1743)
AUTHOR (AU) : Haga,T.; Okuda,T.
TITLE (TI) : ***High*** - ***affinity*** ***choline*** ***transporter***
JOURNAL (SO) : Patent: WO 0116315-A 3 08-MAR-2001; JAPAN SCIENCE AND
    TECHNOLOGY CORP, TATSUYA HAGA, TAKASHI OKUDA



FEATURES (FEAT) :
Feature Key Location Qualifier
source 1..1743 /organism="Homo sapiens"
    /db-xref="taxon:9606"

SEQUENCE (SEQ) :

1 atggctttcc atgtggaagg actgatagct atcatcgtgt tctaccttct aattttgctg
61 gttggaatat gggctgcctg gagaaccaaa aacagtggca gcgcagaaga gcgcagcgaa
121 gccatcatag ttggtggccg agatattggt ttattggttg gtggatttac catgacagct
181 acctgggtcg gaggagggta tatcaatggc acagctgaag cagtttatgt accaggttat
241 ggcctagctt gggctcaggc accaattgga tattctctta gtctgatttt aggtggcctg
301 ttctttgcaa aacctatgcg ttcaaagggg tatgtgacca tgttagaccc gtttcagcaa
361 atctatggaa aacgcatggg cggactcctg tttattcctg cactgatggg agaaatgttc
421 tgggctgcag caattttctc tgctttggga gccaccatca gcgtgatcat cgatgtggat
481 atgcacattt ctgtcatcat ctctgcactc attgccactc tgtacacact ggtgggaggg
541 ctctattctg tggcctacac tgatgtcgtt cagctctttt gcatttttgt agggctgtgg
601 atcagcgtcc cctttgcatt gtcacatcct gcagtcgcag acatcgggtt cactgctgtg
661 catgccaaat accaaaagcc gtggctggga actgttgact catctgaagt ctactcttgg
721 cttgatagtt ttctgttgtt gatgctgggt ggaatcccat ggcaagcata ctttcagagg
781 gttctctctt cttcctcagc cacctatgct caagtgctgt ccttcctggc agctttcggg
841 tgcctggtga tggccatccc agccatactc attggggcca ttggagcatc aacagactgg
901 aaccagactg catatgggct tccagatccc aagactacag aagaggcaga catgatttta
961 ccaattgttc tgcagtatct ctgccctgtg tatatttctt tctttggtct tggtgcagtt
1021 tctgctgctg ttatgtcatc agcagattct tccatcttgt cagcaagttc catgtttgca
1081 cggaacatct accagctttc cttcagacaa aatgcttcgg acaaagaaat cgtttgggtt
1141 atgcgaatca cagtgtttgt gtttggagca tctgcaacag ccatggcctt gctgacgaaa
1201 actgtgtatg ggctctggta cctcagttct gaccttgttt acatcgttat cttcccccag
1261 ctgctttgtg tactctttgt taagggaacc aacacctatg gggccgtggc aggttatgtt
1321 tctggcctct tcctgagaat aactggaggg gagccatatc tgtatcttca gcccttgatc
1381 ttctaccctg gctattaccc tgatgataat ggtatatata atcagaaatt tccatttaaa
1441 acacttgcca tggttacatc attcttaacc aacatttgca tctcctatct agccaagtat
1501 ctatttgaaa gtggaacctt gccacctaaa ttagatgtat ttgatgctgt tgttgcaaga
1561 cacagtgaag aaaacatgga taagacaatt cttgtcaaaa atgaaaatat taaattagat
1621 gaacttgcac ttgtgaagcc acgacagagc atgaccctca gctcaacttt caccaataaa
1681 gaggccttcc ttgatgttga ttccagtcca gaagggtctg ggactgaaga taatttacag
1741 tga
GENBANK.RTM. COPYRIGHT 2004 on STN
LOCUS (LOC) : BD005267 GenBank (R)
GenBank ACC. NO. (GBN) : BD005267
GenBank VERSION (VER) : BD005267.1 GI:18633228
CAS REGISTRY NO. (RN) : 392945-96-9
SEQUENCE LENGTH (SQL) : 1743
MOLECULE TYPE (CI) : DNA; linear
DIVISION CODE (CI) : Patent
DATE (DATE) : 31 Jan 2002
DEFINITION (DEF) : ***High*** - ***affinity*** ***choline*** ***transporter***
SOURCE : Homo sapiens.
ORGANISM (ORGN) : Homo sapiens
    Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
    Hominidae; Homo
NUCLEIC ACID COUNT (NA) : 412 a 393 c 406 g 532 t
COMMENT :
    OS Homo sapiens (***human***)
    PN JP 03074455-T/3
    PD 19-JAN-2001
    PF 18-AUG-2000 JP 2000005545
    PR 27-AUG-1999 JP 99P 240642, 27-DEC-1999 JP 99P 368991
    PI TATSUYA HAGA,TAKASHI OKUDA
    PC C12N15/12,C07K14/47,C12Q1/68,C07K19/00,C07K16/18,C12N5/10,
    PC A61K38/17,
    PC A61K45/00,A61P25/28,G01N33/53,A01K67/027
    CC
    FH Key Location/Qualifiers
    FT CDS (1)..(1743).
REFERENCE : 1 (bases 1 to 1743)
AUTHOR (AU) : Haga,T.; Okuda,T.
TITLE (TI) : ***High*** - ***affinity*** ***choline*** ***transporter***
JOURNAL (SO) : Patent: JP 03074455-T 3 19-JAN-2001; JAPAN SCIENCE AND
    TECHNOLOGY CORP, TATSUYA HAGA,TAKASHI OKUDA

FEATURES (FEAT) :
Feature Key Location Qualifier
source 1..1743 /organism="Homo sapiens"
    /db-xref="taxon:9606"

SEQUENCE (SEQ) :

1 atggctttcc atgtggaagg actgatagct atcatcgtgt tctaccttct aattttgctg 
61 gttggaatat gggctgcctg gagaaccaaa aacagtggca gcgcagaaga gcgcagcgaa
121 gccatcatag ttggtggccg agatattggt ttattggttg gtggatttac catgacagct
181 acctgggtcg gaggagggta tatcaatggc acagctgaag cagtttatgt accaggttat 
241 ggcctagctt gggctcaggc accaattgga tattctctta gtctgatttt aggtggcctg
301 ttctttgcaa aacctatgcg ttcaaagggg tatgtgacca tgttagaccc gtttcagcaa 
361 atctatggaa aacgcatggg cggactcctg tttattcctg cactgatggg agaaatgttc 
421 tgggctgcag caattttctc tgctttggga gccaccatca gcgtgatcat cgatgtggat
481 atgcacattt ctgtcatcat ctctgcactc attgccactc tgtacacact ggtgggaggg
541 ctctattctg tggcctacac tgatgtcgtt cagctctttt gcatttttgt agggctgtgg 
601 atcagcgtcc cctttgcatt gtcacatcct gcagtcgcag acatcgggtt cactgctgtg 
661 catgccaaat accaaaagcc gtggctggga actgttgact catctgaagt ctactcttgg 
721 cttgatagtt ttctgttgtt gatgctgggt ggaatcccat ggcaagcata ctttcagagg 
781 gttctctctt cttcctcagc cacctatgct caagtgctgt ccttcctggc agctttcggg 
841 tgcctggtga tggccatccc agccatactc attggggcca ttggagcatc aacagactgg 
901 aaccagactg catatgggct tccagatccc aagactacag aagaggcaga catgatttta 
961 ccaattgttc tgcagtatct ctgccctgtg tatatttctt tctttggtct tggtgcagtt 
1021 tctgctgctg ttatgtcatc agcagattct tccatcttgt cagcaagttc catgtttgca 
1081 cggaacatct accagctttc cttcagacaa aatgcttcgg acaaagaaat cgtttgggtt 
1141 atgcgaatca cagtgtttgt gtttggagca tctgcaacag ccatggcctt gctgacgaaa
1201 actgtgtatg ggctctggta cctcagttct gaccttgttt acatcgttat cttcccccag 
1261 ctgctttgtg tactctttgt taagggaacc aacacctatg gggccgtggc aggttatgtt 
1321 tctggcctct tcctgagaat aactggaggg gagccatatc tgtatcttca gcccttgatc 
1381 ttctaccctg gctattaccc tgatgataat ggtatatata atcagaaatt tccatttaaa 
1441 acacttgcca tggttacatc attcttaacc aacatttgca tctcctatct agccaagtat 
1501 ctatttgaaa gtggaacctt gccacctaaa ttagatgtat ttgatgctgt tgttgcaaga 
1561 cacagtgaag aaaacatgga taagacaatt cttgtcaaaa atgaaaatat taaattagat 
1621 gaacttgcac ttgtgaagcc acgacagagc atgaccctca gctcaacttt caccaataaa 
1681 gaggccttcc ttgatgttga ttccagtcca gaagggtctg ggactgaaga taatttacag 
1741 tga 



L3 ANSWER 92 OF 111 GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC):              HSA308377   GenBank (R)
GenBank ACC. NO. (GBN):   AJ308377
GenBank VERSION (VER):    AJ308377.1  GI:18375495
CAS REGISTRY NO. (RN):    388546-92-7
SEQUENCE LENGTH (SQL):    610
MOLECULE TYPE (CI):       DNA; linear
DIVISION CODE (CI):       Primates
DATE (DATE):              23 Jan 2002
DEFINITION (DEF):         Homo sapiens partial CHT1 gene for ***high***
                          ***affinity*** ***choline*** ***transporter***,
                          exon 1 and joined mRNA.
SOURCE:                   ***human***
ORGANISM (ORGN):          Homo sapiens
                          Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                          Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
                          Hominidae; Homo
NUCLEIC ACID COUNT (NA):  102 a 200 c 197 g 111 t
REFERENCE:                1 (sites)
   AUTHOR (AU):           Wieland,A.; Bonisch,H.; Bruss,M.
   TITLE (TI):            Molecular cloning of the ***human*** and murine
                          high affinity choline transporters and
                          characterization of the ***human*** gene structure
   JOURNAL (SO):          Unpublished
REFERENCE:                2 (bases 1 to 610)
   AUTHOR (AU):           Bruess,M.
   TITLE (TI):            Direct Submission
   JOURNAL (SO):          Submitted (23-JAN-2001) Bruess M., University of Bonn,
                          Pharmacology and Toxicology, Reuterstr. 2b, D-53113
                          Bonn, GERMANY
FEATURES (FEAT):
Feature Key   Location                    Qualifier
source        1..610                      /organism="Homo sapiens"
                                          /db_xref="taxon:9606"
                                          /chromosome="2"
                                          /map="2q11-13"
exon          <1..196                     /gene="CHT1"
                                          /number=1
mRNA          join(<1..196,               /gene="CHT1"
              AJ308378.1:183..411,        /label=CHT1-mRNA
              AJ308379.1:879..992,
              AJ308379.1:1745..1900,
              AJ308380.1:579..727,
              AJ308381.1:35..178,
              AJ308382.1:663..816,
              AJ308383.1:887..1104,
              AJ308384.1:792..>2239)
gene          join(1..610,                /gene="CHT1"
              AJ308378.1:183..411,
              AJ308379.1:879..1900,
              AJ308380.1:579..727,
              AJ308381.1:35..178,
              AJ308382.1:663..816,
              AJ308383.1:887..1104,
              AJ308384.1:792..2239)
intron        197..>610                   /gene="CHT1"
                                          /number=1

SEQUENCE (SEQ):

1 cacctgtcaa ctctgcgcgc tcccaggttc ttggagacgc cgagtgagga gccgccctga
61 ccgcccggcc ccgctccgca ctgcaccccg acccaccccg caccctccct ttctgcaccc
121 tcgcacccac acccctcgcg gttccagggg gcggcggccc cgggcggagc gctttcgcgt
181 gcagccacca ctccaggtag gagcggacgg tcattcctgc gcctacgggc ccgcaccctc
241 ggaaacgcct tgggcaccga gcccgggcgg ttttagaggg gagggacgcg gcgtttttcc
301 ctctcggtgg gatgcgaggg cgggtttgct gtctggcacc gttgctgtca gggtgcaaga
361 ggctacaaag tctgggcgca ggaaatgggt agagggggcc gaggaagggc cgcagggggc
421 cgggagagca tcgggctgct gcggagagga acttgcctgc ttggttgctc acaaaggtag
481 agaagaaaca gcccgcgggc gcttcttttg cctcctttaa attccggaga ggctgattct
541 ttcccccaac gtctctttac ccacctgcac gcacaaatgt gagagtttcc cccagaaact
601 tgtgtgtgag



LOCUS (LOC):              HSA308378   GenBank (R)
GenBank ACC. NO. (GBN):   AJ308378
GenBank VERSION (VER):    AJ308378.1  GI:18375493
CAS REGISTRY NO. (RN):    388546-91-6
SEQUENCE LENGTH (SQL):    1178
MOLECULE TYPE (CI):       DNA; linear
DIVISION CODE (CI):       Primates
DATE (DATE):              23 Jan 2002
DEFINITION (DEF):         Homo sapiens partial CHT1 gene for ***high***
                          ***affinity*** ***choline*** ***transporter***,
                          exon 2 and joined CDS.
SOURCE:                   ***human***
ORGANISM (ORGN):          Homo sapiens
                          Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
                          Hominidae; Homo
NUCLEIC ACID COUNT (NA):  329 a 210 c 262 g 377 t
REFERENCE:                1 (sites)
   AUTHOR (AU):           Wieland,A.; Bonisch,H.; Bruss,M.
   TITLE (TI):            Molecular cloning of the ***human*** and murine
                          high affinity choline transporters and
                          characterization of the ***human*** gene structure
   JOURNAL (SO):          Unpublished
REFERENCE:                2 (bases 1 to 1178)
   AUTHOR (AU):           Bruess,M.
   TITLE (TI):            Direct Submission
   JOURNAL (SO):          Submitted (23-JAN-2001) Bruess M., University of Bonn,
                          Pharmacology and Toxicology, Reuterstr. 2b, D-53113
                          Bonn, GERMANY
FEATURES (FEAT):
Feature Key   Location                    Qualifier
source        1..1178                     /organism="Homo sapiens"
                                          /db_xref="taxon:9606"
                                          /chromosome="2"
                                          /map="2q11-13"
gene          join(1..1178,               /gene="CHT1"
              AJ308379.1:879..1900,
              AJ308380.1:579..727,
              AJ308381.1:35..178,
              AJ308382.1:663..816,
              AJ308383.1:887..1104,
              AJ308384.1:792..1421)
intron        <1..182                     /gene="CHT1"
                                          /number=1
exon          183..411                    /gene="CHT1"
                                          /number=2
                                          /usedin=AJ308377:CHT1-mRNA
CDS           join(234..411,              /gene="CHT1"
              AJ308379.1:879..992,        /function="re-uptake of choline"
              AJ308379.1:1745..1900,      /codon_start=1
              AJ308380.1:579..727,        /label=CHT1-CDS
              AJ308381.1:35..178,         /product="high affinity choline
              AJ308382.1:663..816,        transporter"
              AJ308383.1:887..1104,       /protein_id="CAC88115.1"
              AJ308384.1:792..1421)       /db_xref="GI:18375494"
              /translation="MAFHVEGLIAIIVFYLLILLVGIWAAWRTKNSGSAEERSEAIIV
              GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYVPGYGLAWAQAPIGYSLSLILGGLFF
              AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALMGEMFWAAAIFSALGATISVIIDVD
              MHISVIISALIATLYTLVGGLYSVAYTDVVQLFCIFVGLWISVPFALSHPAVADIGFT
              AVHAKYQKPWLGTVDSSEVYSWLDSFLLLMLGGIPWQAYFQRVLSSSSATYAQVLSFL
              AAFGCLVMAIPAILIGAIGASTDWNQTAYGLPDPKTTEEADMILPIVLQYLCPVYISF
              FGLGAVSAAVMSSADSSILSASSMFARNIYQLSFRQNASDKEIVWVMRITVFVFGASA
              TAMALLTKTVYGLWYLSSDLVYIVIFPQLLCVLFVKGTNTYGAVAGYVSGLFLRITGG
              EPYLYLQPLIFYPGYYPDDNGIYNQKFPFKTLAMVTSFLTNICISYLAKYLFESGTLP
              PKLDVFDAVVARHSEENMDKTILVKNENIKLDELALVKPRQSMTLSSTFTNKEAFLDV
              DSSPEGSGTEDNLQ"
intron        412..>1178                  /gene="CHT1"
                                          /number=2

SEQUENCE (SEQ):



61 caagggttgt actctctctg tgagtggcac aggatctcgc atgagccagc tggtttattc
121 ctggctgcct acatgtgcat agtgcttgac tggtttataa tgcatccctg tattttcttc
181 agaagactta atgaagtagc cagctgcaga agaatctgga tcattagata aaaatggctt
241 tccatgtgga aggactgata gctatcatcg tgttctacct tctaattttg ctggttggaa
301 tatgggctgc ctggagaacc aaaaacagtg gcagcgcaga agagcgcagc gaagccatca
361 tagttggtgg ccgagatatt ggtttattgg ttggtggatt taccatgaca ggttcagacg
421 ccgccggctc catgcagtcc tcccttcctt ggcatctgtg agtgtgcagc gtgtggcttc
481 agagtgacaa agcaaatgtt ggtctttgtc ctttggggat tctttatgtt cactagttat
541 atatatttta aaataatttt ctattgattt aaaacctaaa aataaattgg tttcaactca
601 gtagtacttg aacaaaaatg attttataat aaatcactgg actgaacagt cacgaaacaa
661 atcttggctc cctgggctat tgagtgggtg gaaacagagt aaaaacaaag aggtggggga
721 atgggaagct gtcctagtat tacaagctat gtatctgcta attaatgtga taggcagcat
781 ctcaacgtgt ccggtaatac atgcataaag ttgtgacttt gagatttaaa taagtgccaa
841 tgaataccat ctagtgatgg tttttaatct gcataaataa atttgggggt gatgatattc
901 ccaattgagt gattccctgt tttgaaacct aattctttca tttaataact aattcctcac
961 aggaagacaa tcacatgtta tgagtccaca gcactaataa 3.gagatttca tgaaccttgt
1021 tataaaaggg aaaagctagg gtgtgtcctc ataggaatcc ccaggccacc atctgcagtc
1081 atctctcatt caaaccttct ttgtgttagc aggagctgat attcttttct tactttgcac
1141 taaagtggtt ctgtttaaag ggtgatttgt ttcttttt
GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC):              HSA308384   GenBank (R)
GenBank ACC. NO. (GBN):   AJ308384
GenBank VERSION (VER):    AJ308384.1  GI:18369775
CAS REGISTRY NO. (RN):    387812-77-3
SEQUENCE LENGTH (SQL):    2239
MOLECULE TYPE (CI):       DNA; linear
DIVISION CODE (CI):       Primates
DATE (DATE):              23 Jan 2002
DEFINITION (DEF):         Homo sapiens partial CHT1 gene for ***high***
                          ***affinity*** ***choline*** ***transporter***,
                          exon 9.
SOURCE:                   ***human***
ORGANISM (ORGN):          Homo sapiens
                          Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                          Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
                          Hominidae; Homo
NUCLEIC ACID COUNT (NA):  664 a 442 c 440 g 693 t
REFERENCE:                1 (sites)
   AUTHOR (AU):           Wieland,A.; Bonisch,H.; Bruss,M.
   TITLE (TI):            Molecular cloning of the ***human*** and murine
                          high affinity choline transporters and
                          characterization of the ***human*** gene structure
   JOURNAL (SO):          Unpublished
REFERENCE:                2 (bases 1 to 2239)
   AUTHOR (AU):           Bruess,M.
   TITLE (TI):            Direct Submission
   JOURNAL (SO):          Submitted (23-JAN-2001) Bruess M., University of Bonn,
                          Pharmacology and Toxicology, Reuterstr. 2b, D-53113
                          Bonn, GERMANY
FEATURES (FEAT):
Feature Key   Location                    Qualifier
source        1..2239                     /organism="Homo sapiens"
                                          /db_xref="taxon:9606"
                                          /chromosome="2"
                                          /map="2q11-13"
intron        <1..791                     /gene="CHT1"
                                          /number=8
gene          1..2239                     /gene="CHT1"
exon          792..>2239                  /gene="CHT1"
                                          /number=9
                                          /usedin=AJ308378:CHT1-CDS
                                          /usedin=AJ308377:CHT1-mRNA

SEQUENCE (SEQ):

1 ctaagtgcct tccttgcacc accccccacg ccccccaaca ggccccagtg tgtgtgctcc 

61 ctccctgtat ccatgagttc tcattgtcca actcccactt atgagtgaga ccatgtggtg 

121 tttggttttc tgtccctgtg ctagtttgct gaggatgatg gcttccacct tcatccatgt 

181 ctctgcaaag gaatgatctt attccttttt atggccacat agtattccat ggtatacatg 

241 tgccacattt tctttatcca gtctatcact gatgggcatt tgcgttggtt ccatgacttt 

301 gttattgtaa atagtgctgc aataaacgta tgtgtgcatg tgtcttttca agagcaccag 

361 aaggtggtta gtttaaggat ggtagcactt ctacaaagag ataaacatag ggatggccta 



481 tgctttatta tacctccctg acaccatgtt agctcatcct gcctttgccc ctgtgcattc
541 aggaaaaggc tcaggtggcc tgtgtgtcat acatctggta cacagaaacc cctcccaaag
601 gtccacccag gaagtcacct tgattgtttg ctttgggtgc tgtagctgta gttggttttt
661 cccatagatt gagcacattt gaaccaggag aattctttag accagttttg aagcaaaaaa
721 tatgtctcat tctttgccta aatgagccaa gacactgtgc aaaaagctga cactgtggca
781 atttcttaca ggcttcggac aaagaaatcg tttgggttat gcgaatcaca gtgtttgtgt
841 ttggagcatc tgcaacagcc atggccttgc tgacgaaaac tgtgtatggg ctctggtacc
901 tcagttctga ccttgtttac atcgttatct tcccccagct gctttgtgta ctctttgtta
961 agggaaccaa cacctatggg gccgtggcag gttatgtttc tggcctcttc ctgagaataa
1021 ctggagggga gccatatctg tatcttcagc ccttgatctt ctaccctggc tattaccctg
1081 atgataatgg tatatataat cagaaatttc catttaaaac acttgccatg gttacatcat
1141 tcttaaccaa catttgcatc tcctatctag ccaagtatct atttgaaagt ggaaccttgc
1201 cacctaaatt agatgtattt gatgctgttg ttgcaagaca cagtgaagaa aacatggata
1261 agacaattct tgtcaaaaat gaaaatatta aattagatga acttgcactt gtgaagccac
1321 gacagagcat gaccctcagc tcaactttca ccaataaaga ggccttcctt gatgttgatt
1381 ccagtccaga agggtctggg actgaagata atttacagtg accccatcta aataaaatac
1441 tgcttttgca aacagaacac tgtaataggg tagttctgga gagatggtat gcagcataca
1501 aaaatatatt aaaaatataa acaatgttca ggagagtaaa aattcatata aagtgcaatt
1561 gcacaaatac aagccaagct agaaggaagc acctatgaaa gcaacaactt tgtttctcat
1621 ccatagtagt attgattttg atgctagata gttttgctag gtataaaaaa taagtaaagt
1681 tccacttaga gaacaaaggg ccaaatagag tttttatatt tgttatgata aaaggaagta
1741 gatgtgaaaa agcctaagaa aaaggaaatt ggacagtttt gatacaaact ttgtttgcta
1801 atgcactgat gagtctagtt tcattatagc acgaagctat gagaataact tcagtcactc
1861 ccttgaatgg tgcaatgaat taaccagctg atttttctta gtgtgatgat taaccccttc
1921 tttcatgttc tgagctataa catttgctga atatgcaatt tgttattctt ttattaatgg
1981 catgtaatat tctgagcacg ggcaaagaaa acacacaaaa aattatgtat tggcatttat
2041 ttatgtgcaa ggtgatagga aaactgaatc catctttgta gaagagcact gggctaattt
2101 gtatgtttcc atagctacta tatgcataaa caacagtacc tgaaggatta ttaagcaacc
2161 ttaaagcaat aagttcatta aacagaaggt aataggaaga acagtacatt tttgtcttta
2221 tctcaagtat ataaagttt
LOCUS (LOC):              HSA308383   GenBank (R)
GenBank ACC. NO. (GBN):   AJ308383
GenBank VERSION (VER):    AJ308383.1  GI:18369774
CAS REGISTRY NO. (RN):    387812-76-2
SEQUENCE LENGTH (SQL):    1657
MOLECULE TYPE (CI):       DNA; linear
DIVISION CODE (CI):       Primates
DATE (DATE):              23 Jan 2002
DEFINITION (DEF):         Homo sapiens partial CHT1 gene for ***high***
                          ***affinity*** ***choline*** ***transporter***,
                          exon 8.
SOURCE:                   ***human***
ORGANISM (ORGN):          Homo sapiens
                          Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                          Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
                          Hominidae; Homo
NUCLEIC ACID COUNT (NA):  529 a 304 c 287 g 537 t
REFERENCE:                1 (sites)
   AUTHOR (AU):           Wieland,A.; Bonisch,H.; Bruss,M.
   TITLE (TI):            Molecular cloning of the ***human*** and murine
                          high affinity choline transporters and
                          characterization of the ***human*** gene structure
   JOURNAL (SO):          Unpublished
REFERENCE:                2 (bases 1 to 1657)
   AUTHOR (AU):           Bruess,M.
   TITLE (TI):            Direct Submission
   JOURNAL (SO):          Submitted (23-JAN-2001) Bruess M., University of Bonn,
                          Pharmacology and Toxicology, Reuterstr. 2b, D-53113
                          Bonn, GERMANY
FEATURES (FEAT):
Feature Key   Location                    Qualifier
source        1..1657                     /organism="Homo sapiens"
                                          /db_xref="taxon:9606"
                                          /chromosome="2"
                                          /map="2q11-13"
gene          1..1657                     /gene="CHT1"
intron        <1..886                     /gene="CHT1"
                                          /number=7
exon          887..1104                   /gene="CHT1"
                                          /number=8
                                          /usedin=AJ308377:CHT1-mRNA
intron        1105..>1657                 /gene="CHT1"
                                          /number=8

SEQUENCE (SEQ):

1 tagatacaca ttcttgtttg gtatttctgt atcataaatc tcagtaaggc cacaaagtgc
61 tgcatgtaag caggctttgt gacaattgta atagaaaatg agggaagaaa aaatggtagg
121 atccaaagag aacaattcct atttggtcca atgatcatga tattaaaaaa gttcagcaaa
181 taaaaagccg actatgctaa tgaacattta aggattccat agttcttcaa cacatgctac
241 aactaaccag taaagaagct atgtattcaa agataaatct gtgtttcaag tcttcttacc
301 ctacccacaa ccaaaacttc ctgaaaattt cagccacatg actgtaattt ttattcaata
361 aaaagggact gttcttaacc tagtgaaatt ctatgcaatg tggaaatagg agaaaagtaa
421 tgctgaggca gcttcaaaac caagtttatg tacagagaga tgatggctga gaatagccct
481 actattgtct ttcttcaatt acacatgaat aactgagcct taaagaagaa acagtgagtg
541 tcccaccaca tttccaagtg gttgaaggaa tactctagaa ttctgtttat gagagtttgt
601 tttctatgat tctaggcaat gtaacaccat cattgcagaa aggaaatata tatcttacca
661 ccagtgtctg tgcaggcatt gtgaataaca ctaactataa ccaatacctt ttagaagcca
721 agagatataa gacatttcca cttgaccagc acatggactt cacaccagac taatgtatat
781 gattctgagt ttatttcaaa acaacctagt aaataggatg accccagatg gataaagaac
841 atttggttcc ttggtggtta taatggttgt tgattctgtt caacagactg gaaccagact
901 gcatatgggc ttccagatcc caagactaca gaagaggcag acatgatttt accaattgtt
961 ctgcagtatc tctgccctgt gtatatttct ttctttggtc ttggtgcagt ttctgctgct
1021 gttatgtcat cagcagattc ttccatcttg tcagcaagtt ccatgtttgc acggaacatc
1081 taccagcttt ccttcagaca aaatgtaaga acagtttctt tcaacctgat catttactag
1141 cattgctctt gcatgcttct gatgttgtat ttgttgtata tacagtatta tatatttatt
1201 aatattctat gttaaatctg actgtacttt aagcatacga gattaaataa caaattatac
1261 ctatgctgaa tggatgctat cagtacctgt tcttattcat gtcaatacta aagggaacaa
1321 atcaatataa ttatttccca agaggtacag tcacagggct atcatttaac ctttttgtct
1381 ttctgatgaa agagtaatga aacctaaact attcagcaaa acttttagta taattaaatt
1441 ctctatgaca ctggaaaatc ttggtggaag cagggcatgg taagttgttc gacacgtatt
1501 tttgtaactt ctgaactgtt ttatacttgc actccaaagt ccacctttgc aggaagatat
1561 cctctcaatt tgattttctt tattgaatca aagtggctgc aaaacagcat gcttctttgg
1621 ttgattaaag cttttcctct ctctaggttt tagttac
GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC):              HSA308382   GenBank (R)
GenBank ACC. NO. (GBN):   AJ308382
GenBank VERSION (VER):    AJ308382.1  GI:18369773
CAS REGISTRY NO. (RN):    387812-75-1
SEQUENCE LENGTH (SQL):    1467
MOLECULE TYPE (CI):       DNA; linear
DIVISION CODE (CI):       Primates
DATE (DATE):              23 Jan 2002
DEFINITION (DEF):         Homo sapiens partial CHT1 gene for ***high***
                          ***affinity*** ***choline*** ***transporter***,
                          exon 7.
SOURCE:                   ***human***
ORGANISM (ORGN):          Homo sapiens
                          Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                          Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
                          Hominidae; Homo
NUCLEIC ACID COUNT (NA):  398 a 326 c 284 g 459 t
REFERENCE:                1 (sites)
   AUTHOR (AU):           Wieland,A.; Bonisch,H.; Bruss,M.
   TITLE (TI):            Molecular cloning of the ***human*** and murine
                          high affinity choline transporters and
                          characterization of the ***human*** gene structure
   JOURNAL (SO):          Unpublished
REFERENCE:                2 (bases 1 to 1467)
   AUTHOR (AU):           Bruess,M.
   TITLE (TI):            Direct Submission
   JOURNAL (SO):          Submitted (23-JAN-2001) Bruess M., University of Bonn,
                          Pharmacology and Toxicology, Reuterstr. 2b, D-53113
                          Bonn, GERMANY
FEATURES (FEAT):
Feature Key   Location                    Qualifier
source        1..1467                     /organism="Homo sapiens"
                                          /db_xref="taxon:9606"
                                          /chromosome="2"
                                          /map="2q11-13"
gene          1..1467                     /gene="CHT1"
intron        <1..662                     /gene="CHT1"
exon          663..816                    /gene="CHT1"
                                          /number=7
                                          /usedin=AJ308378:CHT1-CDS
                                          /usedin=AJ308377:CHT1-mRNA
intron        817..>1467                  /gene="CHT1"
                                          /number=7

SEQUENCE (SEQ):

1 tgagaatatt ttatgatggt tagacccacc agatgataac aaattgaagg cctctggggt
61 agtttaaata cagttcaaaa taggtaaatt aatgaaaaga aataattgct gtatgtaatt
121 atagagaaaa cttcaggaaa ttctgcataa ttgaaagttg atgatacatt gtcttagttc
181 agttgggtaa caaagcatca taaacggagt ggcttataaa caacacagat ttatttttca
241 cagtttttca catggctggg aagtccaaga tcaaggtgcc ggcagattca Gtgcctggtg
301 aggattcgca tcctcataga aggcaccttc ttgctgcctc ctcacatggt ggaaggaaaa
361 aaactccttt gactctcttt tataaggact ccaccctcat gggctaatca tctcccaaag
421 ttgccaactt gatacctaat accatcacct tggggatcaa aatttcagtg tatgaattta
481 gggggaacat aaacatcaga tcatagcaaa cataatgata ctaattgata tttgtggcta
541 catgccacat tttttttact acttctaagt tgtacttagg cctattctaa atgtgattgc
601 aataaaactc tttaaaaaaa tgaatagatg tttgcctctc catccttgtg tttcccgcac
661 agatgctggg tggaatccca tggcaagcat actttcagag ggttctctct tcttcctcag
721 ccacctatgc tcaagtgctg tccttcctgg cagctttcgg gtgcctggtg atggccatcc
781 cagccatact cattggggcc attggagcat caacaggtaa atctcttgca gcttcaccac
841 atgtgccagt tagtttacca atccccaccc agacaccctt ctgtcccact cccctctttc
901 ctccacatag tgaattcttt ctcaccacat tcatatctat agatttgtta tagcattcag
961 taaatcgtat tatagttcag cggcctccat tttttttttt tttttttttt tagtagagac
1021 ggggtttcat catgttggtc aggctggtct caaactcctg acctcaggtg atccatctgc
1081 ctcggtctcc caaagtgctg gaattatagg tgtgagctac tgcgcccagc ctccaatttt
1141 tttgattgct cactccacta tgaaacattg gtgagcacac cctcctccca gttttggtat
1201 atttatccat tgtttatatg cactcctgtc agcagtggat ctcaaccaag gtgcatgctg
1261 acatcatctg gggaatggtg gggagatctt ttaaaaatac ggatatgtgg cctacttcac
1321 aggaatttta ttttaataac tgaagataga gatgggcact agtgttcttt taaaacccct
1381 aggtgagttg agagccagag ttgagagtca caggcttaga gacgtttacc tgtcatcctc
1441 atgacctccc taaagatcac tttgaac
LOCUS (LOC):              HSA308381   GenBank (R)
GenBank ACC. NO. (GBN):   AJ308381
GenBank VERSION (VER):    AJ308381.1  GI:18369772
CAS REGISTRY NO. (RN):    387812-74-0
SEQUENCE LENGTH (SQL):    736
MOLECULE TYPE (CI):       DNA; linear
DIVISION CODE (CI):       Primates
DATE (DATE):              23 Jan 2002
DEFINITION (DEF):         Homo sapiens partial CHT1 gene for ***high***
                          ***affinity*** ***choline*** ***transporter***,
                          exon 6.
SOURCE:                   ***human***
ORGANISM (ORGN):          Homo sapiens
                          Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                          Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
                          Hominidae; Homo
NUCLEIC ACID COUNT (NA):  191 a 148 c 197 g 200 t
REFERENCE:                1 (sites)
   AUTHOR (AU):           Wieland,A.; Bonisch,H.; Bruss,M.
   TITLE (TI):            Molecular cloning of the ***human*** and murine
                          high affinity choline transporters and
                          characterization of the ***human*** gene structure
   JOURNAL (SO):          Unpublished
REFERENCE:                2 (bases 1 to 736)
   AUTHOR (AU):           Bruess,M.
   TITLE (TI):            Direct Submission
   JOURNAL (SO):          Submitted (23-JAN-2001) Bruess M., University of Bonn,
                          Pharmacology and Toxicology, Reuterstr. 2b, D-53113
                          Bonn, GERMANY
FEATURES (FEAT):
Feature Key   Location                    Qualifier
source        1..736                      /organism="Homo sapiens"
                                          /db_xref="taxon:9606"
                                          /chromosome="2"
                                          /map="2q11-13"
gene          1..736                      /gene="CHT1"
intron        <1..34                      /gene="CHT1"
exon          35..178                     /gene="CHT1"
                                          /number=6
                                          /usedin=AJ308378:CHT1-CDS
                                          /usedin=AJ308377:CHT1-mRNA
intron        179..>736                   /gene="CHT1"
                                          /number=6

SEQUENCE (SEQ):

1 tcgcctaggg ctccagtgtc actttctgtt gcagtggatc agcgtcccct ttgcattgtc
61 acatcctgca gtcgcagaca tcgggttcac tgctgtgcat gccaaatacc aaaagccgtg
121 gctgggaact gttgactcat ctgaagtcta ctcttggctt gatagttttc tgttgttggt
181 aagtaatgct cttacctgaa gaatgtgatt taattgttcc tgaaatcaaa tttgttttca
241 cgattttcat attcatagta aaaaaatgtg cttgtgggct catggccatt tctgaattga
301 ggactctcta tcgggagggt ggagccaggg ccggactatc gtggctggca gtgtcagggt
361 ggagaacaaa tgaggccctg aggggacagc ataaggtgtt ctgggtggtg ggtcctcatt
421 tccgttgtga tgtaagtcaa ggaataacat gccctacccc caggactttc cccaacctga
481 aggcaaaacc ttgtaaaggc tttattttgt gttttgaatt aaggcatgaa acccaactta
541 tgaaataagt ctctcattca ctcttgaatt acagagtaag tgaattagta gtgctagagg
601 ccctaagtga ggcaggttgg aaagggacat gtcactcagg aaagatgcca agagagaggc
661 cctcaggagc cagaggaagc ttgaaaacag gcagaacttt gggcaaaagg tgcaggcact
721 ttgcaggagg acatgg
GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC):              HSA308380   GenBank (R)
GenBank ACC. NO. (GBN):   AJ308380
GenBank VERSION (VER):    AJ308380.1  GI:18369771
CAS REGISTRY NO. (RN):    387812-73-9
SEQUENCE LENGTH (SQL):    1308
MOLECULE TYPE (CI):       DNA; linear
DIVISION CODE (CI):       Primates
DATE (DATE):              23 Jan 2002
DEFINITION (DEF):         Homo sapiens partial CHT1 gene for ***high***
                          ***affinity*** ***choline*** ***transporter***,
                          exon 5.
SOURCE:                   ***human***
ORGANISM (ORGN):          Homo sapiens
                          Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                          Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
                          Hominidae; Homo
NUCLEIC ACID COUNT (NA):  415 a 234 c 224 g 435 t
REFERENCE:                1 (sites)
   AUTHOR (AU):           Wieland,A.; Bonisch,H.; Bruss,M.
   TITLE (TI):            Molecular cloning of the ***human*** and murine
                          high affinity choline transporters and
                          characterization of the ***human*** gene structure
   JOURNAL (SO):          Unpublished
REFERENCE:                2 (bases 1 to 1308)
   AUTHOR (AU):           Bruess,M.
   TITLE (TI):            Direct Submission
   JOURNAL (SO):          Submitted (23-JAN-2001) Bruess M., University of Bonn,
                          Pharmacology and Toxicology, Reuterstr. 2b, D-53113
                          Bonn, GERMANY
FEATURES (FEAT):
Feature Key   Location                    Qualifier
source        1..1308                     /organism="Homo sapiens"
                                          /db_xref="taxon:9606"
                                          /chromosome="2"
                                          /map="2q11-13"
gene          1..1308                     /gene="CHT1"
intron        <1..578                     /gene="CHT1"
                                          /number=4
exon          579..727                    /gene="CHT1"
                                          /number=5
                                          /usedin=AJ308378:CHT1-CDS
                                          /usedin=AJ308377:CHT1-mRNA
intron        728..>1308                  /gene="CHT1"
                                          /number=5

SEQUENCE (SEQ):

1 ttcaaatagt taccatcatg ttacacttgc ctacagtgtt tagtacggta acaatctgca 
61 cgaatttgta gcccaggagc aataggccat accatgtagc ctaggtgtgt agtagttaca 
121 ccatcaagat ttgtgagtga gtacatactg tgaagtgcac aacagtgaaa tcacctaaca 



241 tcagagttcc agatagtgaa attattttga gttcttgtaa tgtaattaaa tttctcaggt
301 aattcagggt cttctataac cataccatgg atctgtgtgt cctccaaata aataagttag
361 caaaccagta atttcagaaa gaaaaagtat gtatctcact gaatttatta agatatacaa
421 atttcaatat aaaaattaac tttaggtgtt ttcatccact gcatttcata tgacagagag
481 aaaaatgttg tttatcactg aactttcatt atgtccttat acaaagagca gaatctgtct
541 gggcaccttg accacagaac tctcttgttt gtttgcagga gccaccatca gcgtgatcat
601 cgatgtggat atgcacattt ctgtcatcat ctctgcactc attgccactc tgtacacact
661 ggtgggaggg ctctattctg tggcctacac tgatgtcgtt cagctctttt gcatttttgt
721 agggctggta agtgggagca cccaagattc tcctcctttt tttctgtaaa aggtaatgta
781 ttttatacta actcattaaa aatgagtaga aatattattt tccctcatga gtaatacata
841 ctaagtcaca tacatgaaat ttcaaatttc tccataaatt aattagatga aaaactcgtc
901 attttgataa aatttgacag aaaagtgtgc tctccattgg actgttagaa aggtctaggt
961 cttgctttcc ttaatgccta gatttgatgg aaacttatgc atgatctcct tttagtattt
1021 aaggcaatcc atttccctcg gcagagataa caaaattcaa accttagtgg taacttgtac
1081 aagtctccct ttacgttagt tggtgaatcc taagaatgct cataagacat agaaatgatc
1141 ccattggaaa actgtggatt tcaaatatta atcatcacta tttgtggtct aaaaacaaag
1201 aaaattctta attatataag gcaccagaaa gatttattat ctattgtttt tattataatt
1261 atgagacatg agtatttatt agtattatta gcactatatt aagagcac
GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC):              HSA308379   GenBank (R)
GenBank ACC. NO. (GBN):   AJ308379
GenBank VERSION (VER):    AJ308379.1  GI:18369728
CAS REGISTRY NO. (RN):    387812-40-0
SEQUENCE LENGTH (SQL):    2326
MOLECULE TYPE (CI):       DNA; linear
DIVISION CODE (CI):       Primates
DATE (DATE):              23 Jan 2002
DEFINITION (DEF):         Homo sapiens partial CHT1 gene for ***high***
                          ***affinity*** ***choline*** ***transporter***,
                          exons 3-4.
SOURCE:                   ***human***
ORGANISM (ORGN):          Homo sapiens
                          Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                          Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
                          Hominidae; Homo
NUCLEIC ACID COUNT (NA):  785 a 405 c 397 g 739 t
REFERENCE:                1 (sites)
   AUTHOR (AU):           Wieland,A.; Bonisch,H.; Bruss,M.
   TITLE (TI):            Molecular cloning of the ***human*** and murine
                          high affinity choline transporters and
                          characterization of the ***human*** gene structure
   JOURNAL (SO):          Unpublished
REFERENCE:                2 (bases 1 to 2326)
   AUTHOR (AU):           Bruess,M.
   TITLE (TI):            Direct Submission
   JOURNAL (SO):          Submitted (23-JAN-2001) Bruess M., University of Bonn,
                          Pharmacology and Toxicology, Reuterstr. 2b, D-53113
                          Bonn, GERMANY
FEATURES (FEAT):
Feature Key   Location                    Qualifier
source        1..2326                     /organism="Homo sapiens"
                                          /db_xref="taxon:9606"
                                          /chromosome="2"
                                          /map="2q11-13"
intron        <1..878                     /gene="CHT1"
                                          /number=2
gene          1..2326                     /gene="CHT1"
exon          879..992                    /gene="CHT1"
                                          /number=3
                                          /usedin=AJ308378:CHT1-CDS
                                          /usedin=AJ308377:CHT1-mRNA
intron        993..1744                   /gene="CHT1"
                                          /number=3
exon          1745..1900                  /gene="CHT1"
                                          /number=4
                                          /usedin=AJ308378:CHT1-CDS
                                          /usedin=AJ308377:CHT1-mRNA
intron        1901..>2326                 /gene="CHT1"
                                          /number=4

SEQUENCE (SEQ):



61 atagtaaata tcttactaaa agatccatat ataaatatag aattgcaagt atgtatgcag
121 ttcacatttt accttcttct gagtgctagg tagtctaggc cttaaggtaa acatttaaac
181 tcactttgga caccttgtag tagaagtgtc atctcgatcc tctcttcact cctaataact
241 taaggtagta agtacaactt ^gtggtgatg gagataagat ttgcaagctc tgatatcccc
301 agagtgttca agaaaagact ctattagtga aaagcacatt tctacaaaca tacactcacc
361 tacatagata catgtgtaca cacatgcacg tacacatgca caccactccg tgaggacagt
421 ataaaaacat caatacaagt aaaatggaag ggcctcgaaa taaagaaaat ttaccccttg
481 acctcagact atattacata taatatcatg aaaatgctgc tgactcaatg catttagaaa
541 acatctaggg cagcactatc caatagaaat ataatatgag ccacaaacat aatgttacat
601 tttctacaaa cctctaaaaa caaaaagaaa taggtaaatt aatgttaata gtataactta
661 tgtaactcaa ttcattcaac atgttttcat ttcaacatgt gattaatgaa agaaaaaaca
721 tttgaaatct ggtgtttatt ttatattcct aaaacaactc aatctaggct cgtcacattt
781 caggtgctca gtaacttcta gtagccactg gtttggcagg cagatgtagg tataacaggt
841 aagacagtat cactccctca cttttcattc tgtttcagct acctgggtcg gaggagggta
901 tatcaatggc acagctgaag cagtttatgt accaggttat ggcctagctt gggctcaggc
961 accaattgga tattctctta gtctgatttt aggtaagtga aagtgcaaat ctcagtgact
1021 cactcagtaa agtatacaga acaagagtat aaataacaag tggagtaaca agtcaaacac
1081 tctgaataac cattgtaaaa aaatgtcccc agagatcttt aaggagtcat taccagaaac
1141 tggtaagtag cagaacacta atgcttaaat tatttgatct cttggttggg aatatattta
1201 aaatcctctg gacctgtggc tgtcagcaca gtgaatttct taacataatc atgttttatt
1261 tctttcaaaa caagtatgga tatatcatta aaaatgaatc aacttgaaat gtacctgtgt
1321 ttcttttagc aagtcaaatg aataagaggt caaaagtatt atataacaga aggtacctca
1381 tttttggggc actcaaaagg cttttattga atctgtgctt attcaaaacc tctaaatgct
1441 atgttgagag atagaaatgc attgaggcat ttaagtgagc aatgtgctgg ttcacgtttt
1501 tcaattcaca tacctcaaca tgacccactt atcctttgga cacatacaat ggtaagtgta
1561 ctaaataaat tatattagat tgctaaacct aatatttata gtaggttggt actgtgtgca
1621 gacttgttcc tcaatccata caggtgttct ggaaaatgca ttttccttga taaatggcag
1681 catagtaaaa gactattata ttacattctt tttattttct cctaaacagt tgcttaattt
1741 ttaggtggcc tgttctttgc aaaacctatg cgttcaaagg ggtatgtgac catgttagac
1801 ccgtttcagc aaatctatgg aaaacgcatg ggcggactcc tgtttattcc tgcactgatg
1861 ggagaaatgt tctgggctgc agcaattttc tctgctttgg gtaaggacca gctaagttgt
1921 ctagctgcat ctttgtagtt aactaaacat cagatacgca accttacttt cctcagcctt
1981 ggcttctgag gacattttta tcaaaaccat atgctgaacc atatttatat gcataaaagc
2041 ttaatggaaa ttctacgttt acttgcaata tggtggcaga ggttaaataa aacctcagct
2101 acactttcca tacttgaatt gcctcattag accaataact atgaattata atgatcaatt
2161 tgcttttatg agtcatatat gattttgaac aaagaataat gaattaagca tagctaaaaa
2221 gtgcagttta cttaacataa atgtttaaaa caaggattca tgaaccaggg gttagtgaga
2281 atactaatac atatttttgg tgttgaataa atttaaataa atgatt
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LOCUS (LOG) : 
GenBank ACC. NO. (GBN) 
GenBank VERSION (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF) : 

SOURCE ' 
ORGANISM (ORGN) : 



GENBANK. RTM. COPYRIGHT 2004 on STN 



(R) 



AJ414146 GenBank 
AJ414146 AL590842 
AJ4 14146.1 GI: 15979072 
361328-59-8 
210050 
DNA; linear 
Bacteria 
21 Nov 2002 

Yersinia pest is strain C092 complete genome; segment 

Yersinia pestis C092 
Yersinia pestis C092 

Bacteria; Proteobacteria; Gammaproteobacteria; 
Enterobacteriaceae ; Yersinia 
53724 a 46781 c 53011 g 56534 t 



NUCLEIC ACID COUNT (NA) 
COMMENT : 

Notes: 

Details of Y. pestis sequencing at the Sanger Centre are available 
on the World Wide Web. 

(URL, http://www.sanger.ac.Uk/Projects/Y pestis/) . 



REFERENCE:               1 (bases 1 to 210050)
AUTHOR (AU):             Parkhill,J.; Wren,B.W.; Thomson,N.R.; Titball,R.W.;
                         Holden,M.T.G.; Prentice,M.B.; Sebaihia,M.; James,K.D.;
                         Churcher,C.; Mungall,K.L.; Baker,S.; Basham,D.;
                         Bentley,S.D.; Brooks,K.; Cerdeno-Tarraga,A.M.;
                         Chillingworth,T.; Cronin,A.; Davies,R.M.; Davis,P.;
                         Dougan,G.; Feltwell,T.; Hamlin,N.; Holroyd,S.;
                         Jagels,K.; Leather,S.; Karlyshev,A.V.; Moule,S.;
                         Oyston,P.C.F.; Quail,M.; Rutherford,K.; Simmonds,M.;
                         Skelton,J.; Stevens,K.; Whitehead,S.; Barrell,B.G.
TITLE (TI):              Genome sequence of Yersinia pestis, the causative agent
                         of plague
JOURNAL (SO):            Nature, 413 (6855), 523-527 (2001)
OTHER SOURCE (OS):       CA 135:314279

AUTHOR (AU):             Parkhill,J.
TITLE (TI):              Direct Submission
JOURNAL (SO):            Submitted (04-OCT-2001) Submitted on behalf of the
                         Yersinia sequencing team, Sanger Centre, Wellcome Trust
                         Genome Campus, Hinxton, Cambridge CB10 1SA E-mail:
                         parkhill@sanger.ac.uk



FEATURES (FEAT):

Feature Key   Location   Qualifier

source 1..210050
/organism="Yersinia pestis CO92"
/strain="CO92"
/db-xref="taxon:214092"
/note="biovar: Orientalis"
/gene="yapH"
/note="synonym: YPO1004"
/gene="yapH" 

/note="Previously sequenced as
Yersinia pestis YapH protein
TR:CAC14227 (EMBL:AJ277631) (3705
aa) fasta scores: E(): 0, 99.6% id
in 3705 aa. Similar in regions to
Escherichia coli YchA protein
TR:Q9JMS3 (EMBL:AP001918) (1371
aa) fasta scores: E(): 0, 39.1% id
in 1405 aa, and to Escherichia
coli adhesin Aida-I precursor
SW:AIDA-ECOLI (Q03155) (1286 aa)
fasta scores: E(): 0, 29.2% id in
1286 aa. Contains a possible
N-terminal signal sequence."
/codon-start=1
/transl-table=11

/product="putative autotransporter
protein"
/protein-id="CAC89847.1"
/db-xref="GI:15979073"
/db-xref="SPTREMBL:Q8ZHA1"
/translation="IVDSTTIFKVIWNASLNVWVWS
ELAKGRIKTKSSRNLISEGVLPKF
EQSMVSKLFRKNLLALSLGSIVFLSTGPVFAADI
TVSTQAELSAALSNGTYDKIILGA
DITLIGSLTVNMTSNQWIDGQGKFGLTVNNTTN
YGLWSSGSGTLTLQNMSKIDSAN
YYSMWLNGANTAVNVIYNNIDFLGSSQLIYMGA
YGAATNSIMTFGDILNDWVNDRA

QEIGEVNKLAFTGRFHVTHTGSSVTSFVSTGGAN 

NTSTMDFASGADVKIDRTGSTGDL 

TSTGVNAFAYTFADGASFELIANQNVFSGTTTNR 

GLEIGSYNSIDGFGSGVKIVLQSR 

SDGSIISGNGIDNATTNAAGINNNASGDANVIYN

LGTGSILKATNTGILATKNANNAS

DIYIRSAGDITAATGISATHNGTGTVKIKNDGTI

TSTTAGIAISSASIKEISVDNTDG

TITATAGTGVNVLASAILNLFGGTINTSATANGI

TFAGTEGGHTLTDLTINLLGTGIA 

LSNVAGVNLTLSNVTLNTLNGTALNSLTGLTLVD 

SLNGRNTINIEGAGIGIAATNTEL 

NTFDAEALDINVNGAGIGIQATGGGVNLSASNLI 

INVANTLGTALQITDGIDNTTTIG 

NEIQLNAENATAINFLGSSSKTLNNNGTIKGSVI 

FAGVADHI INNNGTLDGTLTTGAG 

NDTLVLDSSSQSNDVINLGDGNNSVTIQNGATVS 

SIITGNGNDTFTINGMSVGSTYLG

SLDAGTGLNTLNFNASTDELAAATSLQGFTNINL 

VDSHITLVSDDNIGSGMVNIDSSS 

ELLFGSTFDGILHATLGAGTGSAIVNNSANVSLE 

QASMFAGTWQVNQGGALTASNSNQ 

LGSAKIGLDGTLNLDNIALFNHVLTGNGTLNVAK 

NLATTAFDFGSTVGGAFSGIVNLT 

KTTFALSADNAAALASATLKLSDDSVTTVGTTDR 

TLHGLDLSGGTLIFDGAVPQSQTS 

GWTVTDLALNSGTVNITGSGSWDNTDPLATNVS 

ILEQDRAGSTLELINATNVTGDID 

ALDLLVNGTAITSGTQGVQSAIQQGGSTVANAIH 



gene 829..11946
CDS 829..11946



ELLADGADALLLATESGLTANRVLNAELFGVGGL 

WDAQNGALTLANGSNRYEGTTTV 

TAGELILGANGAFGQTSLLDIASGASANINGYSQ 

TVGAVTNVGTVTLGSGGVLTSGLL 

TNGGILDLTGGALNLTAGGASTVAGGLTGAGTLN 

INGGNLSVSAANSGLSGQTHIADV 

ASVTLTDTGTLGTSAVEVLGTLNLNGANAAMTNV 

LSGDGTINTNAAVTLSGNNSFSGA 

HQIGTDGELTVGQASNLGASSATVNLGTLTSHLI

LNGVSESIANVLSGVAGSTVDIIG

GADTALTANNSGFLGQYALAGNSKLTVASTNNLG 

ASSSVALAGAGDTLSLSGFNGTFG 

NSVTGSGVLQVTDDAEVTLTSSNGV 

SNAVTIDIADATLNLDDIALFNHVLTGNGLLNV

AKNDASTAFDFGSTVGGAFSGIVNLTNTTFALSA 

DNAAALARATLKLSDDSVTTVGAT 

DRTLHGLDLNGGTLIFDGSPPQSQANGWTVTDL 

ALNSGTISITGAGNWENEHPVTPP

NVSLLEQDRGDILLELINAANVTGNANNLDLLVD

GTAITSGTQGVESAIQQGGSTVAN 

AIHNYGLTSSNGNGGSGLYVNYTLSALELLANGA 

NALLLATESGLTANRVLNAELFGV 

GGLWDAQNGALTLANGNNRYEGTTTVTAGELIL 

GANGAFGQTSLLNIASGASANING 

YRQTVGAVTNSGAVTLGNGGVLTSGLLTNGGILD 

LTGGALNLAAGGSSTVAGGLTGAG 

TLNINGGDLAVSATNSGLSGQTHIADVASVTLTG 

TGTLGTSAVEVLGTLNLNGANAAM 

TNVLSGGGVINTNAAVTLSGNNSFSGAHQIGTDG 

ELTVGQASNLGASSATVNLGTLTS 

HLILNGVSESIANVLSGVAGSTVDIIGGADTALT 

ANNSGFLGQYALAGNSKLTVASTN 

NLGASSSVALAGAGDTLSLSGFNGTFGNSVTGSG 

VLQVTDDAEVTLTSSNGVGNTVKV 

DIADATLNLNDIALFDHVLTGNGTLNVAKNLATT 

AFDFGSTVGGAFSGIVNLTNTTFA 

LSADNAAALARATLKLSDDSVTTVGTTDRILHGL 

DLNGGTLIFDGSPPQSQANGWTV

TDLALNSGTISITGAGNWENEHPVTPPNVSLLEQ 

DRGDILLQLIDADNVTGNANDLEL 

MINGTTISAGQGVQSTVQQGGYTVANATHNYGMT 

SNGGSGLYVNYTLSALELLADGAN 

ALLLATESGLTANRELNAELSGVGGLWDAQNGA 

LTLANGNNRYEGTTTVTAGELILG

ANGAFGQTSLLNIASGASANINGYRQTVGAVTNT 

GTVTLGNGGELTSTDTLINTGMIN 

VTDGILNLENGGASSISGGLTGNGILNIKGGDFT 

ISIDNNGLAGQTNISDGASVTLGN

GGTIIGTGNLGSSVIDVLGDLNLVADNSLANVIS

GDGTINTTATVTLSGNSSFSGAHQ 

IGTNGELTVGQASNLGASSATVNLGTLTSHLILN 

GVSESIANVLSGVAGSTVDIIGGA

DTALTANNSGFLGQYALAGNSKLTVASTNNLGAS

SSVALAGTGDTLSLSGFNGTFGNS

VTGSGVLQVTDDAEVTLTSSNGVSNAVTIDIADA

TLNLDDIALFNHALTGNGLLNVAK 

NDASTAFDFGATVGGAFTGTVNLNNSTFDLSGNN 

TTVLAQATLKLSSGNLTSVGNGVQ 

NIGTLAMNGGTLLFDNIVDNAGIITSDGTIAANS

INTTGGGEVRVNLPSNLAPSLDGL 

SVMELDEGEIIVTLATGAATGTGHELTLTDENGD

PISAVTYQGVHNAGSTSAAATGSF

NYGMTTGEDYDGLYVNYGLTALELLSTGSEALVL 

TAILANNGTQSNDLSA 

QITGSGDL 

AFASANDGSTASLSNSTNSYTGTTWVSSGNLRLD 

ADSALGQTSLLAMSTATHVDINGT 

QQWGELATEGGSTLDLNDGKLTVTGGGQIDGAL 

TGGGELVLSGGLLNVSYDNAGFTG 

STDIANGAVAHLSQAQGLGNGTINNNGTLHLDNT 

IGTLFNALTGSDGEVLLSNNASVQ 

LAGDNSGYSGLFTNQAGSILIANSAEHLGGSSIA 

NSGALILDTGSVWELTNTISGTGT 

LVKRGSGTVKIEGDTVSAGLTTIEEGLLQLGSSA

VTQTLSLEESLQERALLVSFASNM 



gene 
CDS 



NALTGGDFGTFTIDGNYTGDEGMI 
TFNTILAGDTSVTDRLVITGDTAGQSYVTVNNIG 
GVGARTFEGIKIIDVGGDSAGQFT
LNGRAVGGAYEYFLYQGGASTPDDGNWYLRTEAD 
DRRPEPASYTANLAAANNMFVTSL 
ADRMGETLYTDVFTGEQKTTSLWLRNEGSHNRSR 
DDSGELKTQDNRYVMQLGGDVAQW 
SRNAQDLWRVGVMAGYANSSSSTVAQVAGYRSTG
SVDGYSVGIYGSWLADNADDTGAY
VDSWVQYSWFDNRVSGQDLATEKYDSKGFTASVE 
GGYAFKVGESVNQSYFIQPKAQW 
WMGVKADDHTETNGTVISGDGNGNIQTRLGAKAF 
INPSDKAKVSGPAFKPFVEANWIH 
NTKDFGTTLDGVTVKQAGTANIAELKLGVDGQVN 
SQLNLWGNIGQQVGNKGYSETSWLGVKYNF"
complement(12049..13929) /gene="YPO1005"
complement(12049..13929) /gene="YPO1005"



misc-feature complement (13099.
misc-feature complement (13219
misc-feature complement (13294
misc-feature complement (13468



/note="Similar to neighbouring
CDSs YPO1007 and YPO1006. Similar
to Salmonella typhimurium
leucine-rich repeat protein SlrP
TR:Q9XCV2 (EMBL:AF127079) (765 aa)
fasta scores: E(): 0, 39.0% id in
644 aa, and to Shigella flexneri
65.4 kDa antigen IpaH 4.5
SW:IPA4-SHIFL (P18009) (574 aa)
fasta scores: E(): 0, 40.4% id in
589 aa"
/codon-start=1
/transl-table=11
/product="putative antigenic
leucine-rich repeat protein"
/protein-id="CAC89848.1"
/db-xref="GI:15979074"
/db-xref="SPTREMBL:Q8ZHA0"
/translation="MNLSNITSNVSMPNIEPDRE
IHSARTSTAALTPADYYAIWEKWE
SEATPGTDEQRRFAVECMKDCLEKNTYHLCLRDL
NLASLPDILPPCNELDLMGNKLTE 
LPATLPDNLQKLNASFNQLRTLPDTLPASLLSLN 
VYGNELERLPESLPEGLKELDVND 
NESLQLPNRLPPNLESLGIASCGLTELPTLPNSL 
KRLDADSNQLRTLPDTLPISLLNL 
SVTSNQLTQLPETLPASLSFLMVLSNRLTKLPEN 
LPGSLRCISAEYNQLSQLPDLARL 
PQNCEILLEGNPLSTSTLQVLQHLRINPYYQGPR 
INWSELDNLPPASLRNIVATWLPP 
EQQNRLAGDWANIETEANSAAFSVFLHRLATTQN 
ANNIPEFKQQIAAWLLQLADSPTL 
REQTFLIAQEASATCEDRITLTHNDMQKAVMLHE 
VEKGKYDEKLPELMARGREMFRLE 
QLENIAREKVKILKTLNVNSVDDIEVYLAYQVKL 
LNSLQLSSVNKEMRFFGVSHVTAD 
DLLSAETRVKTAENQDFSRWLSQWSPWKSWQRI 
EPERYAAAVEKQYHALENIYPDKL 
AAELAANGMTGDVDANRIVGKRINDELMGEIDMA 
LTHEVLSAKGASSLLDNLWMEYLISP"
13164 /gene="YPO1005"

/note="Pfam match to entry PF00560
LRR, Leucine Rich Repeat, score
9.30, E-value 44"
.13290 /gene="YPO1005" 

/note="Pfam match to entry PF00560 
LRR, Leucine Rich Repeat, score 
12.80, E-value 8.4" 
.13353 /gene="YPO1005" 

/note="Pfam match to entry PF00560 
LRR, Leucine Rich Repeat, score 
10.40, E-value 30" 
.13539 /gene="YPO1005" 



/note="Pfam match to entry PF00560 
LRR, Leucine Rich Repeat, score 
17.30, E-value 0.36" 
complement(13543..13602) /gene="YPO1005"
/note="Pfam match to entry PF00560
LRR, Leucine Rich Repeat, score
12.90, E-value 7.8"
complement(14492..15367) /gene="YPO1006"
complement(14492..15367) /gene="YPO1006"

/note="Similar to neighbouring
CDSs YPO1005 and YPO1007. Similar
to regions of Salmonella
typhimurium secreted protein h2
TR:Q9RPH0 (EMBL:AF160727) (788 aa)
fasta scores: E(): 2.5e-24, 41.9%
id in 272 aa, and to regions of
Yersinia pestis outer membrane
protein YopM TR:O68701
(EMBL:AF053946) (409 aa) fasta
scores: E(): 7e-23, 40.5% id in
259 aa"
/codon-start=1
/transl-table=11
/product="putative antigenic
leucine-rich repeat protein"
/protein-id="CAC89849.1"
/db-xref="GI:15979075"
/db-xref="SPTREMBL:Q8ZH99"
/translation="MYLSNITSNVSMPNIGPDRE
IHADRPAATALTPADYHAIWEKWE 
NDPRTVAGEQRGQAVARMKECLENNAERLNLSSL 
DLTSLPDTLPPCNELNIICNNLTE 
LPTTLPDNLQTLKASYNQLRTLPNTLPASLLSLK 
VHMNELERLPEPLPEGLKTLDVGC 
NTSLQLPSRLPPVLESLDISNCNLTELPTLPNSL 
KELDAHGNQLRILPDTLPISLLRL 
NVAYNQLTALPENLPGSLRCIYTEYNQLSQLPDL 
AHLRQNCNICLDGNPLSPSTLLAL 
LRLSTKPNYQGPRIS"
complement(14657..14728) /gene="YPO1006"
/note="Pfam match to entry PF00560
LRR, Leucine Rich Repeat, score
15.40, E-value 1.3"
complement(14732..14791) /gene="YPO1006"
/note="Pfam match to entry PF00560
LRR, Leucine Rich Repeat, score
9.30, E-value 43"
complement(14981..15040) /gene="YPO1006"
/note="Pfam match to entry PF00560
LRR, Leucine Rich Repeat, score
11.30, E-value 22"
complement(15095..15163) /gene="YPO1006"
/note="Pfam match to entry PF00560
LRR, Leucine Rich Repeat, score
9.40, E-value 43"
complement(15575..17392) /gene="YPO1007"
complement(15575..17392) /gene="YPO1007"

/note="Similar to neighbouring
CDSs YPO1005 and YPO1006. Similar
to Salmonella typhimurium
leucine-rich repeat protein SlrP
TR:Q9XCV2 (EMBL:AF127079) (765 aa)
fasta scores: E(): 0, 36.9% id in
642 aa, and to Shigella flexneri
65.4 kDa antigen IpaH
SW:IPA4-SHIFL (P18009) (574 aa)



misc-feature complement (16625.
misc-feature complement (16757.
misc-feature complement (16931.
misc-feature complement (17006
misc-feature complement (17117



gene 18111..19667
CDS 18111..19667



569 aa, and to Salmonella
typhimurium secreted protein H2
TR:Q9RPH0 (EMBL:AF160727) (788 aa)
fasta scores: E(): 0, 40.3% id in
608 aa"
/codon-start=1
/transl-table=11
/product="putative antigenic
leucine-rich repeat protein"
/protein-id="CAC89850.1"
/db-xref="GI:15979076"
/db-xref="SPTREMBL:Q8ZH98"
/translation="MNLSNITSNESMPNIEPDRE
IHSARTSTAALTPADYYAIWEKWE 
NDPRTVAGEQRGQAVARMKECLENNTERLDLDEL 
GLTSLPDTLPPCNKLNIIENKLTE 
LPTTLPDNLQTLNAAFNQLRTLPNTLPASLLSLN 
VYGNELERLPESLPEGLKKLDVGR 
NESLQRPNRLPPNLESLGMANCRLTELPTLPNSL 
EKLEVDNNQLHTLPDTLPALLSSL 
LVSSNRLTALPENLPGSLRDIYAKDNQLSQLPDL 
AHLPQNCSIRLDGNPFSTSTLQAL
QHLYINLDYQGPRISCRELDNLPPVSLRNIVATW 
VPPEQQKSLAEDWAKIEKETNSGD 
FNVFLCRLATTQNVKNIPEFKQQIAAWLLQLAGS 
FTLREQTFLIAQEASATCEDRITL 
THNDMQKAVMLHEVEKGKYDEKLPELIARGREMF 
RLEQLENIAREKVKTLKTLNVHSV 
DDIEVYLAYQVKLRGSLELSSVNKEMRFFGVSDV 
TTDDLLSAETRVKTAENQDFPRWL 
SQWSPWKSWQRIEPERYAAAVEKQYHALENIYP 
DKLAAELAANGMTGDVDANRIVGK 
RINDEIMEEIDMALTHEVLSAKGASSLLDNLWME 
SLISF" 
.16690 /gene="YPO1007" 

/note="Pfam match to entry PF00560 
LRR, Leucine Rich Repeat, score 
13.70, E-value 4.6" 
.16816 /gene="YPO1007" 

/note="Pfam match to entry PF00560 
LRR, Leucine Rich Repeat, score 
10.80, E-value 26" 
.17002 /gene="YPO1007" 

/note="Pfam match to entry PF00560 
LRR, Leucine Rich Repeat, score 
17.30, E-value 0.36" 
.17065 /gene="YPO1007" 

/note="Pfam match to entry PF00560 
LRR, Leucine Rich Repeat, score 
9.90, E-value 36" 
.17188 /gene="YPO1007" 

/note="Pfam match to entry PF00560 
LRR, Leucine Rich Repeat, score 
10.00, E-value 34" 
/gene="YPO1008" 
/gene="YPO1008" 

/note="Similar to Vibrio cholerae
hypothetical protein VC0863
TR:Q9KTN8 (EMBL:AE004171) (519 aa)
fasta scores: E(): 0, 47.5% id in
512 aa, and to Neisseria
gonorrhoeae putative efflux pump
component MtrF TR:Q9RNX2
(EMBL:AF176820) (522 aa) fasta
scores: E(): 0, 37.2% id in 524
aa, and to Escherichia coli
arsenical pump membrane protein
ArsB SW:ARB1-ECOLI (P08691) (429
aa) fasta scores: E(): 1.1, 26.1%
id in 111 aa"



misc-feature 18198..18257
misc-feature 18300..18368
misc-feature 18381..18449
misc-feature 18492..18560
misc-feature 18597..18665
misc-feature 18735..18803
misc-feature 18921..18974
misc-feature 19032..19100
misc-feature 19161..19229
misc-feature 19272..19340
misc-feature 19359..19412
misc-feature 19440..19508
misc-feature 19545..19613



/transl-table=11
/product="putative integral
membrane efflux protein"
/protein-id="CAC89851.1"
/db-xref="GI:15979077"
/db-xref="SPTREMBL:Q8ZH97"
/translation="MSDVKPQNDRQTGILNAIER
IGNAMPDITMLFIYALGVCWALSF
GLSFINFDYFHPLTGDQIKVTNMLAPVELVTFIT

TMVKNFINFPPLAITIVATLGIGI 

AEGSGFIQVMLKKMLNVTPQRILTPAWFIGWS 

HVASDSAYVILMPVAAMMFYASGR 

HPLAGIAAAFAGLAGGFSASYTPSIIDPIMQGFT

QGAAHIIDPSYNVNVLCNYFLSFG 

STFGVILVCWFITDKIVEPRLLKSMPLNSDLSGL 

QSNNELQKVTPQESKAFKRAGLVM 

LLMLIGLAAALYPENSLLRSPDGSLTRPDAPIMQ 

AIVPLLFFMFAIPGLIYGFSSGTF 

KSTKDVTASMENITRSLVSFLVFSFFCAQFLYSF

SHSNIGTLMAISGADFLRTLAMPA 

GFTILGVILLSCALDIIITSASSKWSILAPILVP 

MLMAVGISPELTQASFRVSSTVNI 

STPMFAFYPLIIMYCRQYCSKTGVGTLCAMMLPY

TLGLTWLTLMLYLFWGLDIPLGF 

QSGYTYPATQ" 

/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1008"
/note="one of 13 probable
transmembrane helices predicted
for YPO1008 by TMHMM2.0"
/gene="YPO1009" 
/gene="YPO1009" 
/note="Similar to Bacillus
subtilis Peptidase T PepT
SW:PEPT-BACSU (P55179) (410 aa)
fasta scores: E(): 0, 39.6% id in
396 aa, and to Escherichia coli
Peptidase T PepT SW:PEPT-ECOLI
(P29745) (408 aa) fasta scores:
E(): 0, 38.2% id in 408 aa"
/codon-start=1
/transl-table=11
/product="probable peptidase"
/protein-id="CAC89852.1"
/db-xref="GI:15979078"
/db-xref="SWISS-PROT:Q8ZH96"
/translation="MTIALSEQLTNRFFRYLAVS
SQSDAASPTLPSTEGQHKMAQMLA
EELRQLGLEDILIDEHATVTARKPGNQPTAPRIG
FITHIDTVDVGLSPDIHPQRLRFT 
GSDLCLNAEQGIYLRTAEHPEILRYQGEEIIFSD 
GTSVLGADNKAAVTWMTLLENLT 
ADDCHGDIWAFVPDEEIGLRGAKALDLARFDVD 
FAYTIDCCELGEWYENFNAASAE
IDIIGVTAHPMSAKNVLINPIRVAYDIISEFSPQ 
ETPEHTEGREGYVWFTDMTANPNS 
AKLKIAIRDFDNVSFAARKAYIGEWAKVSAQYP 
RAKISYSVTDVYSNISNSIGEDKR 
AIDLIFSSMAELNIEPKVIPMRGGTDGAALSTQG 
LLTPNYFTGAHNFHSPFEFLPISS 
FVKSYQLTRTICLSAAKA"
/gene="YPO1009"

/note="Pfam match to entry PF01546
Peptidase-M20, Peptidase family
M20/M25/M40, score 19.30, E-value
0.00013"
/gene="YPO1009" 

/note="Pfam match to entry PF01546 
Peptidase-M20, Peptidase family 
M20/M25/M40, score 15.90, E-value 
0.0012" 

/gene="YPO1009" 

/note="Pfam match to entry PF01546
Peptidase-M20, Peptidase family
M20/M25/M40, score 15.90, E-value
0.0013"

/gene="YPO1011" 
/gene="YPO1011" 

/note="Similar to Azospirillum
irakense putative outer membrane
salicin receptor salC TR:Q9LAE4
(EMBL:AF143227) (726 aa) fasta
scores: E(): 0, 25.3% id in 730
aa, and to Escherichia coli
probable TonB-dependent receptor
YncD SW:YNCD-ECOLI (P76115) (700
aa) fasta scores: E(): 2.1e-15,
22.9% id in 721 aa"
/codon-start=1
/transl-table=11
/product="putative TonB-dependent
outer membrane receptor"
/protein-id="CAC89853.1"
/db-xref="GI:15979079"
/db-xref="SPTREMBL:Q8ZH95"
/translation="MMVFYRKFFTVTVLISLPCL

AWSQSHNNSKDELDTITWAQKIN 

QQQQKTPISISVLTGFDLERENIENIYESIMRIP 

NVYMVKAGNPSDAGFFTMRGTTPG 

MEGIQSVGFFIDGVYANTFDTELLDVDRIEVLRG 

PQATLYGRNTESGVINVITKDPEF 

SPEYKIGLSYGNYNRTQVTTVLGGSINDSEQFSY 

RAALKYLYGNGYFKRDYDGKNNVD 

NLNDFSGRFKLRWQPMDDGWDVMTTFDIQNRRNG 



misc-feature 21365..21487
23618..26928
23618..26928
misc-feature 23705..25051
misc-feature 25637..26854
misc-feature 26081..26131
misc-feature 26929..28882
misc-feature 26929..26956
gene 27015..28037
CDS 27015..28037



SDVDAYKGQVNAVYTFDDIDFTSVSAYVDERKVD 

NQDLDFTRLSISELLMNRKTKQFS 

QEFRLNSKYSGPFNWLIGSYYFYQDDENEIDFRY 

LPYNLAQLRKSDIKTNNYAVFGNV

NYYLLNDVELVAGARYDYEKKKLNFLMDNGFNPY 

QPYSHDNNSNSFGAFLPKVGLNYY 

ITGDAMLYTSIARGYKSGGFNTLGPQSSRAYNAE

YMTTYEAGVKTEWFDRTVRWNTSL 

FWNDMKDQQVEVAYYPISYSVNSGKSLSRGLESE 

LAWRITRGLTVSANVGYTDAYFKN

FPTEIKVDNNYIPVNYKGNRPANSPGYTYSIGAD

YNFLNGYFVNATYNVKGSTYLDNA 

NSKKQPAYGLLDLTAGYENKDYGVNVWIKNILDE 

TYVTRAFKMDDGIWYGRAGEPINFGVNFNVKF"
/gene="YPO1011"
/note="PS00430 TonB-dependent
receptor proteins signature 1."
/gene="YPO1011"
/note="Pfam match to entry PF00593
TonB-boxC, TonB dependent receptor
C-terminal region, score 28.70,
E-value 1.3e-06"

/gene="YPO1012" 

/pseudo 

/gene="YPO1012" 

/note="This CDS is disrupted by
the insertion of IS100 element.
Similar to regions of polyketide
synthases/peptide synthetases eg.
Bacillus subtilis MycA TR:Q9R9J1
(EMBL:AF184956) (3971 aa) fasta
scores: E(): 0, 29.5% id in 1112
aa, Streptomyces coelicolor
polyketide synthase SC3f7.12
TR:O54155 (EMBL:AL021409) (2297
aa) fasta scores: E(): 0, 31.1% id
in 1199 aa and Sorangium
cellulosum polyketide synthase
EpoC TR:Q9L8C7 (EMBL:AF210843)
(7257 aa) fasta scores: E(): 0,
39.3% id in 499 aa."
/pseudo
/codon-start=1
/transl-table=11
/product="putative
peptide/polyketide synthase
subunit (pseudogene)"
/gene="YPO1012" 

/note="Pfam match to entry PF00501
AMP-binding, AMP-binding enzyme,
score -62.80, E-value 6.5e-12"
/pseudo
/gene="YPO1012"
/note="Pfam match to entry PF00550
pp-binding, Phosphopantetheine
attachment site, score 29.50,
E-value 1.5e-06"
/pseudo
/gene="YPO1012"
/note="Pfam match to entry PF00109
ketoacyl-synt, Beta-ketoacyl
synthase, score 419.00, E-value
4.3e-122"
/pseudo 

/gene="YPO1012" 
/note="PS00606 Beta-ketoacyl 
synthases active site." 
/pseudo 

/note="insertion sequence, IS100"
/note="IS100 inverted repeat"
/gene="YPO1013"
/note="synonyms: y1093, ypmt1"
/gene="YPO1013"
/note="Similar to Escherichia coli
transposase for insertion sequence



misc-feature 27039..27122
27360..27893
28034..28816
28034..28816



(P15025) (390 aa) fasta scores:
E(): 3e-27, 33.1% id in 329 aa.
Identical to the previously
sequenced Yersinia pestis,
Yersinia pseudotuberculosis, and
Escherichia coli pesticin plasmid
insertion sequence transposase
Y1055 TR:P74993 (EMBL:U59875) (340
aa) fasta scores: E(): 0, 100.0%
id in 340 aa"
/codon-start=1
/transl-table=11
/product="transposase for
insertion sequence IS100"
/protein-id="CAC89855.1"
/db-xref="GI:15979080"
/db-xref="SPTREMBL:P74993"
/translation="MVTFETVMEIKILHKQGMSS

RAIARELGISRNTVKRYLQAKSEP 

PKYTPRPAVASLLDEYRDYIRQRIADAHPYKIPA 

TVIAREIRDQGYRGGMTILRAFIR 

SLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRS 

PLHVFVAVLGYSRMLYIEFTDNMR 

YDTLETCHRNAFRFFGGVPREVLYDNMKTWLQR 

DAYQTGQHRFHPSLWQFGKEMGFS 

PRLCRPFRAQTKGKVERMVQYTRNSFYIPLMTRL 

RPMGITVDVETANRHGLRWLHDVA 

NQRKHETIQARPCDRWLEEQQSMLALPPEKKEYD 

VHLDENLVNFDKHPLHHPLSIYDSFCRGVA"
/gene="YPO1013"
/note="Pfam match to entry PF00239
recombinase, Site-specific
recombinases, score 25.70, E-value
4.7e-06"
/gene="YPO1013"
/note="Predicted helix-turn-helix
motif with score 2147 (+6.50 SD)
at aa 18-39, sequence
MSSRAIARELGISRNTVKRYLQ"
/gene="YPO1013" 

/note="Pfam match to entry PF00665 

rve, Integrase core domain, score 

81.90, E-value 1.6e-22" 

/gene="YPO1014" 

/note="synonym: ypmt1.57c"
/gene="YPO1014"
/note="Similar to Escherichia coli
insertion sequence IS21 putative
ATP-binding protein, IstB
SW:ISTB-ECOLI (P15026) (265 aa)
fasta scores: E(): 0, 47.4% id in
249 aa. Also almost identical to
Yersinia pestis and Escherichia
coli putative IS100 transposase
ypmt1.57c TR:Q9R3L5
(EMBL:AL117211) (260 aa) fasta
scores: E(): 0, 99.6% id in 260
aa."
/codon-start=1
/transl-table=11
/product="insertion sequence
IS100, ATP-binding protein"
/protein-id="CAC89856.1"
/db-xref="GI:15979081"
/db-xref="SPTREMBL:Q9R3L5"
/translation="MMMELQHQRLMALAGQLQLE

SLISAAPALSQQAVDQEWSYMDFL 

EHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEY 

DFTFATGAPQKQLQSLRSLSFIER 

NENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRF 

TTAADLLLQLSTAQRQGRYKTTLQ 

RGVMAPRLLIIDEIGYLPFSQEEAKLFFQVIAKR 

YEKSAMILTSNLPFGQWDQTFAGD 

AALTSAMLDRILHHSHWQIKGESYRLRQKRKAG 

VIAEANPE" 
29051..29659
29051..29659
misc-feature
29656..30126
29656..30126



/note="Pfam match to entry PF01695
IstB, IstB-like ATP binding
protein, score 367.20, E-value
1.7e-106"
/gene="YPO1014"
/note="PS00017 ATP/GTP-binding
site motif A (P-loop)."
/note="IS100 inverted repeat"
/gene="YPO1014a"
/partial
/gene="YPO1014a"

/note="Similar to the C-terminal
regions of Escherichia coli
prepilin peptidase dependent
protein A PpdA SW:PPDA-ECOLI
(P33554) (156 aa) fasta scores:
E(): 0.00089, 41.304% id in 46 aa,
and Escherichia coli O157:H7
prepilin peptidase dependent
protein A ECS3683 TR:BAB37106
(EMBL:AP002563) (156 aa) fasta
scores: E(): 0.00089, 41.304% id
in 46 aa. Probable gene remnant
resulting from insertion of the
upstream IS element and
recombination."
/codon-start=1
/transl-table=11
/product="prepilin peptidase
(partial)"
/protein-id="CAC89857.1"
/db-xref="GI:15979082"
/db-xref="SPTREMBL:Q8ZH94"
/translation="QITLPLQKEMGFYGVRNTAQ
AGNIILSSPAGRIRLVISSRGRIR
LCSEQQSMAGIHLCL"

/gene="YPO1015" 

/gene="YPO1015" 

/note="Similar to Escherichia coli
prepilin peptidase dependent
protein B SW:PPDB-ECOLI (P08371)
(187 aa) fasta scores: E(): 5e-17,
30.4% id in 191 aa, and N-terminus
of Pseudomonas aeruginosa general
secretion pathway protein J
precursor SW:GSPJ-PSEAE (Q00517)
(237 aa) fasta scores: E(): 0.19,
39.3% id in 61 aa"
/codon-start=1
/transl-table=11
/product="putative prepilin
peptidase dependent protein"
/protein-id="CAC89858.1"
/db-xref="GI:15979083"
/db-xref="SPTREMBL:Q8ZH93"
/translation="MPLWNVLSTPLNAQPTRTA
GFTLPEMMLALSFGSLIALSTAQV 
LPKLSQQISVLQQHYRLELVMNQAMGAMEKDLRR 
AGFCHGKCQGEAITIEHYLGETAH 
SCLIVAYDLNCNGRWEGAKHQESEYFGYRLRNKA 
LESQRGELNCHGRGWEKLFDPRDV 
TVTHFSVTPLSGQLFKLRLVGHKTGNPAIHHQVT 
YLIRGNNV" 
/gene="YPO1015" 
/note="PS00409 Prokaryotic
N-terminal methylation site."
/gene="YPO1016" 
/gene="YPO1016" 

/note="No significant database
matches"
/codon-start=1
/transl-table=11
/product="putative membrane
protein"
/db-xref="GI:15979084"
/db-xref="SPTREMBL:Q8ZH92"
/translation="MTQVFQRGSSTLAAVMTLFS
LGLFWLSAIHRQLDNIQQITGEEQ
RYLRAYNQAESSLNWGVSQRWALRIPWRVGSAWH
CMAHQELGLKACVKRSSLAGFFIL
KGESLPLGSLPPLMLYQRVKLKAVTGSSGNYQLI
DTPHGWLDFCPDKDAQFCLD"
/gene="YPO1016"
/note="1 probable transmembrane
helix predicted for YPO1016 by
TMHMM2.0"
/gene="YPO1017" 
/gene="YPO1017"
/note="Weakly similar to
Escherichia coli prepilin
peptidase dependent protein C
precursor PpdC SW:PPDC-ECOLI
(P08372) (107 aa) fasta scores:
E(): 0.42, 23.7% id in 114 aa, and
to Pseudomonas aeruginosa PilV
TR:Q57003 (EMBL:L36117) (185 aa)
fasta scores: E(): 0.91, 32.8% id
in 58 aa"
/codon-start=1
/transl-table=11
/product="putative prepilin
peptidase dependent protein"
/protein-id="CAC89860.1"
/db-xref="GI:15979085"
/db-xref="SPTREMBL:Q8ZH91"
/translation="MPRLITKHAYGQTISVETGG
FNLRHLRGFSLLEVLIAALFFSIS
LLGLLQYHQALLQGFASSWQQRQAWSWLHQYMES
QSGTDPEERWAPEVKPGWHYRQFI
DRIEGDCREFSVKLITRQDSPVELSRWFCSTEMI
II"
/gene="YPO1017"
/note="PS00409 Prokaryotic
N-terminal methylation site."
/gene="YPO1017"
/note="1 probable transmembrane
helix predicted for YPO1017 by
TMHMM2.0"
/gene="recC"
/note="synonym: YPO1018"
/gene="recC"
/EC-number="3.1.11.5"
/note="Similar to Escherichia coli
exodeoxyribonuclease V gamma chain
RecC SW:EX5C-ECOLI (P07648) (1122
aa) fasta scores: E(): 0, 63.5% id
in 1123 aa, and to Vibrio cholerae
exodeoxyribonuclease V, 125 kDa
subunit VC2322 TR:Q9KPP4
(EMBL:AE004302) (1148 aa) fasta
scores: E(): 0, 45.8% id in 1130
aa"
/codon-start=1
/transl-table=11
/product="exodeoxyribonuclease V
gamma chain"
/protein-id="CAC89861.1"
/db-xref="GI:15979086"
/db-xref="SPTREMBL:Q8ZH90"
/translation="MFTVYHSNQLDLLKALTTAL
IEREPLDNPFQQEWLVQSPGMAQ
WLQMQLAQQFSIAANIVFPLPATFIWDMFTRVLP

DIPKESAFSKEAMTWKLMWLLPDL 

LENPLFSPMKRYLSDDGDRRKIHQLAARVADLFD 

QYLVYRPEWLESWERGQLIEGLDD 

AQQWQALLWVELTRYTRQLEQPEWHRANLYQRFI 

HQLLKSDVCPQGLPKRVFICGISA 

LPPIYLQALQALGKHIDIHLMFTNPCRYFWGDIQ 

DYTFLAKLQSRKRRHYRESIELSL



MYLLSQIDEIQEVHAFVDIEPDNL 

LHGIQHDMLELEDHAVIGTTPETLARSDQKRRLD 

LDDRSLSFHVCHSPQREVEVLQDH

LLGLLAADPELTPRDIIVMWDIDSYTPYIQAAF

GNAPSERYLPFAISDRKASQAHPA 

LHAFITLLDLPQSRFTAEQVLALLEVPALATKFG 

ITEDGLRRLRQWVGESGIRWGLDD 

DNVRELSLPATGQHTWRFGLTRMLLGYAMDSTAG 

DWQGILPYDESSGLAAELxAGQLAD 

MLMHLSQWRQQLGQPRELSEWLPICRQLLDTFFD 

QDNDTEAALVLIEQQWQKVIGYGI

AAQYPDWPLNLLRDELAARLDNERISQRFLAGP

INFCTLMPMRSIPFKWCLLGMND 

GVYPRTLPPQGFDLMAKKVRRGDRSRRDDDRYLF 

LEALLSAQQQLYISYIGRSIQDNS 

KRYPSVLVSELIEYVAQSYHLPGDEKLSADDSAQ 

RVTQHLLCWHARMPFSAENFIKNS 

ELQSYAAEWLPSAESKGHAHPNFNQPLQAEPLAE 

ITLDELVRFYRHPVRAFFQLRLGV 

NFVIEETELPDEEPFTLDNLSRYQFNTQLLNALI 

NEDDINSVFARARAAGVLPYGSFG 

ELYWESQQDEMVPLAEQIRSERKENHSIELNIEF 

ADITVTGWIHQVQDDGLVRWRPSI

LTAVDGLLLWLEHLVYCSAGGEGESRIYGRKGTA 

WRYAPMAADEARPYLQQLIKGYQQ 

GLCEPLMLLSKSGWAWLSQCFDRESGQILWDEET 

QGKARMKLLQVWQGDQRVTGEGED 

HYIQRVCRRMDNQHLDIILHETERYLLPIARHNK 

A" 

/gene="ptrA" 

/note="synonyms: ptr, YPO1019"
/gene="ptrA"
/EC-number="3.4.24.55"
/note="Similar to Escherichia coli
protease III precursor PtrA
SW:PTRA-ECOLI (P05458) (962 aa)
fasta scores: E(): 0, 63.0% id in
947 aa, and to Vibrio cholerae
peptidase, insulinase family
VC2072 TR:Q9KQC8 (EMBL:AE004281)
(939 aa) fasta scores: E(): 0,
28.0% id in 919 aa"
/codon-start=1
/transl-table=11
/product="protease III precursor"
/protein-id="CAC89862.1"
/db-xref="GI:15979087"
/db-xref="SPTREMBL:Q8ZH89"
/translation="MHKQLARIVGIVFFLGLLAP

SSWAAIPAWQPLAETIYKSEHDLR 

KYQAIKLSNGMTVLLVSDTQAPKSLAALALPVGS 

LEDPDNQLGLAHYLEHMVLMGSKH 

FPEPGSFSEFLKKHGGSHNASTASYRTAFYLEIE 

NDALAPAVERLADAIAEPLLDPIN 

ADRERNAVNAELTMARSRDGMRMAQVNAETLNPA 

HPSARFSGGNLETLRDKPDGKLHD 

ELVSFYHRYYSANLMVGVLYSNQSLEQLAQLAAD 

TFGRIPNRDAKVPTITVPWTPDQ 

TGIIIHYVPAQPRKQIKVDFRIANNSADFRSKTD

TYISYLISNRSKNTLSDWLQKQGL 

ADAINAGADPMLDRNGGVFSITVSLTDKGLAQRD

VWAAIFDYINMLHKEGIKKSYFD 

EIAHVLNLDFRYPSITRDMDYIEWLVDMMLRVPV 

AHTLDAPYLADQYDPKAIASRLAE 

MTPENARIWFVSPEEPHNKVAYFVDAPYQVDKIG 

VQRMKEWQQLGQKIALSLPALNPY 

IPDNFTLIKADKNITRPQNVADQPGLRVFYMPSQ

YFADEPKADITVAFRNPHALNSAR 

HQVLFALTDYLAGLSLDQLSYQASIGGISFSTAP 

NNGLYVNAGGFTQRMPQLLTSLVS 

GYASFTPTEEQLVQAKSWYREQLDVAEKGKAYEL 

AIQPAKLLSNVPYSERSERRKLLD 

SISVQDVLTYRDDLLKQSAIEVLAVGNMTAEQVT

ELTESLKKQLNLIGTTWWVGEDVI

IEKTQLANMERVGSSSDAALAAVYVPTGYTEIAG



misc-feature 34274..34693
misc-feature 34340..34411
gene 36997..40659
CDS 36997..40659



LGYAVFSFPMSVGHQWGIGFLLQSNSKEPNYLYQ 

RYLAFYPQAEKRLREMKPDDFEQY 

KQGLVNQLLQRPQTLDEEAERYRKDFNLNNFAFD 

SREKMIAQVKQLTANELADFFQQA 

VIKPQGLALLSQVKGQGQAGGFAVPEGWTTYPTT 

SALQATLPQKVLAP"
/gene="ptrA"
/note="Pfam match to entry PF00675
Peptidase-M16, Insulinase
(Peptidase family M16), score
166.30, E-value 5e-46"
/gene="ptrA"
/note="PS00143 Insulinase family,
zinc-binding region signature."
/gene="recB"
/note="synonyms: rorA, YPO1020"
/gene="recB"
/EC-number="3.1.11.5"
/note="Similar to Escherichia coli
exodeoxyribonuclease V beta chain
RecB SW:EX5B-ECOLI (P08394) (1180
aa) fasta scores: E(): 0, 62.0% id
in 1190 aa, and to Vibrio cholerae
exodeoxyribonuclease V, 135 kDa
subunit VC2320 TR:Q9KPP6
(EMBL:AE004302) (1208 aa) fasta
scores: E(): 0, 48.0% id in 1203
aa"
/codon-start=1
/transl-table=11
/product="exodeoxyribonuclease V
beta chain"
/protein-id="CAC89863.1"
/db-xref="GI:15979088"
/db-xref="SPTREMBL:Q8ZH88"
/translation="MTPTTPQRLEPLALPLYGER
LIEASAGTGKTFTIGVLYLRLLLG
LGGDAAFRRPLMVEEILWTFTEAATEELRGRIR 
DNIHGLRIACVRGVSDDPMYQALL 
AEITDWAVAAAQLLAAERQMDEAAIYTIHGFCQR 
MLANNAFESGILFEQTLVQDELPL 
WRQACADFWRRHCYPLPLVIARAVSQEWSGPEAL 
LSDLSAYLQGETPKFRQAPGDDET 
ILSRHQHWTQIDALKAEWRLAAPELEALISRSG
VDKRSYSSKHLPNWLEKVGLWAEQ 
ETEDYQLPKELEKFRQSVLLEKTKKGAAPQHTLF 
SAIDSIFEQPLTLRDLILARAISE 
IRFSVQQEKRQRAELGFDDLLSKLDAALQQSGGE 
LLAQSIRTRYPVAMIDEFQDTDPQ
QYRIFHTVYGGQEACALLLIGDPKQAIYAFRGAD
IFTYIRARSEVSAHYTLETNWRSS
FPMVQSVNRLFSLVNTPFLFEQIPFINVAAAKKN
HDLSFEIKGEKQPALHFWLQPGDG
VGVSEYQQVMARQCAAQIRDWLTAGQQGQAELLT
TKGPKPVQASDITILVRSRAEAAL
IRDALSALAIPSVYLSNRDSVFETTEAKDLLWLL
QAVLAPEQERALRSAMATGILGLD 
ARMLDALNHDERAWDALVDEFDHYRQHWQRRGVL 
PMLREMMARRHLAENLLATPGGER 
RLTDLLHLGELLQEAASQLDSEHALIRWLAQQIA
QPNPQSDSQQLRLESDRHLVRWT 
IHKSKGLEYPLVWLPFIGNFRQQQNVLYHDRHSF 
EALLDLNADEESQALAEEERLAED 
LRLLYVALTRSVYHCSVGIAPLIKGGRKKQGESD 
MHRSALGYLVQHGQPGDAHVLSEK 
LTELARSSNGDICVSPVEPPDDKPWQPQPEALPE
LTARHFTRKMQDYWRVTSYSGLQQ 
SGSTKKLYGSATGDVSQALGAPLQVLLPRLDIDA 
VGEQAWTEDDHLTPHTFPRGAAP 
GTFLHDLLEPLDFSQPIELAWLSERLQQQGFSEQ 
WAPVLHQWLNDIVQTPLTDTGATL 
AGLSPERKQAELQFYLPIDSLLRAEVLDRLIKQY 
DPLSRQCPVLDFQQVRGMLKGFID 
LVFCWQGQYYLLDYKSNWLGEDSRAYTVEAMTQA 
MAEHRYDLQYQLYTLALHRYLRHR 



misc-feature 37006..39120
misc-feature 37066..37089
gene 40656..42614
CDS 40656..42614
misc-feature 41223..41246
misc-feature 42384..42524
gene complement (42769
CDS complement (42769



PEQKLIEEMDRLFSGTIGSNATAS 
DKSGSNVIHSGERASR" 
/gene="recB"
/note="Pfam match to entry PF00580
UvrD-helicase, UvrD/REP helicase,
score 367.30, E-value 1.6e-106"
/gene="recB"
/note="PS00017 ATP/GTP-binding
site motif A (P-loop)."
/gene="recD"
/note="synonym: YPO1021"
/gene="recD"
/EC-number="3.1.11.5"
/note="Similar to Escherichia coli
exodeoxyribonuclease V alpha chain
RecD SW:EX5A-ECOLI (P04993) (608
aa) fasta scores: E(): 0, 56.7% id
in 638 aa, and to Vibrio cholerae
exodeoxyribonuclease V, 67 kDa
subunit VC2319 TR:Q9KPP7
(EMBL:AE004302) (706 aa) fasta
scores: E(): 0, 45.3% id in 727
aa"
/codon-start=1
/transl-table=11
/product="exodeoxyribonuclease V
alpha chain"
/protein-id="CAC89864.1"
/db-xref="GI:15979089"
/db-xref="SPTREMBL:Q8ZH87"
/translation="MMTLLAQAARDRLLRPLDVQ

FSRMIAGDDDPRLQLAAAILSAEV 

GAGHVCLPLRYLQPELLFGGRQPDLSLALWQAAG 

SPDKAQWLQALQNAPWSDGSQPT 

PLVLQQERLYLQRMWQYEGDWQFIASDSVFINR 

DSDFMTSKGIATSVDESLLRETLD 

ALFGCAGS EVDWQKVAAAVAATRRI S VI SGGPGT 

GKTTTVAKLLTALIRLSQGQRLRI 

KLAAPTGKAAARLTESLGKAIRQFFLTDDERKLF 

PDQASTLHRLLGVQPNSQRLRYHR 

GNPmLDVLVVDEASMVDLPMMARLIAALPAKAK 

VIFLGDRDQLASVEAGAVLGDICR 

FAELGYS EQRAQQLTQLTGYLLTNNALTSNVLTN 

KALANKAQPDETYSDSANVRDSLC 

LLRKSYRFDEKSGIGQLALAVNAGEYRQALSVLN 

SAYSDVERFPLADEEDYQVLLEAC 

AVGYQHYLERVAALAPAAEVLAAFGRYQLLCALR 

SGPFGVSGLNERIEQVLHRKGFII 

RPSGPSGRWYVGRPVMIELNDSALGLFNGDIGIA 

LHDNEGELRVYFQLPDGNIKSVQP 

SRLPSHETAYAMTVHKSQGSEFEHTALVLPNTFM 

PVLTRELVYTAITRARQHLTLYCS 

DAVLSHAIRTPTLRLSGLVDRLNTLNRQ " 

/gene="recD" 

/note="PS00017 ATP/GTP-binding 
site motif A (P-loop)." 
/gene="recD" 

/note="Pfam match to entry PF01443 
Viral-helicase1, Viral
(Superfamily 1) RNA helicase,
score 29.70, E-value 6.5e-08"
.44094 /gene="argA"

/note="synonym: YPO1022"
.44094 /gene="argA"

/EC-number="2.3.1.1"
/note="Similar to Escherichia coli
amino-acid acetyltransferase ArgA
SW:ARGA-ECOLI (P08205) (443 aa)
fasta scores: E(): 0, 87.3% id in
441 aa, and to Vibrio cholerae
N-acetylglutamate synthase VC2316
TR:Q9KPQ0 (EMBL:AE004301) (456 aa)
fasta scores: E(): 0, 65.8% id in



misc-feature



complement (42856 



/codon-start=1
/transl-table=11
/product="amino-acid
acetyltransferase"
/protein-id="CAC89865.1"
/db-xref="GI:15979090"
/db-xref="SPTREMBL:Q8ZH86"
/translation="MKERSTELVQGFRHSVPYIN
AHRGKTFWMLGGEAIEHENFSSI 
VNDIGLLHSLGIRLVWYGARPQIDSNLADHNYE 
PIYHKHTRVTDARTLEMVKQAAGL 
LQLDITARLSMSLNNTPLQGAHINWSGNFI lAQ 
PLGVDDGVDYCHSGRIRRIDEEAI 
HRQLDNGAIVLLGPVAVSVTGESFNLTSEEVATQ 
LAIKLKAEKMIGFCSSQGVTDSEG 
NI ISELFPNDAQKRIEDLEQDGDYNSGTVRFLRG 
AVKACRSGVRRSHLLSYQEDGALI 
QELFSRDGIGTQIVMESAEQVRRATINDIGGILE 
LIRPLEQQGILVRRSREQLEMEID 
KFTIIERDNLTIACAALYPFPDEHIGEMACVAVH 
PDYRSSSRGEMLLNRITNQARQMG 
LKKLFVLTTRSIHWFQERGFTPAEVDVLPIQKQE 
LYNYQRRSKILLADL " 
43074 /gene="argA" 



misc-feature complement(43387..44022)



gene 
CDS 



44330.
44330. 



.45580 
.45580 



misc-feature



45077. .45553 



/note="Pfam match to entry PF00583 
Acetyltransf, Acetyltransferase
(GNAT) family, score 42.00, 
E-value 1.3e-08" 
/gene="argA" 



/note="Pfam match to entry PF00696
aakinase, Amino acid kinase
family, score 65.00, E-value
1.6e-15"
/gene="YPO1023" 
/gene="YPO1023" 
/EC-number="3.5.1.28"
/note="Similar to Escherichia coli
N-acetylmuramoyl-L-alanine amidase
AmiC precursor SW:AMIC-ECOLI
(Q46929) (417 aa) fasta scores:
E(): 0, 70.4% id in 419 aa, and to
Neisseria meningitidis
N-acetylmuramoyl-L-alanine amidase
AmiC or NMA2028 TR:Q9JT18
(EMBL:AL162757) (416 aa) fasta
scores: E(): 0, 49.9% id in 421
aa"

/codon-start=1
/transl-table=11

/product="N-acetylmuramoyl-L-alani
ne amidase AmiC precursor"

/protein-id="CAC89866.1"

/db-xref="GI:15979091"

/db-xref="SPTREMBL:Q8ZH85"

/translation="MADSNHNSGRRRLLQGAAAA

WMLSISRVGFAASSHIVAVRVWPS 

STYTRVTLESNTPLKYRQFALTHPDRIWDIEGV 

QLNSVLKEI SRQVQSADPYLKQAR 

VGQFDKNTVRLVLELKQS I S PQLFTLKPFAEFRN 

RLWDLYPAEGGTSAEDDPLLALL 

EDYNKGNVDRTLPPETPKAGKAGRDRPI IIMLDP 

GHGGEDPGAIGKNKTKEKDIVLQI 

ARRLRALIQKEANMRVFMTRNEDVFIPLKVRVAK 

ARKLRADLFISIHADAFTNRAASG 

SSVFALSTKGATSTAARFLAQTQNEADQIGGVSK 

SGDPYLDHTIIDLLQTATINDSLK 

FGKEVLSRMSKINKLHKNRVDQAGFAVLKAPDI P 

SILVETAFISNLEEERKLRTSRFQ 

QQI AES I FAGI KAYFANGGAMARL " 

/gene="YPO1023"

/note="Pfam match to entry PF01520
Amidase-3,

N-acetylmuramoyl-L-alanine



1.4e-67" 
45886 /product="tRNA-Met" 

/note="tRNA Met anticodon CAT, 
Cove score 86.07" 
46086 /product="tRNA-Met" 

/note="tRNA Met anticodon CAT, 
Cove score 86.07"
/gene="YPO1025" 
/gene="YPO1025" 
/EC-number="3.2.1.-"
/note="Similar to Escherichia coli
membrane-bound lytic murein
transglycosylase A precursor
SW:MLTA-ECOLI (P46885) (365 aa)
fasta scores: E(): 0, 69.9% id in
366 aa, and to Vibrio cholerae
membrane-bound lytic murein
transglycosylase A precursor
TR:Q9KPQ4 (EMBL:AE004301) (368 aa)
fasta scores: E(): 0, 46.0% id in
359 aa"

/codon-start=1

/transl-table=11

/product="membrane-bound lytic
murein transglycosylase A
precursor"

/protein-id="CAC89867.1"

/db-xref="GI:15979092"

/db-xref="SPTREMBL:Q8ZH84"

/translation="MTSRWGKYLLSGIMIAVLAG

CQSRPTDRGQQYKDGRLEQSLELV 

NEPNAAGKPVNAKDYSDQVKVINQSSPGLYNRNS 

DTFNAVQNWMLAGADTSKLSLFGL 

NAYQMEGVDNFGNVQFTGYYTPVLQARYTPQGEF 

RHPLYRMPAKGKRRLPDRAAIYAG 

ALDNRNLIIAYTNSLVDNFMMEVQGSGYVDYGDG 

RPLTFFGYAGKNGHAYRS IGKVLI 

DRGEVARADMSMQAIRQWAENHSEAEVRELLEQN 

PSFVFFKPVMYAPVKGASAVPLIA 

KASVASDKSLIPPGTTLLAEVPLLDDQGKFTGKY 

QMRLMVALDVGGAI KGQHFDI YQG 

IGHEAGQAAGFYNHYGRVWVLKNAQSSGPLFTAY 

KGGTQSEPTSNDSSLLVNNQDR" 

/gene="YPO1025" 

/note="PS00013 Prokaryotic 

membrane lipoprotein lipid 

attachment site." 

/gene="YPO1026" 

/gene="YPO1026" 

/note="Similar to Escherichia coli
hypothetical protein YgdL
SW:YGDL-ECOLI (Q46927) (268 aa)
fasta scores: E(): 0, 80.5% id in
267 aa, and to Vibrio cholerae
HesA/MoeB/ThiF family protein
VC2311 TR:Q9KPQ5 (EMBL:AE004301)
(273 aa) fasta scores: E(): 0,
65.8% id in 263 aa"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89868.1"

/db-xref="GI:15979093"

/db-xref="SPTREMBL:Q8ZH83"

/translation="MSTAYSEAYQQRFGGIARLY

GQQALALFSQAHVCVIGIGGVGSW 

AAEALARTGIGAITLIDMDDVCVTNTNRQIHALR 

HNIGQAKTEVMAERILAINPECHV 

TCIDDFITADNVAELLNKNFSYVIDAIDSVRPKA 

ALLSYCRRYKIPWTTGGAGGQID 

PTRIAWDLAKT IQDPLAAKLRERLKSDFNWKN 

SKGKLGIDCVFSSEPLVYPQADGS 



VSHVLKKMMAKAARQHESAATAS " 

misc-feature 47701..48114 /gene="YPO1026"

/note="Pfam match to entry PF00899 
ThiF- family, ThiF family, score 
183.20, E-value 4.2e-51" 

misc-feature 48319..48387 /gene="YPO1026"

/note="1 probable transmembrane
helix predicted for YPO1026 by
TMHMM2.0"

gene complement(48480..48923) /gene="YPO1027"

CDS complement(48480..48923) /gene="YPO1027"

/note="Similar to Escherichia coli
hypothetical protein YgdK
SW:YGDK-ECOLI (Q46926) (147 aa)
fasta scores: E(): 5.7e-30, 59.0%
id in 139 aa, and to Vibrio
cholerae hypothetical protein
VC2310 TR:Q9KPQ6 (EMBL:AE004301)
(144 aa) fasta scores: E():
3.5e-19, 43.1% id in 144 aa"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89869.1"

/db-xref="GI:15979094"

/db-xref="SPTREMBL:Q8ZH82"

/translation="MMIAPHPFGHDITATDLIEK

FSAHKQWEDRYRQLILLAKQLPPL 

QEAWKKNELELTGCENRVWLGHQHLPDGTLHFYG 

DSEGRIVRGLLAVILTAVEGKTPQ 

QVLAADPLALFEQLGLRQQLSTSRANGLQALAQG 

VQTIAAKYAER" 

gene complement(49058..50377) /gene="YPO1028"

CDS complement(49058..50377) /gene="YPO1028"

/note="Similar to Escherichia coli
ORF-O401 TR:Q46925 (EMBL:U29581)
(401 aa) fasta scores: E(): 0,
67.1% id in 401 aa, and to Vibrio
cholerae aminotransferase, class V
VC2309 TR:Q9KPQ7 (EMBL:AE004301)
(404 aa) fasta scores: E(): 0,
51.5% id in 396 aa"
/codon-start=1
/transl-table=11
/product="putative
aminotransferase class V"
/protein-id="CAC89870.1"
/db-xref="GI:15979095"
/db-xref="SPTREMBL:Q8ZH81"
/translation="MRPSLMCITSICITSLLKHA
TMIKAKKFRGHSQDANNSMKVFNP 
MDFRREFPALSDKLTYLDSAATALKPRAMIDATQ 
QFYQQDSATVHRSQHQSALSLTVR 
FENTRQQVADFINSSTAENI IWTRGTTEAINLIA 
QSYARPRLQPEDEIIVSEAEHHAN 
LI PWLMVAEQTGAKIVKLPLGLDHLPDLQQLPQL 
LNEKTRI LALGQMSNVTGGS PDLA 
QAIRLAHQYDCWWDGAQGIVHYPADVQALDID 
FYAFSSHKLYGPTGIGVLYGKTEL 
LEEMPAWQGGGKMLTHASFGGFTPHEVPYRFEAG 
TPNIAGVIGLSAVLKWLEHIDLEE 
AEVYSQGLATMAENKLAQLPGFHSYRCQQSSLLA 
FTFDGVHHSDLVALLAEQGIALRA 
GQHCAQPLMAALGVNGSLRASFAPYNTPQDVEML 
CSALGKALELLRD" 

misc-feature complement(49100..50176) /gene="YPO1028"

/note="Pfam match to entry PF00266
aminotran-5, Aminotransferases
class-V, score 5.20, E-value



misc-feature complement (49568..49627 /gene="YPO1028"



gene 
CDS 



) 

50923. .51840 
50923. .51840 



misc-feature



50944..51357



misc-feature



misc-feature



gene 
CDS 



50983. .51048 



50986. .51078 



52006 . .52401 
52006. .52401 



misc-feature



52018. .52086 



/note="PS00595 Aminotransferases 
class-V pyridoxal-phosphate
attachment site."
/gene="gcvA"

/note="synonym: YPO1029"
/gene="gcvA"

/note="Similar to Escherichia coli
glycine cleavage system
transcriptional activator GcvA
SW:GCVA-ECOLI (P32064) (305 aa)
fasta scores: E(): 0, 88.2% id in
304 aa, and to Vibrio cholerae
transcriptional regulator, LysR
family VC0896 TR:Q9KTK6
(EMBL:AE004173) (306 aa) fasta
scores: E(): 0, 69.3% id in 296
aa"

/codon-start=1
/transl-table=11

/product="glycine cleavage system
transcriptional activator"

/protein-id="CAC89871.1"

/db-xref="GI:15979096"

/db-xref="SPTREMBL:Q8ZH80"

/translation="MSKRLPPLNALRAFDAAARH

LS FTKAAEELFVTQAAVS HQ I KSL 

EDFLGLKLFRRRNRSLLLTEEGQSYYLDIKEIFT 

S INEATRKLQARSAKGALTVSLPP 

SFAIQWLVPRLSGFNAAYPGIDVRIQAVDREEDK 

LADDVDVAIFYGRGNWSGLRTERL 

YAEFLLPVCAPSLLTGENGLKVPSDLANHTLLHD 

TSRRDWLAYTRQLGVPQINVQQGP 

IFSHSAMWQAAVHGQGIALVNNVMAQSEIEAGR 

LVCPFNDVLVSKNAFYLVCHDSQA 

ELGKIAAFRQWILARAASEQEKLRFRYEN" 

/gene="gcvA" 

/note="Pfam match to entry PF00126 
HTH-1, Bacterial regulatory 
helix-turn-helix protein, lysR 
family, score 185.90, E-value 

6.3e-52" 
/gene="gcvA" 

/note="Predicted helix-turn-helix
motif with score 1894 (+5.64 SD)
at aa 21-42, sequence
LSFTKAAEELFVTQAAVSHQIK"
/gene="gcvA"

/note="PS00044 Bacterial
regulatory proteins, lysR family
signature."
/gene="YPO1030"
/gene="YPO1030"

/note="Similar to Escherichia coli
hypothetical protein YgdD
SW:YGDD-ECOLI (P32065) (131 aa)
fasta scores: E(): 0, 71.8% id in
131 aa"

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89872.1"

/db-xref="GI:15979097"

/db-xref="SPTREMBL:Q8ZH79"

/translation="MNSRLMLIFSALSGFFYVAF

GAFGAHVLSTSLGLNEMAWIRTGL 

EYQGFHTLVILILAVAMQRQVSLWFYWSGALLAL 

GTLLFSGSLYCLALSHLRLWIYIT 

PVGGVCFLAGWILMLIGALRLRKRAERHE " 

/gene="YPO1030" 

/note="one of 4 probable 

transmembrane helices predicted 

for YPO1030 by TMHMM2.0"



misc-feature



misc-feature



gene
CDS



52201. .52269 



52297. .52365 



52394. .53500 
52394. .53500 



/note="one of 4 probable 
transmembrane helices predicted 
for YPO1030 by TMHMM2.0" 
/gene="YPO1030" 
/note="one of 4 probable 
transmembrane helices predicted 
for YPO1030 by TMHMM2.0"
/gene="YPO1030"
/note="one of 4 probable
transmembrane helices predicted
for YPO1030 by TMHMM2.0"
/gene="YPO1031"
/gene="YPO1031"

/note="Similar to Escherichia coli
hypothetical protein YgdE
SW:YGDE-ECOLI (P32066) (366 aa)
fasta scores: E(): 0, 78.1% id in
365 aa, and to Haemophilus
influenzae hypothetical protein
HI1195 SW:YGDE-HAEIN (P45100) (363
aa) fasta scores: E(): 0, 60.9% id
in 358 aa"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89873.1"

/db-xref="GI:15979098"

/db-xref="SPTREMBL:Q8ZH78"

/translation="MNNKIALYCRSGFEKECAAE

ITEKAAQLEIFGFARVKENSGYVL 

FECYQLEDADRLIRE I PFREFI FARQMMWGELL 

KDLPPEDRVSPIVGMLVGVIEKAG 

ELRVEVADTNESKELLKFCRKLTVPLRSALREQK 

ILSARENAHRPWHVFFIAPGCCY 

VGYSYSNNNSPFYMGIPRLKFPSDAPSRSTLKLE 

EAFHVF I PADEWEERLASGMHAVD 

LGACPGGWTYQLVQRSMMIQAVDNGLMAQSLMDT 

GQVTHHRADGFKYEPTRSNIYWLV 

CDMVEKPTKVTQLITKWLVNGWCREAIFNLKLPM 

KKRYEEWQNLAMMDEQLKENGIN 

ADIHAKQLYHDREEVTVHVRRIWSGAPGRRDERY 



gene 
CDS 



complement(53751..54506) /gene="xni"



complement (53751 . 



misc-feature complement (53754



/note="synonyms: exo, YPO1032"
54506 /gene="xni"

/EC-number="3.1.11.-"
/note="Similar to Escherichia coli
exodeoxyribonuclease IX Xni or Exo
SW:EX9-ECOLI (P38506) (281 aa)
fasta scores: E(): 0, 71.7% id in
247 aa, and to Vibrio cholerae
exodeoxyribonuclease IX VC0898
TR:Q9KTK4 (EMBL:AE004173) (283 aa)
fasta scores: E(): 0, 53.9% id in
254 aa"

/codon-start=1
/transl-table=11

/product="exodeoxyribonuclease IX"
/protein-id="CAC89874.1"
/db-xref="GI:15979099"
/db-xref="SPTREMBL:Q8ZH77"
/translation="MQIHLLIVDALNLIRRIHAV 
QGSPCVKACQHALQQLIQHSQPSH 
AVAVFDEDDRSDSWRHQCLPDYKAGRSPMPDNLQ 
QEMPLIRQAFNELGVACWHSPGNE 
ADDLAATLWKVAGAGHQVTIVSTDKGYCQLLAP 
NIQIRDYFQKRWLDMPFVKQEFGV 
LPRQLPDYWGLAGI S S SKI PGVAGVGAKTATLLL 
QQADTLEVLYQNLESIPEKWRKKL 
QQHQQMAFTCKQIATLKTDLLLSGNLQQLRLKK" 
.54500 /gene="xni" 



5-3-exonuclease, 5'-3'
exonuclease, score 123.20, E-value 
4.9e-33" 

complement(54766..56130) /gene="YPO1033"

complement(54766..56130) /gene="YPO1033"

/note="Similar to Escherichia coli
hypothetical protein YgdH
SW:YGDH-ECOLI (P37350) (454 aa)
fasta scores: E(): 0, 82.6% id in
454 aa, and to Vibrio cholerae
hypothetical protein VC0899
TR:Q9KTK3 (EMBL:AE004173) (457 aa)
fasta scores: E(): 0, 69.9% id in
455 aa"

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89875.1"
/db-xref="GI:15979100"
/db-xref="SPTREMBL:Q8ZH76"
/translation="MITHVSPLGSiyiDLLSQLEVD 
MLKRTASSDLYRLFRNCSLAVLNS 
GSLTDNSKELLSRNETFDINVLRRERGVKLELVN 
PPEHAFVDGKI IRSLQANLFAVLR 
DILFVNGQIVSASNFQHLNMESSSHLTNLVFSIL 
RNARALHIDEEPNMWCWGGHSIN 
ETE YL YARKVGSQLGLRELNI CTGCGPGAMEAPM 
KGAAVGHAQQHYKKGRFIGMTEPS 
I ITUIEPPNPLVNELVIMPDIEKRLEAFVRIAHGI 
IIFPGGVGTAEELLYLLGILMNPE 
NSEQVLPLILTGPKESADYFKWDEFIMNTLGDG 
ARKYYQIIIDDPDEVARQMKKAMP 
LVKEYRRNTGDAYSFNWS IRIEPDLQHPFEPNHE 
NMANLDLSHNQPPEKLAAALRRAF 
SGIVAGNVKEVGIQAIEAHGPFKLHGDPQLMKQM 
DRLLQDFIAQHRMKLPGSVYTPCY EICS" 
complement(56283..57128) /gene="YPO1034"

complement(56283..57128) /gene="YPO1034"

/note="Similar to Escherichia coli
hypothetical protein YqcD
SW:YQCD-ECOLI (Q46920) (282 aa)
fasta scores: E(): 0, 75.9% id in
282 aa, and to Vibrio cholerae
hypothetical protein VC0902
TR:Q9KTK0 (EMBL:AE004174) (287 aa)
fasta scores: E(): 0, 64.2% id in
282 aa"

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89876.1"

/db-xref="GI:15979101"

/db-xref="SPTREMBL:Q8ZH75"

/translation="MSSYQNHKALAELTLGKPTA

YCDYYDATLLQAVPRSMNREPLGL 

YPDNLPFHGADIWTLYELSWLNSNGLPQVAVGEI 

SLNADSINLIESKSFKLYLNSFNQ 

TI FADKESVRMTLQRDLAACAQGNVSVALYDLDE 

ITGQPISNFNGECLDKQDIRIDSY 

EFNADYLQGAAGKDHVEESLVSHLLKSNCLITHQ 

PDWGSVQIHYRGPQIDHEALLRYL 

VSFRHHNEFHEQCVERIFNDIMRFCQPETLTVYA 

RYTRRGGLDINPWRSNTDFVPLTG RLARQ" 

57266..57811 /gene="syd"

/note="synonyms: ydr, YPO1035"

57266..57811 /gene="syd"

/note="Similar to Escherichia coli
Syd protein SW:SYDP-ECOLI (P43526)
(181 aa) fasta scores: E(): 0,



cholerae Syd protein VC0903
TR:Q9KTJ9 (EMBL:AE004174) (198 aa)
fasta scores: E(): 0, 50.0% id in
176 aa"

/codon-start=1

/transl-table=11

/product="Syd protein"

/protein-id="CAC89877.1"

/db-xref="GI:15979102"

/db-xref="SPTREMBL:Q8ZH74"

/translation="MNISTALRSFTQRYIDLWQQ

QTGHLPASKELYGVPSPCIVETGE 

DQVFWQPQAFLPEATLTNIERALEIQLHPDIHDF 

YTQQYAGDMMADLGNHRFTLLQVW 

SEDDFIRLQENLIGHLVTQKRLKLSPTLFLATTS 

SEMTMASLCNVSGNWLEQFGSDK 

RTLLASTLSHFLDALRPVLPE " 

58074..58355 /gene="csrB"

58074..58355 /gene="csrB"

/note="Similar to Escherichia coli
regulatory RNA csrB, 62% identity
in 283 nt overlap"

58356..59067 /note="IS200-like insertion

sequence: IS1541. Contains a 'G'
at nucleotide position 315"

58501..58959 /gene="tnp"

/note="synonym: YPO1036"

58501..58959 /gene="tnp"

/note="Similar to Salmonella
typhimurium, and Salmonella typhi 
transposase for insertion sequence 
IS200 TnpA SW:T200-SALTY (Q57334) 
(152 aa) fasta scores: E(): 0, 
94.1% id in 152 aa. Identical to 
the Yersinia pseudotuberculosis 
IS1541 element transposase Tnp 
TR:Q9X9F5 (EMBL:AJ238014) (152 aa)
fasta scores: E(): 0, 100.0% id in 
152 aa" 

/codon-start=1
/transl-table=11
/product="transposase for the
IS1541 insertion element"
/protein-id="CAC89878.1"
/db-xref="GI:15979103"
/db-xref="SPTREMBL:Q9X9F5"
/translation="MRDEKSLAHTRWNCKYHIVF
APKYRRQVFYREKRRAIGS ILRKL 
CEWKNVNI LEAECCVDH I HMLLE I PPKMS VSGFM 
GYLKGKSSLMLYEQFGDLKFKYRN 
REFWCRGYYVDTVGKNTARIQEYIKHQLEEDKMG 
EQLS I P YPGS P FTGRK " 
58555..58866 /gene="tnp"

/note="Pfam match to entry PF01797
Transposase-17, Transposase IS200
like, score 236.10, E-value 
5.1e-67" 

59213..59545 /gene="YPO1037"

59213..59545 /gene="YPO1037"

/note="Similar to Escherichia coli
hypothetical protein YqcC
SW:YQCC-ECOLI (Q46919) (109 aa)
fasta scores: E(): 1.3e-15, 41.3%
id in 109 aa, and to the
N-terminal region of Erwinia
carotovora exoenzyme regulation
regulon ORF1 SW:YQCB-ERWCA
(Q47417) (376 aa) fasta scores:
E(): 1.8e-16, 67.1% id in 70 aa.
Note that the Erwinia carotovora
paralogue is fused to the
downstream gene"
/codon-start=1
/transl-table=11

/product="conserved hypothetical



gene 
CDS 



59545. .60318 
59545. .60318 



misc-feature



misc-feature



gene 
CDS 



59572. .60054 



59692. .59736 



60346. .60795 
60346. .60795 



gene 
CDS 



complement (60885. .61274 



/protein-id="CAC89879.1"
/db-xref="GI:15979104"
/db-xref="SPTREMBL:Q8ZH73"
/translation="MSTENKVRQSLQDIEFAMRE
TDLWQTVPPEAEAFESNEPFSIDT 
MAAEQWLQWVFLPRMYALLELNGPFPTRFAITPY 
FEEALSGDGRPDYSALLAQLRCLD - DLLNKESE " 
/gene="YPO1038"
/gene="YPO1038"

/note="Similar to Escherichia coli
hypothetical protein YqcB
SW:YQCB-ECOLI (Q46918) (260 aa)
fasta scores: E(): 0, 69.3% id in
257 aa, and to the C-terminal
region of Erwinia carotovora
exoenzyme regulation regulon ORF1
SW:YQCB-ERWCA (Q47417) (376 aa)
fasta scores: E(): 0, 76.3% id in
257 aa. Note that the Erwinia
carotovora paralogue is fused to
the upstream gene"
/codon-start=1
/transl-table=11

/product="Conserved hypothetical
protein"

/protein-id="CAC89880.1"

/db-xref="GI:15979105"

/db-xref="SPTREMBL:Q8ZH72"

/translation="MLEIIYQDEHIVAVNKPAGW

LVHRSWLDRNETVFVMQTVRDQIG 

QHVYTVHRLDRPTSGVLLMALSSDVARMLSLQFE 

QHQIQKTYHAWRGYVLEGGTVDY 

AMAEELDKIADKFAKSDKAPQPSVSHYEALAQVE 

VPLAIGRYETARYSLVALKPETGR 

KHQLRRHMAHIRHPIIGDSTHGDLRQNRGVAQHF 

GCSRLMLHASHLHLNHPVTGEALT 

LTARWDEPWQGLMSQFGWSGIAPHLERVEFPLTA 

SQDNE" 

/gene="YPO1038" 

/note="Pfam match to entry PF00849
PseudoU-synth-2, RNA
pseudouridylate synthase, score
176.90, E-value 3.4e-49"
/gene="YPO1038"
/note="PS01129 Rlu family of
pseudouridine synthase signature."
/gene="YPO1039"
/gene="YPO1039"

/note="Similar to Escherichia coli
hypothetical protein TR:Q46917
(EMBL:U29581) (149 aa) fasta
scores: E(): 0, 66.2% id in 148
aa, and to Erwinia carotovora
SCRI193 TR:Q47418 (EMBL:X79474)
(151 aa) fasta scores: E(): 0,
68.9% id in 148 aa"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89881.1"

/db-xref="GI:15979106"

/db-xref="SPTREMBL:Q8ZH71"

/translation="MAQVGIFVGTVYGNALLVAE

EAENILKDQGHEVKLFDEGTLEDW 

QHYRQH YVLVITSTTGQGDFPDS I ASLFVAVRDQ 

VGFQPELRYGVIALGDSSYDNFCG 

AGRTFDELLQEQGATRIGERLEIDAIEHSEPEAI 

S APWVEQWGRLLQ " 

/gene="YPO1040"



complement (60885..61274 /gene="YPO1040"



) 



/note="Similar to Escherichia coli
hypothetical protein YaeH



fasta scores: E(): 0, 85.7% id in
126 aa, and to Vibrio cholerae
hypothetical protein VC2264
TR:Q9KPU8 (EMBL:AE004298) (137 aa)
fasta scores: E(): 1.9e-27, 64.3%
id in 126 aa"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89882.1"
/db-xref="GI:15979107"
/db-xref="SPTREMBL:Q8ZH70"
/translation="MYDNLKSLGITQPEDVDRYS
LRQEANND I LKI YFRKDKGEFFAK 
SVKFKYPRQRKTWSDNASHGYKEINEINPNLRY 
VIDELDQLCKRDQIEVDLKRKILD 
DLRHLESWTNKIAEIEADLEKLTNGR" 
complement(61398..62222) /gene="dapD"

/note="synonym: YPO1041"
complement(61398..62222) /gene="dapD"

/EC-number="2.3.1.117"
/note="Similar to Escherichia coli
2,3,4,5-tetrahydropyridine-2-carbo
xylate N-succinyltransferase dapD
SW:DAPD-ECOLI (P03948) (274 aa)
fasta scores: E(): 0, 92.3% id in
274 aa, and to Mycobacterium bovis
2,3,4,5-tetrahydropyridine-2-carbo
xylate N-succinyltransferase dapD
SW:DAPD-MYCBO (P56220) (274 aa)
fasta scores: E(): 0, 93.8% id in
274 aa"

/codon-start=1
/transl-table=11

/product="2,3,4,5-tetrahydropyridi
ne-2-carboxylate
N-succinyltransferase"
/protein-id="CAC89883.1"
/db-xref="GI:15979108"
/db-xref="SPTREMBL:Q8ZH69"
/translation="MQQLQNVIETAFERRADITP
ANVDTVTREAITHVIDLLDTGALR 
VAEKIDGQWVTHQWLKKAVLLSFRINDNQVMEGA 
ETRYYDKVPMKFAGYDEARFQREG 
FRWPPATVRKGAFIARNTVLMPSYVNIGAFVDE 
GTMVDTWATVGSCAQIGKNVHLSG 
GVGIGGVLEPLQANPTI lEDNCFVGARSEWEGV 
IVEEGSVISMGVFIGQSTRIYDRE 
TGEVHYGRVPAGSVWSGNLPSKDGSYSLYCAVI 
VKKVDAKTRSKVGINELLRTID " 
complement(61575..61628) /gene="dapD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
11.30, E-value 23"
complement(61629..61682) /gene="dapD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
10.60, E-value 28"
complement(61725..61778) /gene="dapD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
13.30, E-value 6"
complement(61737..61823) /gene="dapD"

/note="PS00101 Hexapeptide-repeat
containing-transferases



misc-feature



complement (61779 . 



misc-feature complement (61848 



gene 
CDS 



complement (62397.
complement (62397.



misc-feature complement (62403



misc-feature complement (63267 . 



misc-feature 



complement (64566.



61832 /gene="dapD" 

/note="Pfam match to entry PF00132 
hexapep, Bacterial transferase
hexapeptide (four repeats), score
5.50, E-value 1.4e+02"
61901 /gene="dapD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
3.20, E-value 2.8e+02"
.65135 /gene="glnD"

/note="synonym: YPO1042"
.65135 /gene="glnD"

/EC-number="2.7.7.59"
/note="Similar to Escherichia coli
[protein-PII] uridylyltransferase
GlnD SW:GLND-ECOLI (P27249) (890
aa) fasta scores: E(): 0, 76.5% id
in 880 aa"
/codon-start=1
/transl-table=11
/product="[protein-PII]
uridylyltransferase"
/protein-id="CAC89884.1"
/db-xref="GI:15979109"
/db-xref="SWISS-PROT:Q8ZH68"
/translation="MTSRHRRLFYGLRSSSGKHM
SDNHTEHSLSLTLTPTISEQPALP 
STYLDSDIHCPILKQRLDAFQRWQAEAFNSGTSA 
EVLIAARSDYIDHLLQRLWTFYGF 
DKVPETALVAVGG YGRGELHPLSD IDVLVLS KQR 
LNDEQAQRVGQLITLLWDLKLEVG 
HSVRTLEECLLEGLADLT lATNMIESRLI CGDVA 
LFLQMQKHIFSDSFWPSPQFFHAK 
WEQQERHKRYHGTS YNLEPD I KS S PGGLRD IHT 
LLWVARRHFGATSLSEMVDFGFLT 
NAERNELNESQSFLWRIRFALHLVLTRYDNRLLF 
DRQLSVAQLLRYEGEGNEPVEHMM 
KDFYRMTRRVSELNNMLLQLFDEAILALDANEKP 
RPLDEEFQLRGDLIDLRDENLFVR 
QPEAIMRMFYLMVRNQDI KGI YSTTVRRLRHARR 
HLKAPLCHIPEARKLFMAILRHPG 
AVSRALLPMHRHS VLWAYMPQWGS I VGQMQFDLF 
HAYTVDEHTIRVLLKIESFADEDT 
RPRHPLCVELYPRLPQPELLLLAALFHDIAKGRG 
GDHS ILGAHDAVEFAEQHGLNSRE 
SQLVAWLVRCHLLMSVTAQRRDIQDPAVIQQFSA 
EVQSETRLRYLVSLTVAD I CATNE 
NLWNSWKQSLLRELYFATEKQLRRGMQNSPDLRE 
RVRHHRLQALALLRMDNIDEEALH 
RIWSRCRADYFLRHSPNQLAWHARHLLEHDSTKP 
LVLVSRQATRGGTEIFIWSPDRPS 
LFAAWGELDRRNL S VHDAQ I FTNRDGMAMDT F I 
VLE PDGS PLAQDRHP 1 1 SHALQQA 
INRSDYQHPPRVRRLSPKLRHFSVPTEANFLPTH 
NERRTYLELIALDQPGLLARVGKI 
FADLGLSLHSARITTIGERVEDLFVLADKDRRAL 
SLETRRELAQRLADTLNPNDKL " 
.62627 /gene="glnD" 

/note="Pfam match to entry PF01842 
ACT, ACT domain, score 43.20, 
E-value 6e-09" 
.63671 /gene="glnD" 

/note="Pfam match to entry PF01966 
HD, HD domain, score 75.50, 
E-value 1.1e-18"
.64883 /gene="glnD" 

/note="Pfam match to entry PF01909 



gene 



CDS 



complement (65138 
complement (65138 



Nucleotidyltransferase domain, 
score 39.90, E-value 5.8e-08" 
65929 /gene="ampM" 

/note="synonyms: b0168, map,
YPO1043"
.65929 /gene="ampM"



/EC-number="3.4.11.18"
/note="Similar to Escherichia coli
methionine aminopeptidase
SW:AMPM-ECOLI (P07906) (264 aa)
fasta scores: E(): 0, 84.0% id in
263 aa"

/codon-start=1

/transl-table=11

/product="methionine
aminopeptidase"

/protein-id="CAC89885.1"

/db-xref="GI:15979110"

/db-xref="SPTREMBL:Q8ZH67"

/translation="MAISIKTPEDIQKMRVAGRL

AAEVLEIIEPYVKPGVTTGELDRI 

CHDHITNHQQAISACLGYHGFPKSVCISVNEVIC 

HGIPSDEKVLKEGDIVNIDVTVIK 

DGFHGDTSKMFIVGKPTILGERLCRVTQESLYLA 

IKMVKPGIRLRTLGKAIQKFVEAE 

NFSWREYCGHGIGEGFHEEPQVLHYDADDGGW 

LQAGMAFTIEPMVNAGDYRIRTMK 

DGWTVKTKDRNLSAQYEHTIWTDNGCEIMTLRK 

DDTIPNIITHE" 

misc-feature complement(65192..65920) /gene="ampM"



misc-feature complement (65372 



gene 
CDS 



66364. .67089 
66364. .67089 



misc-feature 
misc-feature 

misc-feature 



66379. .66414 
66388. .67038 

66835. .66909 



/note="Pfam match to entry PF00557 
Peptidase-M24, metallopeptidase
family M24, score 336.80, E-value
2.4e-97"
65428 /gene="ampM"

/note="PS00680 Methionine
aminopeptidase subfamily 1
signature."

/gene="rpsB"

/note="synonym: YPO1044"
/gene="rpsB"

/note="Similar to Escherichia coli
30S ribosomal protein S2 RpsB
SW:RS2-ECOLI (P02351) (240 aa)
fasta scores: E(): 0, 93.8% id in
240 aa"

/codon-start=1

/transl-table=11

/product="30S ribosomal protein
S2"

/protein-id="CAC89886.1"

/db-xref="GI:15979111"

/db-xref="SWISS-PROT:Q8ZH66"

/translation="MATVSMRDMLQAGVHFGHQT

RYWNPKMKPFIFGARNKVHIINLE 

KTVPMFNEALAELTKISSRKGKILFVGTKRAASE 

AVKEAANNCDQFFVNHRWLGGMLT 

NWKTVRQSIKRLKDLEIQSQDGTFDKLTKKEALM 

RTRELNKLENSLGGIKDMGGLPDA 

LFWDADHEHIAIKEANNLGIPVFSIVDTNSDPD 

GVDFII PGNDDAIRAVKLYLGAVA 

TAVREGRSQDLAVQAEESFVEAE" 

/gene="rpsB" 

/note="PS00962 Ribosomal protein 
S2 signature 1."

/gene="rpsB"

/note="Pfam match to entry PF00318
Ribosomal-S2, Ribosomal protein
S2, score 435.90, E-value
3.5e-127"
/gene="rpsB"



gene 
CDS 



67217. .68074 
67217. .68074 



misc-feature 67223..67342



misc-feature 67385..68011



misc-feature 67445..67477



gene 
CDS 



68283. .69008 
68283. .69008 



misc-feature 68310..68861



gene 
CDS 



69144. .69701 



69144. .69701 



S2 signature 2."
/gene="tsf"

/note="synonym: YPO1045"
/gene="tsf"

/note="Similar to Escherichia coli
elongation factor Ts Tsf
SW:EFTS-ECOLI (P02997) (282 aa)
fasta scores: E(): 0, 78.4% id in
282 aa"

/codon-start=1

/transl-table=11

/product="elongation factor Ts"

/protein-id="CAC89887.1"

/db-xref="GI:15979112"

/db-xref="SWISS-PROT:Q8ZH65"

/translation="MVAITAALVKELRERTAAGM

MECKKALVEANGDIELAIDNMRKS 

GQAKAAKKAGRIAAEGI ILARVSADGKYGVILEL 

NCETDFVAKDAGFKAFGEEVINAA 

LAEKIADIDVLKAKFEEQRANLVAKIGENINIRR 

VAVLEGDILGTYLHGARIGVMVAA 

TGADEELVKHIAMHIAASKPEYVKPDDVPAEWA 

REHQIQLDIAIESGKPREIAEKMV 

EGRMRKFTGEVSLTGQNFVMDPSKTVGDLLKENN 

ADWNFIRFEVGEGIEKVETDFAA 

EVAAMSKQS" 

/gene="tsf"

/note="Pfam match to entry PF02094
TS-N, TS-N domain, score 71.00,
E-value 2.6e-17"
/gene="tsf"

/note="Pfam match to entry PF00889
EF-TS, Elongation factor TS, score
356.30, E-value 3.2e-103"
/gene="tsf"

/note="PS01127 Elongation factor
Ts signature 2."

/gene="pyrH"

/note="synonyms: smbA, YPO1046"
/gene="pyrH"
/EC-number="2.7.4.-"
/note="Similar to Escherichia coli
uridylate kinase PyrH
SW:PYRH-ECOLI (P29464) (240 aa)
fasta scores: E(): 0, 90.8% id in
240 aa"

/codon-start=1

/transl-table=11

/product="uridylate kinase"

/protein-id="CAC89888.1"

/db-xref="GI:15979113"

/db-xref="SWISS-PROT:Q8ZH64"

/translation="MATNAKPVYQRILLKLSGEA

LQGAEGFGIDASVLDRMAQEVKEL 

VELGIQVGWIGGGNLFRGAGLAQAGMNRWGDH 

MGMLATVMNGLAMRDALHRAYVNA 

RLMSAI PLNGVCDNYSWAEAI SLLRHNRWI FAA 

GTGNPFFTTDSAACLRGI E I EADV 

VLKATKVDGVYSADPVKNPDATLYEQLTYQDVLE 

QELKVMDLAAFTLARDHNLPIRVF 

NMNKPGALRRWMGENEGTLIAK" 

/gene="pyrH" 

/note="Pfam match to entry PF00696 
aakinase, Amino acid kinase 
family, score 148.00, E-value 
1.7e-40" 
/gene="frr"

/note="synonyms: b0172, rrf,
YPO1047"

/gene="frr"

/note="Similar to Escherichia coli
ribosome recycling factor
SW:RRF-ECOLI (P16174) (185 aa)
fasta scores: E(): 0, 80.5% id in
185 aa"



/transl-table=11
/product="ribosome recycling
factor"

/protein-id="CAC89889.1"

/db-xref="GI:15979114"

/db-xref="SWISS-PROT:Q8ZH63"

/translation="MINEIRKDAEVRMEKCLEAF

QNHISKIRTGRASPSILDGIQVEY 

YGTATPLRQLANIWEDSRTLALTVFDRSLSAAV 

EKAIMTSDLGLNPSSAGTVIRVPL 

PALTEERRKDLIKWRAEAEQGRVSIRNVRRDAN 

DKVKALLKDKEISEDEDRRSQDDV 

QKLTDAYIKKVDAALAVKEAELMDF " 

/gene="frr"

/note="Pfam match to entry PF01765
RRF, Ribosome recycling factor,
score 336.80, E-value 2.4e-97"
/gene="dxr"

/note="synonyms: b0173, YPO1048"
/gene="dxr"
/EC-number="1.1.1.-"
/note="Similar to Escherichia coli
1-deoxy-D-xylulose 5-phosphate
reductoisomerase SW:DXR-ECOLI
(P45568) (398 aa) fasta scores:
E(): 0, 72.2% id in 396 aa"
/codon-start=1
/transl-table=11
/product="1-deoxy-D-xylulose
5-phosphate reductoisomerase"
/protein-id="CAC89890.1"
/db-xref="GI:15979115"
/db-xref="SWISS-PROT:Q8ZH62"
/translation="MKQLTILGSTGSIGNSTLSV
VRANPELFKVTALVAGRNVREMAQ 
QCLEFSPRYAAMSDEHSAKSLRLLLAEQGSDTEV 
YSGETAACELAALDDVDQVMAAIV 
GIAGLPSTLAAIRAGKQVLLANKESLITCGKLFM 
DEVKRSRAQLLPIDSEHNAIFQSL 
PERIQRQLGYSSLNENGVSRIILTGSGGPFRETP 
LSQFSDVTPDQACAHPNWSMGRKI 
SVDSATMMNKGLEYIEARWLFNASAEQIEWLHP 
QSVIHSMVRYHDGS ILAQMGTPDM 
RTPIAHAMAYPMRVSSGVAPLDFCKVGALTFTTP 
DYQRYPCLKLAIDACNAGQAATTA 
LNAANEISVMAFLDSKIRFTDIEVINRTWEGLL 
LSEPTSVEEVLVIDRKARDVAAQV lAKLNN" 
/gene="uppS" 

/note="synonyms: rth, YPO1049"

/gene="uppS"

/EC-number="2.5.1.31"

/note="Similar to Escherichia coli
undecaprenyl pyrophosphate
synthetase SW:UPPS-ECOLI (Q47675)
(253 aa) fasta scores: E(): 0,
78.8% id in 245 aa"

/codon-start=1

/transl-table=11

/product="undecaprenyl
pyrophosphate synthetase"

/protein-id="CAC89891.1"

/db-xref="GI:15979116"

/db-xref="SPTREMBL:Q8ZH61"

/translation="MSPVKEDRANLSPRSPRHVA 

I IMDGNGRWAKNKGKLRVFGHKAG 

VKS VRRAVS FAAKHNLDALTL YAFS S ENWNRPDQ 

EVTALMELFVRALDSEVKSLHKHN 

VRLSIIGDISRFSGRLQERIRRSEKLTANNDGLK 

LNIAANYGGRWDIIQGVRHLAEQV 

QKGELQPTDISEESLNSYICLHEQSQVDLVIRTG 

GEHRISNFLLWQIAYAELYFTDVL 

WPDFDENVFEGALNAFAQRERRFGGTTPIDATAS 

/gene="uppS" 

/note="Pfam match to entry PF01255 



misc-feature 71896..71949
72103..72951
72103..72951



misc-feature 72106..72945



misc-feature 72121..72225



misc-feature 72268..72324



misc-feature 72343..72411



misc-feature 72454..72513



misc-feature 72550..72618



misc-feature 72676..72735



misc-feature 



gene 
CDS 



72817. .72897 



72980. .74335 
72980. .74335 



undecaprenyl diphosphate synthase, 
score 454.70, E-value 8e-133" 
/gene="uppS" 

/note="PS01066 Undecaprenyl 
pyrophosphate synthetase family 
signature."
/gene="cdsA"

/note="synonyms: cds, YPO1050"

/gene="cdsA"

/EC-number="2.7.7.41"

/note="Similar to Escherichia coli
phosphatidate cytidylyltransferase
cdsA or cds SW:CDSA-ECOLI (P06466)
(249 aa) fasta scores: E(): 0,
75.2% id in 246 aa"

/codon-start=1

/transl-table=11

/product="phosphatidate
cytidylyltransferase"

/protein-id="CAC89892.1"

/db-xref="GI:15979117"

/db-xref="SPTREMBL:Q8ZH60"

/translation="MLKYRLITALILIPWIGAL 

FLLPPVGFAIVTLWCMLAAWEWG 

QLAGFASRTQRIWLAILCGFLLVAMLLSLPEYQH 

SPHHLLVSTPLWLSMGWWVAALML 

VLTYPRSAVSWRNSRLLRIIFGILTIIPFFWGMF 

ALRQYGYEQNHNTGAWWLLYVMLL 

VWGADSGAYMFGKLFGKHKLAPKVSPGKTWEGLI 

GGLLTSALISLLFGRYAPLDIVPE 

KLLICSWAALASVLGDLTESMFKREAGIKDSGH 

LIPGHGGILDRIDSLTAAVPVFAC LMLLVF" 

/gene="cdsA" 

/note="Pfam match to entry PF01148 
Cytidylyltrans, Phosphatidate
cytidylyltransferase, score
155.80, E-value 7.5e-43"
/gene="cdsA"

/note="one of 6 probable
transmembrane helices predicted
for YPO1050 by TMHMM2.0"
/gene="cdsA"

/note="one of 6 probable
transmembrane helices predicted
for YPO1050 by TMHMM2.0"

/gene="cdsA"

/note="one of 6 probable
transmembrane helices predicted
for YPO1050 by TMHMM2.0"
/gene="cdsA"

/note="one of 6 probable
transmembrane helices predicted
for YPO1050 by TMHMM2.0"

/gene="cdsA"

/note="one of 6 probable
transmembrane helices predicted
for YPO1050 by TMHMM2.0"
/gene="cdsA"

/note="one of 6 probable
transmembrane helices predicted
for YPO1050 by TMHMM2.0"
/gene="cdsA"

/note="PS01315 Phosphatidate
cytidylyltransferase signature."
/gene="YPO1051" 
/gene="YPO1051" 

/note="Similar to Escherichia coli
hypothetical 49.1 kDa protein in
cdsA-hlpA intergenic region
SW:YAEL-ECOLI (P37764) (450 aa)
fasta scores: E(): 0, 77.6% id in
451 aa, and to Neisseria
meningitidis putative integral
membrane protein NMA0084 TR:Q9JX32
(EMBL:AL162752) (446 aa) fasta



misc-feature 72992.. 73060 



misc-feature 73034 . . 73063 



misc-feature 73271. .73339 



misc-feature 73574 . . 73816 



misc-feature 74108. .74176 



misc-feature 74258 . . 74317 



gene 
74372. .76759
74372. .76759 



aa" 

/codon-start=1

/transl-table=11

/product="putative membrane

protein"

/protein-id="CAC89893.1"

/db-xref="GI:15979118"

/db-xref="SWISS-PROT:Q8ZH59"

/translation="MMSILWSLAAFIVALGILIT

VHEFGHFWVARRCGVRVERFSIGF

GKALWRRTDRQGTEYVIALIPLGGYVKMLDERVE

AVAPELRHQSFNNKTVLQRAAIVS

AGPIANFLFAIVAYWLVFIIGVPSVRPVIGDISP

QSIAAQANISSGMELKSVDGIETP

DWDSVRLALISRIGDKQMQVGVAPFGSDNWEKT

LDLRQWQFEPDKQDPWALGIIPR

GPQIESVLAEVQPGSAAQKAGLQAGDRIVKVNGQ

LLDRWQTFVLQVRDNPGQPLVLDI

ERESTPLSLTLIPDTKSVGENRSEGFAGWPKVI

PLPDEYKTIRQYGPFTAVYQAGDK

TWQLMRLTVSMLGKLITGDVKLNNLSGPISIAQG

AGLSAEYGLVYYLMFLALISVNLG

IINLFPLPVLDGGHLLFLAIEKLKGGPVSERVQD

FSYRIGSILLVLLMGLALFNDFSRL"

/gene="YPO1051"

/note="one of 4 probable

transmembrane helices predicted

for YPO1051 by TMHMM2.0"

/gene="YPO1051"

/note="PS00142 Neutral zinc

metallopeptidases, zinc-binding

region signature."

/gene="YPO1051"

/note="one of 4 probable

transmembrane helices predicted

for YPO1051 by TMHMM2.0"

/gene="YPO1051"

/note="Pfam match to entry PF00595
PDZ, PDZ domain (Also known as DHR
or GLGF)., score 46.40, E-value
6.2e-10"
/gene="YPO1051"
/note="one of 4 probable
transmembrane helices predicted
for YPO1051 by TMHMM2.0"
/gene="YPO1051"
/note="one of 4 probable
transmembrane helices predicted
for YPO1051 by TMHMM2.0 one of 4
probable transmembrane helices
predicted for YPO1051 by TMHMM2.0"
/gene="YPO1052" 
/gene="YPO1052" 

/note="Similar to Photorhabdus
luminescens outer membrane antigen
Oma TR:Q9S341 (EMBL:AJ236920) (797
aa) fasta scores: E(): 0, 80.4% id
in 797 aa, and to Haemophilus
influenzae protective surface
antigen D15 precursor
SW:D151-HAEIN (P46024) (797 aa)
fasta scores: E(): 0, 44.3% id in
804 aa"

/codon-start=1
/transl-table=11
/product="putative surface
antigen"

/protein-id="CAC89894.1"
/db-xref="GI:15979119"

/db-xref="SPTREMBL:Q8ZH58"

/translation="MAMKKLLIASLLFGSATVYG

ADGFWNDIHFEGLQRVAVGAALL

NMPVRVGDTVSDDDIGKTIRALFATGNFEDVRVL 

RDGNTLIVQVKERPTIASITFSGN 

KAVKEDMLKQNLEASGVRVGEALDRTTISNIEKG 



NRVDLKLVFTEGVSAKIQQINIVGNHSFTTDELI 

SRFQLRDEVPWWNWGDRKYQKQK 

LAGDLETLRSFYLDRGYARFNIDSTQVSLTPDKK 

GIYVTINITEGPQFKLNSVIVSGN 

LAGHQSEAEKLTKIEPGELFNGSKVTRMEDDIKK 

MLGRYGYAYPRWTQPEINDDDKT 

VKLHINVDAGNRFYVRHIRFEGNDTSKDSVLRRE

MRQMEGAWLGNDQVEAGKERLNRL 

GYFETVDVETQRVPGAADLVDVTYKVKERNTGSL 

NFGIGYGTESGVSFQVGVQQDNWL 

GTGNTVGINGTKNDYQTYAEFTLMDPYFTVDGVS 

LGGRIFYNDFKADNADLSGYTNSS

YGADGTLGFPINENNSLRVGVGYVHNDLSDMLPQ

VAMWRYLESVGERPGYDGREGFTT 

DDFTLNLGWTYNNLDRGFFPTSGVKSSVNTKITV 

PGSDNEFYKVTFDTSAYQPLNEDR 

SWVLLGRGRLGYGDGIGSKEMPFYENFYAGGSST 

VRGFRSNNIGPKAAYYANGGATVT 

NSTDAVGGNAMAVASIELITPTPFISEKYSNSVR

TSIFIDSGTVWDTNWENTAKTRAA

GIPDYGKASNIRVSAGVALQWMSPLGPLVFSYAK

PVKDYEGDKSEQFQFNIGKTW"

/gene="YPO1052" 

/note="Pfam match to entry PF01103 
Bac-surface-Ag, Bacterial surface 
antigen, score 1444.20, E-value 0"
/gene="YPO1052"

/note="1 probable transmembrane
helix predicted for YPO1052 by
TMHMM2.0"
/gene="ompH" 

/note="synonym: YPO1053"
/gene="ompH"

/note="Similar to Yersinia
enterocolitica cationic 19 kDa
outer membrane protein precursor
OmpH SW:OMPH-YEREN (P31519) (164
aa) fasta scores: E(): 0, 93.3% id
in 165 aa, and to Escherichia coli
histone-like protein Hlp-1
precursor SW:HLPA-ECOLI (P11457)
(161 aa) fasta scores: E():
1.3e-30, 67.1% id in 167 aa"
/codon-start=1
/transl-table=11
/product="cationic 19 kDa outer
membrane protein precursor"
/protein-id="CAC89895.1"
/db-xref="GI:15979120"
/db-xref="SWISS-PROT:P58607"
/translation="MKKWLCAASLGLALAASASV
QAADKIAIVNVSSIFQQLPAREAV
AKQLENEFKGRATELQGMERDLQTKMQKLQRDGS
TMKASDRTKLENEVMKQRETFSTK
AQAFEQDNRRRQAEERNKILSRIQDAVKSVATKG
GYDWIDANAVAYADSSKDITADVLKQVK"
/gene="lpxD" 

/note="synonyms: firA, omsA,

YPO1054"

/gene="lpxD"

/EC-number="2.3.1.-"

/note="Similar to Escherichia coli

UDP-3-O-[3-hydroxymyristoyl]

glucosamine N-acyltransferase

SW:LPXD-ECOLI (P21645) (340 aa)

fasta scores: E(): 0, 82.6% id in

339 aa, and to Yersinia

enterocolitica

UDP-3-O-[3-hydroxymyristoyl]

glucosamine N-acyltransferase

SW:LPXD-YEREN (P32203) (339 aa)

fasta scores: E(): 0, 95.3% id in

339 aa"

/codon-start=1
/transl-table=11



misc-feature 77715..77801



misc-feature 77742..77795



misc-feature 77796..77849



misc-feature 77805..77891



misc-feature 77850..77903



misc-feature 77904..77957



misc-feature 78015..78068



misc-feature 78081..78134



misc-feature 78090..78176



misc-feature 78135..78188



misc-f eature 



gene 
CDS 



78207. .78260 

78598. .79143 
78598 . .79143 



oyl] glucosamine 

N-acyltransferase"

/protein-id="CAC89896.1"

/db-xref="GI:15979121"

/db-xref="SWISS-PROT:P58611"

/translation="MPSIRLADLAQQLDAQVHGD

GDLVITGIASMHSAQPEQITFLSN 

SRYREQIiASCNAGAWLTEADLPFCKVAALWEN 

PYFTYARMAQIMDTTPQPAQDIAP 

SAVISPQATLGEGVSVGANAVIESGWLGDNWI

GAGCFIGKNTHIGAGSRLWANVSI

YHEWIGQNCLIQSGTVIGADGFGYANDRGNWVK

IPQLGSVHIGDRVEIGACTTIDRG

ALDNTIIGNGVIIDNQCQIAHNWIGDNTAVAGG

VIMAGSLKVGRYCMIGGASVINGH

MEICDKVTITGMGMVMRPITEPGLYSSGIPLQPN

KMWRKTAALVMNIDGINKRLKAVERKIDKE"

/gene="lpxD"

/note="PS00101 Hexapeptide-repeat
containing-transferases
signature."
/gene="lpxD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
13.90, E-value 3.9"
/gene="lpxD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
22.20, E-value 0.012"
/gene="lpxD"

/note="PS00101 Hexapeptide-repeat
containing-transferases
signature."
/gene="lpxD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
16.70, E-value 0.54"
/gene="lpxD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
18.50, E-value 0.15"
/gene="lpxD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
13.90, E-value 3.8"
/gene="lpxD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
19.10, E-value 0.1"
/gene="lpxD"

/note="PS00101 Hexapeptide-repeat
containing-transferases
signature."
/gene="lpxD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
12.90, E-value 7.7"
/gene="lpxD"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
6.50, E-value 1e+02"

/gene="fabZ" 

/note="synonyms: sefA, YPO1055"

/gene="fabZ"

/EC-number="4.2.1.-"

/note="Similar to Escherichia coli

FabZ SW:FABZ-ECOLI (P21774) (151 
79147. .79935
79147. .79935 
in 151 aa, and to Yersinia
enterocolitica FabZ SW:FABZ-YEREN
(P32205) (201 aa) fasta scores:
E(): 0, 90.1% id in 181 aa"
/codon-start=1
/transl-table=11

/product="(3R)-hydroxymyristoyl-(a

cyl carrier protein) dehydratase

(EC 4.2.1.-)"

/protein-id="CAC89897.1"

/db-xref="GI:15979122"

/db-xref="SWISS-PROT:Q8ZH57"

/translation="MTTDTHTLHIEEILDLLPHR 

FPFLLVDRVLDFEEGKFLRAVKNV 

SFNEPFFQGHFPGKPIFPGVLILEAMAQATGILA

FKSRGKLEPGELYYFAGIDEARFK 

RPWPGDQMIMEVEFVKERRGLTRFTGVAKVDGE 

IVCTATMMCARSKPAAPAESVWK 

PDWKPDWKPDWNPWKES"

/gene="fabZ" 

/note="Pfam match to entry PF01377 
Thioester-dehyd, Thioester 
dehydrase, score 286.40, E-value
3.5e-82"
/gene="lpxA" 

/note="synonym: YPO1056"

/gene="lpxA"

/EC-number="2.3.1.129"

/note="Similar to Escherichia coli

acyl-[acyl-carrier-protein]--UDP-N

-acetylglucosamine

O-acyltransferase LpxA

SW:LPXA-ECOLI (P10440) (262 aa)

fasta scores: E(): 0, 81.3% id in

262 aa, and to Yersinia

enterocolitica

acyl-[acyl-carrier-protein]--UDP-N
-acetylglucosamine
O-acyltransferase LpxA
SW:LPXA-YEREN (P32201) (262 aa)
fasta scores: E(): 0, 90.1% id in
262 aa"

/codon-start=1
/transl-table=11

/product="acyl-[acyl-carrier-prote

in]--UDP-N-acetylglucosamine

O-acyltransferase"

/protein-id="CAC89898.1"

/db-xref="GI:15979123"

/db-xref="SWISS-PROT:Q8ZH56"

/translation="MIDKTAFIHPSSIVEEGAII

GAGVYIGPFCIVGSQVEIGAGTEL 

KSHVWNGITKIGCDNQIYQFASIGEANQDLKYA

GEPTRVEVGDRNRIRESVTIHRGT

TQGGGVTKVGCDNLLMVNTHVAHDCVIGNRCILA

NNAALGGHVEIDDYAIIGGMTAIH

QFCVIGAHVMVGGCSGITQDVPPFVIAQGNIIATP 

FGINIEGLKRRGFDKESLHAIRSA 

YKLLYRSGRTLDEVKPEIAELAEQYPWKAFNDF 

FARSTRGIIR" 

/gene="lpxA"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
21.70, E-value 0.017"
/gene="lpxA"

/note="PS00101 Hexapeptide-repeat
containing-transferases
signature."

/gene="lpxA"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
11.10, E-value 24"
/gene="lpxA" 



misc-feature 79393..79446



misc-feature 79468..79521



misc-feature 79522..79575



misc-feature 79594..79647



misc-feature 79603..79689



gene 
CDS 



79939. .81123 
79939. .81123 



gene 
CDS 



81120. .81716 
81120. .81716 



hexapep, Bacterial transferase 
hexapeptide (four repeats), score
8.50, E-value 54"
/gene="lpxA"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
7.90, E-value 65"
/gene="lpxA"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
0.80, E-value 5.9e+02"
/gene="lpxA"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
14.40, E-value 2.7"
/gene="lpxA"

/note="Pfam match to entry PF00132
hexapep, Bacterial transferase
hexapeptide (four repeats), score
17.80, E-value 0.26"
/gene="lpxA"

/note="PS00101 Hexapeptide-repeat
containing-transferases
signature."
/gene="lpxB"

/note="synonyms: pgsB, YPO1057"
/gene="lpxB"
/EC-number="2.4.1.182"
/note="Similar to Escherichia coli
lipid-A-disaccharide synthase LpxB
SW:LPXB-ECOLI (P10441) (382 aa)
fasta scores: E(): 0, 82.2% id in
377 aa"

/codon-start=1

/transl-table=11

/product="lipid-A-disaccharide

synthase"

/protein-id="CAC89899.1"

/db-xref="GI:15979124"

/db-xref="SWISS-PROT:Q8ZH55"

/translation="MQNSPLTADCSLNAGRPLTI

GLVAGETSGDILGAGLIRALKVQV

PNARFVGVAGPLMQAEGCEAWYEMEELAVMGWE 

VLERLPRLLKIRKDLTQRFSELSP 

DVFVGIDAPDFNITLEGRLKQRGIRTIHYVSPSV 

WAWRQKRVFKIGKATDMVLAFLPF 

EKAFYDRFNVPCRFIGHTMADAMPLVPDQQAARA 

ELGIAPNATCLALLPGSRHSEVEM 

LSADFLRTAVILRDKLPNLEWVPLVNSKRREQF 

ERIKAEIAPDLSVHLLDGKARVAM 

lASDAALLASGTAALECMLAKCPMWGYRMKPFT 

FWLAERLVKTPYVSLPNLLAGEEL 

VTELLQQECQPQKLAGALLPLLQGGSEIAALKER 

FLVLHQSIRCGADEQAAQAVLELADR"

/gene="rnhB"

/note="synonyms: b0183, YPO1058"

/gene="rnhB"

/EC-number="3.1.26.4"

/note="Similar to Escherichia coli

ribonuclease HII SW:RNH2-ECOLI

(P10442) (198 aa) fasta scores:

E(): 0, 84.5% id in 193 aa"

/codon-start=1

/transl-table=11

/product="ribonuclease HII"

/protein-id="CAC89900.1"

/db-xref="GI:15979125"

/db-xref="SPTREMBL:Q8ZH54"

/translation="MSETFIYPQANLIAGVDEVG

RGPLVGAWTAAVILDPNRPIVGL 

ADSKKLSEKRRLSLYDEITEKALSWSLGRAEPEE 

IDQLNILHATMLAMQRAVSGLHIV 



misc-feature

gene 
CDS 



81159. .81692 

81846. .85361 
81846 . . 85361 



misc-feature 81897..82094



misc-feature 84873..85097



gene 



85374. .86333 



SILAKVTRDREMTELDLLFPEYGF

AQHKGYPTAFHLEKLAALGATVHHRRSFGPVKRV 

LGLV" 

/gene="rnhB"

/note="Pfam match to entry PF01351
RNase-HII, Ribonuclease HII, score
305.80, E-value 5.1e-88"
/gene="dnaE" 

/note="synonyms: polC, YPO1059"

/gene="dnaE"

/EC-number="2.7.7.7"

/note="Similar to Escherichia coli

DNA polymerase III, alpha chain

DnaE SW:DP3A-ECOLI (P10443) (1160

aa) fasta scores: E(): 0, 89.0% id

in 1160 aa"

/codon-start=1

/transl-table=11

/product="DNA polymerase III,

alpha chain"

/protein-id="CAC89901.1"
/db-xref="GI:15979126"
/db-xref="SPTREMBL:Q8ZH53"
/translation="MIQSTHFWYLDMAEPRFVHL
RVHSDYSMIDGLAKIGPLVKRAAA 
LGMPALAITDFTNLCGLVKFYGSAHGAGIKPIIG 
ADFYVQSEILGDELAHLTVLARNN 
EGYQNLTLLISEAYQRGYGAAGPIIDRDWLIKHK
EGLILLSGGRMGDVGKFLLRGNQV 
QVDQCLAFYQEHFPDCYYLELIRTGRPDEENYLH 
AAVALATERGLPWATNDVRFIDE 
SDFDAHEIRVAIHDGFTLVDPKRPKNYSPQQFMR 
DEEQMCELFADIPEALINSVEIAK
RCNVTIRLGEYFLPQFPTGEMSTEDFLVEKAKQG 
LEERLEFLFPDPEVRLQKRPEYDE 
RLDIELKVINQMGFPGYFLIVMEFIQWSKDNGVP 
VGPGRGSGAGSLVAYALKITDIDP 
LEFDLLFERFLNPERVSMPDFDVDFCMEKRDLVI 
EHVAEMYGRDAVSQIITFGTMAAK
AVIRDVGRVLGHPYGFVDRISKLIPLDPGMTLEK 
AFAAEPQLAEIYEADEEVRALIDM 
ARKLEGVTRNAGKHAGGWIAPTKITDFAPLYCD 
AEGNNPVTQFDKNDVEYAGLVKFD 
FLGLRTLTIINWALEMINARRAKTGLEPIDIASI
PLEDKKSFDMLQRSETTAVFQLES 
RGMKDLIKRLKPDCFEDMIALVALFRPGPLQSGM 
VDNFIDRKHGREAISYPDIEWQHE 
SLKPVLEPTYGIILYQEQVMQIAQVLSGYSLGGA 
DMLRRAMGKKNPAEMAKQRSVFED 
GAKNQGIDGELAIKIFDLVEKFAGYGFNKSHSAA 
YALVSYQTLWLKAHYPAEFMAAVM 
TADMDNTDKWGLVDECWRMGLKILPPDINSGLY 
HFHVNDDGEIVYGIGAIKGVGEGP
IEAILEARKEGGYFKELFDLCARVDTKKLNKRIL
EKLIMSGAFDRLGPHRAALMSSLG 
DALKAADQHAKAEAIGQVDMFGVLADAPEQVEQS 
YANVPPWQEQWLDGERETLGLYL 
TGHPITQYLKEIERYAGGMRLKDMHPTDRGKMTT 
AVGLVIAARVMVTKRGNRIGICTL 
DDRSGRLEVMLFTDALEKYQHLLEKDRILIATGQ 
VSFDDFSGGLKMTARELMDISEAR 
EKYASGLAISLTDRQIDDQLLNRLRQSLEPHRAG 
TIPVHLYYQREDARARLRFGATWR
VTPTDRLLIDLRTLVGNEQVELEFD"
/gene="dnaE" 
/note="Pfam match to entry PF02231
PHP-N, PHP domain N-terminal
region, score 115.50, E-value
1e-30"

/gene="dnaE"

/note="Pfam match to entry PF01336
tRNA-anti, tRNA synthetase
anti-codon binding domain, score
43.00, E-value 6.7e-09"
/gene="accA"



/gene="accA"

/EC-number="6.4.1.2"

/note="Similar to Escherichia coli

acetyl-coenzyme A carboxylase

carboxyl transferase subunit alpha

AccA SW:ACCA-ECOLI (P30867) (318

aa) fasta scores: E(): 0, 90.5% id

in 317 aa"

/codon-start=1

/transl-table=11

/product="acetyl-coenzyme A

carboxylase carboxyl transferase

subunit alpha"

/protein-id="CAC89902.1"

/db-xref="GI:15979127"

/db-xref="SPTREMBL:Q8ZH52"

/translation="MSLNFLDFEQPIAELEAKID 

SLTAVSRQDEKLDINLDEEVQRLR 

EKSVELTRKIFSDLGAWQIAQLARHPRRPYTLDY 

IANIFTDFEELAGDRAYADDKAIV

GGIARLDGRPVMIIGHQKGRETKEKIRRNFGMPA

PEGYRKALRLMEMAERFKLPIITF 

IDTPGAYPGVGAEERGQSEAIARNLREMSRLNVP 

IVCTVIGEGGSGGALAIGVGDKVN 

MLQYSTYSVISPEGCASILWKSADKAPLAAEAMG 

ITAHRLKELKMIDSVIPEPLGGAH 

RDYAAIAISLKAQLLADLNDLDVLNDEELLNRRY 

QRLMNYGYC" 

/gene="YPO1061"

/gene="YPO1061"

/note="Similar to Escherichia coli
hypothetical protein YaeR
SW:YAER-ECOLI (P52096) (129 aa)
fasta scores: E(): 0, 71.1% id in
128 aa, and to Vibrio cholerae
glyoxylase I family protein
VCA0890 TR:Q9KL58 (EMBL:AE004416)
(127 aa) fasta scores: E():
1.1e-27, 59.2% id in 125 aa"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89903.1"

/db-xref="GI:15979128"

/db-xref="SPTREMBL:Q8ZH51"

/translation="MLAINKIHHIAIICSDYQAS

KRFYCEVLGFNLISEVYREERDSW

KADLALHDQYTIELFSFPSPVPRPSRPEACGLRH

LAFQVDDIDLALKELWAGWCEA

VRIDPYTQSRFTFFNDPDGLPLELYELKAE"

/gene="YPO1062" 

/gene="YPO1062" 

/note="Similar to Escherichia coli
putative cell cycle protein MesJ
SW:MESJ-ECOLI (P52097) (432 aa)
fasta scores: E(): 0, 55.2% id in
440 aa"

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89904.1"

/db-xref="GI:15979129"

/db-xref="SPTREMBL:Q8ZH50"

/translation="MNLVTSKPNVLLNPLFAQLG

ENRHVLVGFSGGLDSTVLLHLLVC 

LRQQLIPELNIRAIHIHHGLNPQADSWVKHCMQQ 

CDQWKIELKWRVNIDPRQNGIEA 

AARTARYQAFSANLAAKEVLLTAQHLDDQCETFL 

LALKRGSGPAGLSAMAAKMPFAHS

QLLRPLLiAFSREILENYAQAQQLQWIEDDSNQDD 

RFDRNFLRLNVLPILNQRWPHFAQ 

ATARSAGLCAEQEQLLDELLAENLQQLQGPDRSL 

SIDGLLQASiVIAKRAAILRRWLASL 



88536. .88853 
88536. .88853 



88602. .88847 



88638. .88655 



complement (89118 



VRRFRQHLYLLMPLAEITTNYLPW 

ATVKAAPNSSIIPLLPEPLWLPADLGVLRFVSAG 

GQAVRPAAVGEEISVRFGLQGDIK 

IVGRHHSRQSKKVWQELGIPPWQRERIPLLYFGE 

QLIAAAGVFVTQAGQANENEPCWH 

LDWDKPLRLG" 

/gene="YPO1063" 

/gene="YPO1063" 

/note="Similar to Vibrio cholerae
cytochrome C554 VC2241 TR:Q9KPX1
(EMBL:AE004296) (107 aa) fasta
scores: E(): 1.6e-14, 50.5% id in
105 aa, and to Shewanella violacea
soluble cytochrome CB CytCB
TR:Q9RHJ5 (EMBL:AB032405) (206 aa)
fasta scores: E(): 9.1e-10, 37.7%
id in 106 aa, and to Pseudomonas
aeruginosa cytochrome C4 precursor
PA5490 TR:AAG08875 (EMBL:AE004961)
(201 aa) fasta scores: E():
2.5e-09, 40.4% id in 104 aa"
/codon-start=1
/transl-table=11
/product="putative cytochrome"
/protein-id="CAC89905.1"
/db-xref="GI:15979130"
/db-xref="SPTREMBL:Q8ZH49"
/translation="MMKLWSMAVLLSLCSFQVW
AKNDIEAGRAKSSSCVACHGMNGK
VSVPMYPNLAGQNALYLQQSLEAYKKGSRSGGQA
EVMRAYVSGLSDEDFSDLAAYYASLKP"
/gene="YPO1063" 

/note="Pfam match to entry PF00034 
cytochrome-c, Cytochrome c, score
47.20, E-value 3.6e-10" 
/gene="YPO1063" 

/note="PS00190 Cytochrome c family 
heme-binding site signature." 
.89378 /gene="YPO1064" 



complement (89118. .89378 /gene="YPO1064 " 



complement (89365 , 



/note="Similar to Escherichia coli 
ROF protein SW:ROF-ECOLI (P52098) 
(84 aa) fasta scores: E(): 
1.4e-26, 73.8% id in 84 aa" 
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89906.1"
/db-xref="GI:15979131"
/db-xref="SPTREMBL:Q8ZH48"
/translation="MSMSEEYQPINCDDYDNLEL
ACQHHLILTLKLRGGEIVEGKASD
LLLRKKVEYLIVEQEGTMRELRLDHIASFSHPEI 
GTVWSAS" 
89565 /gene="YPO1064a"



complement (89365..89565 /gene="YPO1064a"



) 



/note="Similar to Escherichia coli

hypothetical protein YaeP

SW:YAEP-ECOLI (P52099) (66 aa)

fasta scores: E(): 1.5e-18, 74.2%

id in 66 aa"

/codon-start=1

/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89907.1"

/db-xref="GI:15979132"

/db-xref="SPTREMBL:Q8ZH47"

/translation="MQQYCELVRRFYAEIGSGDL

GYVPDALRCVLKALDEVAANDALP 



gene 
CDS 



89807. .90355 
89807. .90355 



gene 
CDS 



90358. .90774 
90358. .90774 



gene 
CDS 



90814 . .91542 
90814 . .91542 



misc- feature 



90886. .90918 



/gene="YPO1065" 
/gene="YPO1065"

/note="Similar to Escherichia coli
hypothetical protein YaeQ
SW:YAEQ-ECOLI (P52100) (181 aa)
fasta scores: E(): 0, 61.8% id in
178 aa"

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89908.1"

/db-xref="GI:15979133"

/db-xref="SPTREMBL:Q8ZH46"

/translation="MALKATIHKAAINIADMDRN

FFQDINLTIAQHPSETDQRMMLRL 

LAWICHADERLLFTKGLSADDEPEVWRHNDHNGI 

ELWIELGLPEEKRLRKACNQSKQV 

VLYAYSERAAKVWWPQVQEKLAGHRNLRVRFLDD

EQMAKLAALSNRNMSLQATLQEGT

IWLSDVQNNLEISFAEWQNYGQ"

/gene="YPO1066" 

/gene="YPO1066" 

/note="Similar to Escherichia coli
hypothetical protein YaeJ
SW:YAEJ-ECOLI (P40711) (140 aa)
fasta scores: E(): 0, 65.2% id in
135 aa, and to Pseudomonas putida
hypothetical protein SW:YAEJ-PSEPU
(P45388) (137 aa) fasta scores:
E(): 8.6e-31, 65.2% id in 135 aa,
and to Streptomyces coelicolor
hypothetical protein SCD95A.11
TR:Q9KXW3 (EMBL:AL357432) (145 aa)
fasta scores: E(): 7.4e-13, 41.5%
id in 135 aa"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89909.1"

/db-xref="GI:15979134"

/db-xref="SPTREMBL:Q8ZH45"

/translation="MLIISNSVTLASGEIELTAI

RAQGAGGQHVNKTSTAIHLRFDIK

ASSLPEYYKERLLLLNSHLVTAEGWIIKAQEYR 

SQDMNREAALARLVALIRQAMWE 

KARRATKPTKGSKIRRVEGKVRKGATKALRGKIH 

QE" 

/gene="cutF"

/note="synonyms: nlpE, YPO1067"
/gene="cutF"

/note="Similar to Escherichia coli
copper homeostasis protein CutF
precursor SW:CUTF-ECOLI (P40710)
(236 aa) fasta scores: E(): 0,
48.3% id in 230 aa"
/codon-start=1
/transl-table=11
/product="putative copper
homeostasis lipoprotein"
/protein-id="CAC89910.1"
/db-xref="GI:15979135"
/db-xref="SPTREMBL:Q8ZH44"
/translation="MTSFHIYRTQYGRITVSKLT
ITLLLAVGALSLLGCNNRSQPAEQ
VLQPMQQSYQGVLPCADCSGLDTSLFLDSDGTFI
LKEVYLGTKDGDQTFAEYGKWART
ADKLVLTNGQGEKRYFHPVDKSLVMLDQQGLPIK
STLNYQLEPSDQPLSKTPMPLSGM
YKYFADAAIFTDCVTGKAFPVENNIELETGYLNA
RRNPGEPVFLTLNGYFDSRPSMEE
GRTNKTLIPEGDIQFNANKSCEKK"
/gene="cutF" 

/note="PS00013 Prokaryotic 



gene 
CDS 



complement (91676 
complement (91676 



misc-feature complement (91730



misc-feature complement (92918

) 



gene 
CDS 



complement (93498 



attachment site," 
93394 /gene="proS"

/note="synonyms: drpA, YPO1068"
93394 /gene="proS"

/EC-number="6.1.1.15"
/note="Similar to Escherichia coli
prolyl-tRNA synthetase
SW:SYP-ECOLI (P16659) (572 aa)
fasta scores: E(): 0, 84.2% id in
571 aa"

/codon-start=1
/transl-table=11

/product="prolyl-tRNA synthetase"
/protein-id="CAC89911.1"
/db-xref="GI:15979136"
/db-xref="SPTREMBL:Q8ZH43"
/translation="MRTSQYLLSTQKETPADAEV
ISHQLMLRAGMIRKLASGLYTWLP 
TGVRVLKKVENIVREEMNNAGAIEVSMPWQPAD
LWQESGRWEQYGPELLRFVDRGER 
PFVLGPTHEEVITDLIRGEINSYKQLPLNFFQIQ 
TKFRDEVRPRFGVMRAREFLMKDA 
YSFHTTQESLQETYDAMYTAYSKIFSRMDLNFRA 
VLADTGSIGGSASHEFQVLAESGE 
DDIVFSTGSDYAANIEFAEALAPTEPRAPATEEL 
RIVDTPNAKTIAELVEQFKLPIEK 
TVKTLLVHAHEESGHKLVALLVRGDHDLNEIKAE 
KLPQVAKPLTFASEEEIRAAIGAG 
PGSLGPVNLSLPVIADHSVAVMSDFGAGANIDGK 
HYFGINWERDLALPLVADLRNWE 
GDISPDGKGTLQIKRGIEVGHIFQLGTKYSEVMK
ATVQGEDGRNQVMTMGCYGIGVSR 
WAAAIEQNHDDRGIIWPDAIAPFQVAILPMNMH 
KSFRVKELAEELYTTLRSHGIDVI 
LDDRKERPGVMFADMELIGVPHNIVIGDRNLDSE 
EVEYKNRRVGEKQMIKTSEIVEFLLSQIKR"
.93010 /gene="proS" 

/note="Pfam match to entry PF00587 
tRNA-synt-2b, tRNA synthetase 
class II (G, H, P, S and T), score
244.70, E-value 1.3e-69"
.92980 /gene="proS"

/note="PS00179 Aminoacyl-transfer
RNA synthetases class-II signature
1."

.94205 /gene="YPO1069" 



complement (93498. .94205 /gene="YPO1069" 



) 



/note="Similar to Escherichia coli
hypothetical protein YaeB
SW:YAEB-ECOLI (P28634) (235 aa)
fasta scores: E(): 0, 71.1% id in
235 aa, and to Vibrio cholerae
hypothetical protein VC0876
TR:Q9KTM6 (EMBL:AE004172) (231 aa)
fasta scores: E(): 0, 50.0% id in
232 aa"

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89912.1"

/db-xref="GI:15979137"

/db-xref="SPTREMBL:Q8ZH42"

/translation="MSTFSFNQIGVIRSPYKEKF

AVPRQPGLVEDGGGELQLLAPYNQ 

PESVRGLSDFSHLWVIFVFHQTMEGGWRPTVRPP

RLGGNTRMGVFATRSTFRPNPIGM 

SLIELKGVRCQNGEVILALGSLDLVDGTPVIDIK 

PYLPFAESQPQARAGFAQSAPDTE 



misc-feature complement (93774 .



misc-feature

gene
CDS



complement (93912,

complement (94202 ,
)

complement (94202 .
)



misc-feature



gene 
CDS 



complement (94562 . 



complement (94727. 



PRPAYRKDDNESREYAVLLLEFNV 
RWRVIGQQTEVLSLDPR" 
94154 /gene="YPO1069" 

/note="Pfam match to entry PF01980 
UPF0066, Uncharacterised protein 
family UPF0066, score 254.10, 
E-value 2e-72" 
93947 /gene="YPO1069" 

/note="PS01318 Uncharacterized 
protein family UPF0066 signature."
94609 /gene="rcsF" 

/note="synonym: YPO1070"
94609 /gene="rcsF"

/note="May be involved in
regulation of exopolysaccharide
biosynthesis. Similar to
Escherichia coli RcsF protein rcsF
SW:RCSF-ECOLI (P28633) (133 aa)
fasta scores: E(): 2.1e-27, 63.5%
id in 137 aa"
/codon-start=1
/transl-table=11
/product="putative lipoprotein"
/protein-id="CAC89913.1"
/db-xref="GI:15979138"
/db-xref="SPTREMBL:Q8ZH41"
/translation="MRALPLCLLALSLTGCTLLP
SKPSTTDNPIKQPPPVIERSPTAA
PRPAPVKLYKSAEELVGKPFRDLGEVSGESCQST
VQDSPPSISTARKRMQIRASYMKA
NAVLLHECEIQSGVPGCYQQAVCQGSALNVSSK"
.94594 /gene="rcsF"

/note="PS00013 Prokaryotic
membrane lipoprotein lipid
attachment site."
.95542 /gene="YPO1071" 



complement (94727. .95542 /gene="YPO1071" 



misc-feature complement (95474



gene 



complement (95606 < 



/note="Similar to Escherichia coli
putative lipoprotein YaeC
precursor SW:YAEC-ECOLI (P28635)
(271 aa) fasta scores: E(): 0,
90.4% id in 271 aa, and to Vibrio
cholerae lipoprotein VC0905
TR:Q9KTJ7 (EMBL:AE004174) (275 aa)
fasta scores: E(): 0, 68.3% id in
271 aa"

/codon-start=1
/transl-table=11
/product="putative lipoprotein"
/protein-id="CAC89914.1"
/db-xref="GI:15979139"
/db-xref="SWISS-PROT:Q8ZH40"
/translation="MSLKFKSIAAISALIGTLTL
VGCGPTEKAPNHIKVGVIVGAEQQ
VAEVAQKVAKEKYGLDVELVTFNDYVLPNEALSK
GDIDLNAFQHKPYLDQQIKDRGYK
LVSVGNSFVYPIAGYSKKIKSLDELQPGSQVALP
NDPTNLGRSLLLLQSVGLIKLKDG
VGLLPTVLDWENPKNLKLVELEAPQLPRSLDDQ
QIALAIINTTYASQIGLTPAKDGL
FVEDKESPYVNLIVAREDNKDAENVKKFVQAYQS
DEVYDAANKAFNGGAVKGW"
95506 /gene="YPO1071" 

/note="PS00013 Prokaryotic 
membrane lipoprotein lipid 
attachment site." 
96259 /gene="YPO1072" 



CDS 



complement (95606..96259 /gene="YPO1072"



misc-feature



misc-feature



misc-feature

gene 
CDS 



complement (95651. 



complement (95717 , 



misc-feature complement (95762 



misc-feature complement (95840 .



misc-feature complement (95948 



misc-feature complement (96029 



complement (96134 . 



complement (96252, 
complement (96252 . 



/note="Similar to Escherichia coli
hypothetical ABC transporter
permease protein YaeE
SW:YAEE-ECOLI (P31547) (217 aa)
fasta scores: E(): 0, 90.8% id in
217 aa, and to Vibrio cholerae ABC
transporter, permease protein
VC0906 TR:Q9KTJ6 (EMBL:AE004174)
(225 aa) fasta scores: E(): 0,
54.3% id in 210 aa"
/codon-start=1
/transl-table=11

/product="ABC transporter permease
protein"

/protein-id="CAC89915.1"
/db-xref="GI:15979140"
/db-xref="SWISS-PROT:Q8ZH39"
/translation="MSEAMMWLMARGVWETLMMT
FVSGFFGFVLGLPVGVLLYVTRPG
QIIANNKIYRTLSGWNIFRSIPFIILLVWMIPF
TRMIVGTSIGLQAAIVPLTVGAAP
FIARMVENALLEIPSGLVEAARAMGATPMQIIKK
VLLPEALPGLVNAATITLITLVGY
SAMGGAVGAGGLGQIGYQYGYIGYNATVMNTVLV
LLVILVYLIQLSGDRIVKAVTHK"
95716 /gene="YPO1072" 

/note="one of 5 probable 
transmembrane helices predicted
for YPO1072 by TMHMM2.0"
95929 /gene="YPO1072"

/note="Pfam match to entry PF00528
BPD-transp,

Binding-protein-dependent
transport systems inner membrane
component, score 66.00, E-value
8.1e-16"
95827 /gene="YPO1072"

/note="one of 5 probable
transmembrane helices predicted
for YPO1072 by TMHMM2.0"
.95926 /gene="YPO1072"

/note="PS00402
Binding-protein-dependent
transport systems inner membrane
comp. sign."
.96013 /gene="YPO1072"

/note="one of 5 probable
transmembrane helices predicted
for YPO1072 by TMHMM2.0"
.96094 /gene="YPO1072"

/note="one of 5 probable
transmembrane helices predicted
for YPO1072 by TMHMM2.0"
.96199 /gene="YPO1072"

/note="one of 5 probable
transmembrane helices predicted
for YPO1072 by TMHMM2.0"
.97283 /gene="abc" 

/note="synonym: YPO1073"
.97283 /gene="abc"

/note="Similar to Escherichia coli
ATP-binding protein Abc
SW:ABC-ECOLI (P30750) (343 aa) 
fasta scores: E(): 0, 80.2% id in 



misc-feature complement (96633 .



misc-feature



misc-feature



complement (96819 . 



complement (97149 . 



gene 
CDS 



97470. .98036 
97470. .98036 



gene 
rRNA 



tRNA 



gene 
rRNA 



gene 
rRNA 



98505. .99993 
98505. .99993 



100157. .100232 



100485. .103391 
100485. .103391 



103391. . 103624 
103391. .103624 



/codon-start=1
/transl-table=11
/product="ABC transporter
ATP-binding protein"
/protein-id="CAC89916.1"
/db-xref="GI:15979141"
/db-xref="SWISS-PROT:Q8ZH38"
/translation="MIKLTHISKVFQQGSRTITA
LSDVSLHVPAGQIYGVIGASGAGK
STLIRCANMLERPTSGQVLVDDQDLTTLSEGQLT
RARRQIGMIFQHFNLLSSRTVYGN
IALPLELDNTSRADIKKRVNELLDLVGLTDKQDA
YPANLSGGQKQRVAIARALASNPK
VLLCDEATSALDPATTRSILELLKDINRRLGLTI
LLITHEMDWKRICDQVAVISEGK
LIEKDSVSEVFSHPKTPLAQQFIQSTLHLDIPED 
YAKRMSPEPTVDHVPLLKLEFTGK 
SVDAPLISQAVRRFNIDIGILSSQMDYAGGVKFG 
VMLAELHGDVQDGLAAIKFLQDHH 
VKVEVLGYV" 
97193 /gene="abc" 

/note="Pfam match to entry PF00005 
ABC-tran, ABC transporter, score 
258.10, E-value 1.2e-73" 
96863 /gene="abc" 

/note="PS00211 ABC transporters 
family signature." 
97172 /gene="abc" 

/note="PS00017 ATP/GTP-binding 
site motif A (P-loop)." 
/gene="YPO1074"
/gene="YPO1074"

/note="Similar to Escherichia coli
hypothetical protein YaeD
SW:YAED-ECOLI (P31546) (191 aa)
fasta scores: E(): 0, 80.3% id in
188 aa, and to Escherichia coli
O157:H7 phosphatase YaeD
TR:BAA93568 (EMBL:AB035926) (191
aa) fasta scores: E(): 0, 80.3% id
in 188 aa"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89917.1"

/db-xref="GI:15979142"

/db-xref="SPTREMBL:Q8ZH37"

/translation= "MTQSVPAIFLDRDGTVNVDH 

GYVHEIDNFQFIDGVIDACRELKE 

MGFALVLVTNQSGIARGMFTEEQFLSLTEWMDWS 

LADRGVDLDGIYFCPHHPDGSVAE 

FSETCECRKPLPGMLLQAQNELNIDMAASYMVGD 

KIEDMQAALAANIGTKYLVRTGKP 

VTAEGEAAADWVLNSLADLPKAIKARYK" 

/gene="16S-rRNA" 

/gene="16S-rRNA" 

/note="match to 16S-rRNA 1..1489 
(EMBL:YPE16SA)"
/product="tRNA-Glu"
/note="tRNA Glu anticodon TTC, 
Cove score 56.95" 
/gene="23S-rRNA" 
/gene="23S-rRNA" 

/note="match to 23S-rRNA 1..2906 

(Y. enterocolitica 23S 

EMBL:U77925)"

/gene="5S-rRNA" 

/gene="5S-rRNA" 

/note="match to 5SrRNA 1..234 

blast score 1131 percent identity 

98 (incl. 23S-5S spacer)" 



/note="tRNA Asp anticodon GTC,
Cove score 90.47" 

104177..104980 /gene="YPO1075"

104177..104980 /gene="YPO1075"

/note="Similar to Escherichia coli
hypothetical oxidoreductase in
aspU-mltD intergenic region YafB
SW:YAFB-ECOLI (P30863) (267 aa)
fasta scores: E(): 0, 74.9% id in
267 aa, and to Corynebacterium sp
2,5-diketo-D-gluconic acid
reductase TR:P06632 (EMBL:M12799)
(278 aa) fasta scores: E():
6.3e-31, 41.6% id in 255 aa"
/codon-start=1
/transl-table=11
/product="putative aldo/keto
reductase"

/protein-id="CAC89918.1"
/db-xref="GI:15979143"
/db-xref="SWISS-PROT:Q8ZH36"
/translation="MSIPVFGLGTFRLQDQIVXD
SVSQALTLGYRAIDTAQIYENEAP
VGQAIQESGINRDELFITTKIWIANLSKDKLIPS
LRESIQKLKTDYVDLTLIHWPSPN
DEVSVAEFMSELLKAKGLGLTRQIGISNFTIDLM
KQAIAAVGAEEIATNQIELSPLLQ
NRNWDFAKQNGIAVTSYMTLAYGKALAEPVIKT
IAEQHGATPAQVILSWAMQLGYGV
IPSSTKAANLASNLLAQKLCLNAADMALIATLDR
NERLVSPDGLAPKWD"
104177..104941 /gene="YPO1075"

/note="Pfam match to entry PF00248 
aldo-ket-red, Aldo/keto reductase 
family, score 357.70, E-value 
1.3e-103" 

104261..104314 /gene="YPO1075"

/note="PS00798 Aldo/keto reductase 
family signature 1." 

104516..104569 /gene="YPO1075"

/note="PS00062 Aldo/keto reductase 
family signature 2." 

105015..105479 /gene="YPO1076"

105015..105479 /gene="YPO1076"

/note="no significant database 
hits" 

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAC89919.1"

/db-xref="GI:15979144"

/db-xref="SPTREMBL:Q8ZH35"

/translation="MRWFFILVHPDGAETGSVKS

VRMDMELPPSMCAVIVGTNLMRAY 

QRVGEQRVDNS I VKKHGNEAMDLQLSGLRPDNGS 

QVESAGQGNCEAS I GTERMDTLLP 

FNRGGRGDWNSWIGGDKENCVACYGGSGKGRIYE 

QRCLLEEVFYRKGLI YQQ " 

105840..106622 /gene="YPO1077"

105840..106622 /gene="YPO1077"

/note="Similar to Escherichia coli
hypothetical protein YafD
SW:YAFD-ECOLI (P30865) (259 aa)
fasta scores: E(): 0, 75.2% id in 
254 aa" 

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89920.1"

/db-xref="GI:15979145"

/db-xref="SPTREMBL:Q8ZH34"

/translation="MPKRTYAMRYVAGQPVEQIF 

PGAAKQLDKGLPLGEPLPTAELLR 

VVVWNI FKQQRAGWLPVLKELGRDTQLMLLQEAQ 



PQHPSGVMTLAAAHPVYCCPLREREPLLRLSKSA 

LVTVYPIHDGRLLMWNIHAVNFS 

LGVDVYSKQLDPIGDQIASHRGPVILAGDFNAWS 

RQRINALQHFAQDAGLQEVEFRVD 

HRSRAFGRPLDFIFYRGLTVIDASVLVTRASDHN 

PLIVEFQP" 

complement(106664..108091) /gene="mltD"

/note="synonyms: dniR, YPO1078"
complement(106664..108091) /gene="mltD"

/EC-number="3.2.1.-"
/note="Similar to Escherichia coli
membrane-bound lytic murein
transglycosylase D precursor MltD
SW:MLTD-ECOLI (P23931) (452 aa)
fasta scores: E(): 0, 63.4% id in
470 aa"

/codon-start=1

/transl-table=11

/product="membrane-bound lytic

murein transglycosylase D

precursor"

/protein-id="CAC89921.1"

/db-xref="GI:15979146"

/db-xref="SPTREMBL:Q8ZH33"

/translation="MIARLLSNHIDTHMKTKAIL

LASVLLVGCQTSKQDAQAPEQHAQ 

SLSSAGQASEAGEYTNSTREGSARWLESESSYAQ 

QDLWNFIKDELKMKVPENSRIREQ 

KQKYLKNKSYLHDVTLRAEPYMYLIVEQIKKRKM 

PMELVLLP IVES AFDPHATS SANA 

AGLWQIVPSTGRNYGLKQNQWYDGRRDWASTKA 

ALD ILERLNKMFNGDWLLTVAAYN 

SGEGRVMQAIKANKAKGKPTDFWALSLPRETS I Y 

VPKMLALGDLLKNSKKYGITLPET 

DKDRALARVDVDQQIELTQAAEMAGMSLTKLKSF 

NSGYKRNVTAPDGHGPRYIMLPKA 

HAEQLKDSLADTDITAVQPTKLATNSTKSASSSQ 

YKYRPGDTLSTIAKRLNIKTSDLQ 

SWNNLRAKSTLKVGQTLQLASNTTSKSITYQVRK 

GDSFASIAKRHGVNTDDVMRWNSV 

VSKANNLQPGLKLTLFVNDKSTPEA" 

complement(106694..106828) /gene="mltD"

/note="Pfam match to entry PF01476
LysM, Putative peptidoglycan
binding domain, score 44.10,
E-value 3.2e-09"

complement(106856..106987) /gene="mltD"

/note="Pfam match to entry PF01476
LysM, Putative peptidoglycan
binding domain, score 62.30,
E-value 1e-14"

complement(107372..107722) /gene="mltD"

/note="Pfam match to entry PF01464
SLT, Transglycosylase SLT domain,
score 122.70, E-value 6.7e-33"
complement (107579. .1076 /gene="mltD" 

/note="PS00922 Prokaryotic 
transglycosylases signature." 

complement(108005..108037) /gene="mltD"

/note="PS00013 Prokaryotic
membrane lipoprotein lipid
attachment site."

complement(108124..108879) /gene="gloB"

/note="synonym: YPO1079"
complement(108124..108879) /gene="gloB"

/EC-number="3.1.2.6"



misc- feature 



complement (108385 . 
61) 



gene 
CDS 



108924 
108924 



109643 
109643 



gene 



CDS 



complement (109698 
62) 



complement (109698 , 
62) 



probable hydroxyacylglutathione
hydrolase GloB SW:GLO2-ECOLI
(Q47677) (251 aa) fasta scores: 
E(): 0, 57.4% id in 251 aa, and to 
Arabidopsis thaliana 
hydroxyacylglutathione hydrolase, 
mitochondrial precursor Glx2-1 
SW:GL2M-ARATH (O24495) (331 aa)
fasta scores: E(): 2.9e-32, 39.0% 
id in 236 aa" 
/codon-start=1
/transl-table=11
/product="putative
hydroxyacylglutathione hydrolase"
/protein-id="CAC89922.1"
/db-xref="GI:15979147"
/db-xref="SPTREMBL:Q8ZH32"
/translation="MNLISIPAFQDNYIWLLANR
QKHCVIVDPGESAPVLATLAQGQY 
VPQAILLTHHHNDHVGGVADLRHHFPDIPVYGPQ 
ETAKKGATVIVNDGDSLTIAGQNY 
TIIAVPGHTLGHIAYYSSPYLFCGDTLFSAGCGR 
LLEGTPEQMYAS IQRLAQLPDETL 
I CCAHE YTLSNLKFAHAI L P ADQD I AT YQQQ I EQ 
LRSKNLPSLPVKLQFERKINVFLR 
CNDIDLQRKIGTTSPPDSLVSVFCELRSRKDSF" 
,1088 /gene="gloB" 

/note="Pfam match to entry PF00753 

lactamase-B, 

Metallo-beta-lactamase

superfamily, score 162.10, E-value 

9.3e-45" 

/gene="YPO1080" 

/gene="YPO1080" 

/note="Similar to Escherichia coli
hypothetical protein YafS
SW:YAFS-ECOLI (P75672) (240 aa)
fasta scores: E(): 0, 63.6% id in
236 aa, and to Vibrio cholerae
hypothetical protein VC2235
TR:Q9KPX7 (EMBL:AE004295) (245 aa)
fasta scores: E(): 0, 44.2% id in
233 aa"

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89923.1"
/db-xref="GI:15979148"
/db-xref="SPTREMBL:Q8ZH31"
/translation="MKPAQIRQNIETPASWAELP 
WGEYYRAALEQQLQPWWPKFFGFH 
LLKVGHLS AE I ASDKCAI AHQVNVGEWGKNMQVL 
ASPYQLPFAEKSVDACLLSHTLAY 
AANPHRILREVDRVLIDDGWLVISGFNPFSLLGL 
GKLVPVLRQRQP YI SRMFTQMRLL 
DWLSLLNYEWHLSCFHVLPFNKNGGHFISTHLP 
ALGCVNLI lARKRTVPLTFNPMKF 
GVRKPWFSRAVGATKS YRDQP " 
.1101 /gene="rnhA" 



aa) 
in 



/note="synonyms: dasF, herA, rnh,
sdrA, YPO1081"
1101 /gene="rnhA"



/EC-number="3.1.26.4"

/note="Similar to Escherichia coli

ribonuclease HI SW:RNH-ECOLI

(P00647) (155 aa) fasta scores:

E(): 0, 84.8% id in 151 aa"

/codon-start=1

/transl-table=11

/product="ribonuclease HI"

/protein-id="CAC89924.1"



complement (109737 . 
59) 



110232. .110996 
110232, .110996 



110253. .110780 



111109. .111185 



complement (111242 , 
26) 

complement (111242 , 
26) 



/db-xref="SWISS-PROT:Q8ZH30"
/translation="MTKQVEIFTDGSCLGNPGPG
GYGAILRYKQHEKTFSAGYYLTTN 
NRMELMAAIVALEALTSPCEVTLSTDSQYVRQGI 
TQWIHNWKKRGWKTADRKPVRNVD 
LWQRLDLAIQSHTIQWEWVKGHAGHPENERCDEL 
ARQGANS PTLDDTGYNPD " 
1101 /gene="rnhA" 

/note="Pfam match to entry PF00075 
rnaseH, RNase H, score 227.00, 
E-value 2.8e-64" 
/gene="dnaQ"

/note="synonyms: mutD, YPO1082"
/gene="dnaQ"
/EC-number="2.7.7.7"
/note="Similar to Escherichia coli
DNA polymerase III, epsilon chain
SW:DP3E-ECOLI (P03007) (243 aa)
fasta scores: E(): 0, 75.9% id in
237 aa"

/codon-start=1

/transl-table=11

/product="DNA polymerase III,

epsilon chain"

/protein-id="CAC89925.1"

/db-xref="GI:15979150"

/db-xref="SPTREMBL:Q8ZH29"

/translation="MIITPTRQIVLDTETTGMNK 

LGVHYEGHRI lEIGAVEVINRRLT 

GRNFHVYVKPDRLVDPEAYGVHGI SDEFIiADKPT 

FADITPEFLDFIRGAELVIHNAAF 

DIGFMDYEFRMLQQDIPKTETFCTITDSLLMARR 

LFPGKRNNLDALCDRYQIDNTKRT 

LHGALLDAEILAEVYLAMTGGQTSLTFSMEGEVS 

QNNASEDIQRITRPASALKIIYAT 

EDELANHESRLDFVMKKGGSCLWRMPLEAEEETK 

AE" 

/gene="dnaQ"

/note="Pfam match to entry PF00929
Exonuclease, Exonuclease, score
171.30, E-value 1.6e-47"
/product="tRNA-Asp"
/note="tRNA Asp anticodon GTC, 
Cove score 94.51" 
,1127 /gene="YPO1083" 

.1127 /gene="YPO1083" 

/note="Similar to Pseudomonas
aeruginosa probable permease
TR:AAG03865 (EMBL:AE004485) (575
aa) fasta scores: E(): 0, 39.1% id
in 465 aa, and to Streptomyces
coelicolor putative transmembrane
transporter SC1A6.06 TR:O69811
(EMBL:AL023496) (522 aa) fasta
scores: E(): 0, 32.6% id in 481
aa"

/codon-start=1

/transl-table=11

/product="putative permease"

/protein-id="CAC89926.1"

/db-xref="GI:15979151"

/db-xref="SPTREMBL:Q8ZH28"

/translation="MPNNEKISASSINPHYSPKL

TNEDLAPTRKQNWGWYNIFSFWMT 

DVHSMGGYWAASFFALGLSSWQVLLSLLVGICI 

VQICANLVAKPSQIAGVPYAVICR 

QS FGVFGAN I PAVI RGL I AVAWYG I QT YLAGNAF 

MLVMLKFFPSLIPLAQSQWLGLST 

LGWICFSIMWVLQAMVFWHGMNAIKRFIDFAGPA 

VYWMLSLALWI VYQTGWSNI S FT 

LASKILSPSEQVWQMLTATALWSYFAGPLLNFG 

DFSRYGKSMQEIRRGNRWGLPFNF 



113065. .113283 



complement (113283 . 
36) 

113283 , .113310 
complement (11334 9 . 
31) 

complement (11334 9 < 
31) 



complement (113424 , 
63) 



complement (113781, 
04) 



complement (114128 
50) 

complement (114128 
50) 



/gene="YPO1084" 
/note="Appears to have been
interrupted by the insertion of
downstream IS100. Similar to
N-terminus of Pseudomonas
aeruginosa hypothetical protein
TR:AAG04906 (EMBL:AE004580) (308
aa) fasta scores: E(): 6.7e-13,
56.5% id in 69 aa, and to
N-terminus of Schizosaccharomyces
pombe hypothetical 36.2 kDa
protein C19G12.03 in chromosome I
SPAC19G12.03 TR:O13842
(EMBL:Z97209) (320 aa) fasta
scores: E(): 3.6e-06, 44.4% id in
63 aa"
/pseudo

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein (pseudogene)"
1152 /note="insertion sequence, IS100"

/note="IS100 inverted repeat"
1141 /gene="YPO1085" 

/note="synonym: ypmt1.57c"
,1141 /gene="YPO1085" 

/note="Similar to Escherichia coli
insertion sequence IS21 putative
ATP-binding protein, IstB
SW:ISTB-ECOLI (P15026) (265 aa)
fasta scores: E(): 0, 47.4% id in
249 aa. Also almost identical to
Yersinia pestis and Escherichia
coli putative IS100 transposase
ypmt1.57c TR:Q9R3L5
(EMBL:AL117211) (260 aa) fasta
scores: E(): 0, 99.6% id in 260
aa."

/codon-start=1
/transl-table=11
/product="insertion sequence
IS100, ATP-binding protein"
/protein-id="CAC89928.1"
/db-xref="GI:15979152"
/db-xref="SPTREMBL:Q9R3L5"
/translation= "MMMELQHQRLMALAGQLQLE 
SLISAAPALSQQAVDQEWSYMDFL 
EHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEY 
DFTFATGAPQKQLQSLRSLSFIER 
NENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRF 
TTAADLLLQLSTAQRQGRYKTTLQ 
RGVMAPRLLI IDEIGYLPFSQEEAKLFFQVIAKR 
YEKSAMILTSNLPFGQWDQTFAGD 
AALTSAMLDRILHHSHWQIKGESYRLRQKRKAG 
VIAEANPE" 
.1139 /gene="YPO1085" 

/note="Pfam match to entry PF01695 
IstB, IstB-like ATP binding
protein, score 367.20, E-value 
1.7e-106" 
.1138 /gene="YPO1085" 

/note="PS00017 ATP/GTP-binding
site motif A (P-loop)."
.1151 /gene="YPO1086" 



/note= " synonyms : 
1151 /gene="YPO1086" 



yl093, ypmtl" 



/note=" Similar to Escherichia coli 
transposase for insertion sequence 



misc-feature complement(114272..114805)

misc-feature complement(115034..115099)

misc-feature complement(115043..115126)

misc-feature complement(115209..115236)

gene complement(115239..115340)

CDS complement(115239..115340)



(P15025) (390 aa) fasta scores: 

E(): 3e-27, 33.1% id in 329 aa. 

Identical to the previously 

sequenced Yersinia pestis,

Yersinia pseudotuberculosis, and 

Escherichia coli pesticin plasmid 

insertion sequence transposase 

Y1055 TR:P74993 (EMBL:U59875) (340

aa) fasta scores: E(): 0, 100.0% 

id in 340 aa" 

/codon-start=1

/transl-table=11

/product="transposase for

insertion sequence IS100"

/protein-id="CAC89929.1"

/db-xref="GI:15979153"

/db-xref="SPTREMBL:P74993"

/translation="MVTFETVMEIKILHKQGMSS

RAIARELGISRNTVKRYLQAKSEP 

PKYTPRPAVASLLDEYRDYIRQRIADAHPYKIPA 

TVIAREIRDQGYRGGMTILRAFIR 

SLSVPQEQEPAVRFETEPGRQMQVDWGTMRJSTGRS 

PLHVFVAVLGYSRMLYIEFTDNMR 

YDTLETCHRNAFRFFGGVPREVLYDNMKTWLQR 

DAYQTGQHRFHPSLWQFGKEMGFS 

PRLCRPFRAQTKGKVERMVQYTRNSFYIPLMTRL 

RPMG I TVDVETANRHGLRWLHDVA 

NQRKHETIQARPCDRWLEEQQSMLALPPEKKEYD 

VHLDENLVNFDKHPLHHPLSIYDS FCRGVA" 

/gene="YPO1086" 

/note="Pfam match to entry PF00665 
rve, Integrase core domain, score 
81.90, E-value 1.6e-22" 
/gene="YPO1086"

/note="Predicted helix-turn-helix
motif with score 2147 (+6.50 SD)
at aa 18-39, sequence
MSSRAIARELGISRNTVKRYLQ"
/gene="YPO1086"

/note="Pfam match to entry PF00239
recombinase, Site-specific
recombinases, score 25.70, E-value 
4.7e-06" 

/note="IS100 inverted repeat" 
/gene="intA" 

/note="synonym: YPO1086a"
/partial 

/gene="intA" 

/note="Probable gene remnant.
Similar to N-terminal regions of
Escherichia coli prophage Cp4-57
integrase IntA SW:INTA-ECOLI
(P32053) (413 aa) fasta scores:
E(): 2.8e-05, 61.765% id in 34 aa,
and Haemophilus influenzae Cp4-57
integrase-like protein TR:Q9L8P3
(EMBL:AF198256) (408 aa) fasta
scores: E(): 0.00043, 52.941% id
in 34 aa. CDS is truncated by the
insertion of the downstream IS100
element and subsequent
recombination events"
/codon-start=1
/transl-table=11
/product="phage integrase
(partial)"

/protein-id="CAC89930.1"
/db-xref="GI:15979154"
/db-xref="SPTREMBL:Q8ZH27"



gene 
CDS 



115752 . . 116063 
115752. .116063 



gene 
CDS 



116071. .116391 
116071. .116391 



misc- feature 



gene 
CDS 



116236. .116301 



complement (116740 
28) 

complement (116740 
28) 



KDLTLHDGDGLFLL " 
/gene="YPO1087" 
/gene="YPO1087" 
/note="Similar to plasmid
proteins. Similar to Xylella
fastidiosa hypothetical protein
XFA0045 TR:Q9PHG3 (EMBL:AE003851)
(106 aa) fasta scores: E(): 3e-14,
54.2% id in 72 aa, and to
Pseudomonas alcaligenes
hypothetical 11.2 kDa protein
TR:Q9XAX7 (EMBL:U88088) (99 aa)
fasta scores: E(): 4e-06, 50.0% id
in 54 aa"
/codon-start=1
/transl-table=11
/product="putative prophage
protein"

/protein-id="CAC89931.1"
/db-xref="GI:15979155"
/db-xref="SPTREMBL:Q8ZH26"
/translation="MMKTIKHYLTPEGRDLYMEY
LKSLRDSIAKAKISSRVNRIASGN 
FGDHKPCREGVWELRIDQGPGYRVYYSLVAVKSC 
CCFWAVI SARRMPTS IRLLCVLKI I " 
/gene="YPO1088" 
/gene="YPO1088" 
/note="Similar to Xylella
fastidiosa hypothetical protein
XFA0046 TR:Q9PHG2 (EMBL:AE003851)
(108 aa) fasta scores: E():
4.3e-10, 40.0% id in 95 aa, and to
Pseudomonas alcaligenes
hypothetical 10.3 kDa protein
TR:Q9XAX6 (EMBL:U88088) (96 aa)
fasta scores: E(): 2e-05, 36.6% id
in 82 aa"
/codon-start=1
/transl-table=11
/product="putative DNA-binding
prophage protein"
/protein-id="CAC89932.1"
/db-xref="GI:15979156"
/db-xref="SPTREMBL:Q8ZH25"
/translation="MIMAKARLHDDAMVQLLMED
PEFAQVYLHQALLDIDEEGGQEAF 
LMALRHWEARGGMASVAKKAGVSRETLYRTLSP 
SGNPTLKTLLSWSATGFQFSHLA SITA" 
/gene="YPO1088" 

/note="Predicted helix-turn-helix
motif with score 1826 (+5.41 SD)
at aa 56-77, sequence
GGMASVAKKAGVSRETLYRTLS"
.1178 /gene="YPO1089" 

.1178 /gene="YPO1089" 

/note="Weakly similar to many from
plasmids and bacteriophage.
Similar to unidentified
replication protein A repA
TR:Q03086 (EMBL:M73777) (290 aa)
fasta scores: E(): 6.1e-07, 30.7%
id in 238 aa, and to uncultured
eubacterium plasmid putative DNA
helicase repA TR:Q9L3P0
(EMBL:AJ271879) (279 aa) fasta
scores: E(): 0.00012, 25.7% id in
237 aa, and to Escherichia coli
regulatory protein RepA repA
SW:REPJ-ECOLI (P20356) (279 aa)
fasta scores: E(): 0.00022, 27.6% 
id in 192 aa, and to uncultured 
eubacterium plasmid pIElllS 
putative DNA helicase repA 



gene 
CDS 



complement (117825. 
84) 

complement (117825 , 
84) 



gene 
CDS 



complement (118777 
19) 

complement (118777 
19) 



aa) fasta scores: E(): 0.00036, 
26.7% id in 191 aa, and to 
lactococcal bacteriophage bIL170 
complete genome e5 TR:O80155 
(EMBL:AF009630) (316 aa) fasta
scores: E(): 0.51, 22.2% id in 203 
aa" 

/codon-start=1
/transl-table=11
/product="putative regulatory
prophage protein"
/protein-id="CAC89933.1"
/db-xref="GI:15979157"
/db-xref="SPTREMBL:Q8ZH24"
/translation="MKTPLILLEREPVSLTTNLP
LRKGSDGYNTPQDYNIKGHLPSNT 
LASIYGPSGSYKSFLAVSWACHIATGKPWASRRV 
TQGS WY I VGEGG I GVPRR I RAWG 
MEFNGGTPIESLYRIDCPVFPASPESAIQVIKAA 
QDVTEQTGSPVRLIVLDTLARCFG 
GSDENAAKDMGAFIQGCDYIKAETGATVLVIHHS 
GKDLDKGARGSSAFRAALDVEFNV 
RREGDGGALVLSCTKMKDSEEPTTRAYDLSPLNL 
YIDNDGEEVNSLVLCDKGREVSDE 
GSPYEAELASIQRLTANHIALWQSIRSRTASGEA 
CTKSLVRDDMRGMGFDVAKKFTRW 
LDKLETDGLIHIDGENICPNSLSNTARN" 
.1187 /gene="YPO1090" 

.1187 /gene="YPO1090" 

/note="Similar to the N-terminal
regions of bacteriophage P4
putative P4-specific DNA primase
alpha SW:PRIM-BPP4 (P10277) (777
aa) fasta scores: E(): 0, 35.8% id
in 324 aa, and Escherichia coli
DNA primase TraC SW:TRC5-ECOLI
(P27190) (1448 aa) fasta scores:
E(): 1.5e-10, 27.8% id in 270 aa.
Possible truncated protein."
/codon-start=1
/transl-table=11
/product="putative prophage DNA
primase"

/protein-id="CAC89934.1"
/db-xref="GI:15979158"
/db-xref="SPTREMBL:Q8ZH23"
/translation="MSRFVSDIIRASQHHWSNIL
SSLAI P I PHLNKHGPCPVGGGKDR 
FRFDDKKGRGTWFCNYCGHGDGLDLVTLVRQCDV 
I QAARE I SRLTDLAPTAPAKERTE 
QPLHSDMTQKVTALLAACSQGTSDYLLHKGLNHP 
VFLMPANSAKNIGGVHFNVGSMVL 
PLVDLSGNTTGALLINRRGEKRLLPGSRIKSSFI 
P I THHTLS QT III TEGFATGLVI S 
RFVTATWAAI SATNLTQVAVALRERYPDAQII I 
AADNDVTNSDNNPGKQQAEYAALA 
VNGLVTLPPTCGKADWDDYRQQVGTDAARIEFFR 
QLYNPREWI " 
,1193 /gene="YPO1091" 

.1193 /gene="YPO1091" 

/note="Similar to Shigella
flexneri insertion sequence IS2
DNA TR:Q54157 (EMBL:Z23101) (179
aa) fasta scores: E(): 0.0091,
28.8% id in 132 aa, and to
Bacteriophage P4 ASH protein cl
SW:VASH-BPP4 (P05462) (137 aa)
fasta scores: E(): 0.25, 37.5% id
in 72 aa"
/codon-start=1
/transl-table=11



protein" 

/protein-id="CAC89935.1"
/db-xref="GI:15979159"
/db-xref="SPTREMBL:Q8ZH22"
/translation="MIFYFFLAGALQYNPFAAAK
SVAGISVPKIFLIPSLVIQTWSFI 
GMACAHPSMVAQAGAPKGAPVS KKSGNANSVWAT 
TQGISVSGGSHYNYFLEAAQWLLP 
FTVCTRNT FVP I RGCAMHNS S PLT I EE I VDHCHA 
LVLAMLEITDPTAKELLLFILAER 
LDLLQLMLDEVPDAEEANHE " 
gene complement(119512..120408) /gene="YPO1092"

CDS complement(119512..120408) /gene="YPO1092"

/note="Similar to Pseudomonas
aeruginosa integrase/recombinase
XerD TR:AAG07125 (EMBL:AE004793)
(298 aa) fasta scores: E(): 0.027,
28.0% id in 239 aa, and to
Chlamydia pneumoniae
integrase/recombinase XerC or
CP0752 TR:Q9Z9F7 (EMBL:AE001587)
(312 aa) fasta scores: E(): 0.16,
23.3% id in 232 aa"
/codon-start=1
/transl-table=11
/product="putative DNA-binding
prophage protein"
/protein-id="CAC89936.1"
/db-xref="GI:15979160"
/db-xref="SPTREMBL:Q8ZH21"
/translation="MFVMSRLSREMKMLAKQAGG
SHKTVHDRIRIMDRFSRHLLALNI 
QVRDVKHLKAKHIESYITDRFSQGIAIRSLHNEM 
AALRTVFRFAGRDKIVLSERLTNK 
ALGLGGASRAGSKFAIPEEVYQAALRTAQQQDKG 
LACALQLARLLGLRSQEAVQCANS 
LKTWQKELEQNRSKLTWFGTKGGRSRETRILDP 
EAVKRAVKEAQQIAETRGGKLIDK 
PDLKTAMNFWRSHTTRLGLTGH YS PHSLRYAWAQ 
DAINYYLAAGFSRIEARALTSMDL 
GHGDGRGRYVERVYTRKEANYV " 
misc-feature complement(120319..120384) /gene="YPO1092"

/note="Predicted helix-turn-helix
motif with score 1045 (+2.75 SD)
at aa 9-30, sequence
REMKMLAKQAGGSHKTVHDRIR"
gene complement(120421..120711) /gene="YPO1092a"

CDS complement(120421..120711) /gene="YPO1092a"

/note="No significant database 
matches" 
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAC89937.1"
/db-xref="GI:15979161"
/db-xref="SPTREMBL:Q8ZH20"
/translation="MTPQHKPASAFEISDFTQHV
MHLTPLQ YVRLTAF YKI FEP I PRQ 
ANSLGDDLQKQEAAPSASPIPRNTSTAARSGAQV 
SFKADNLPFPMRQRNAAS " 
gene complement(120803..120976) /gene="YPO1094"

CDS complement(120803..120976) /gene="YPO1094"

/note="No significant database

matches"

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAC89938.1"



121207. .121434 
121207. .121434 



121566. . 124482 
complement (121596 , 
66) 

complement (121596 , 
66) 



.1232 



/db-xref="SPTREMBL:Q8ZH19"

/translation="MPIGDTVIGYQLSERLASVE

QRPLKPWYHDGAVFAHLNQDASL 

HQKTPYTSISGAC" 

/gene="YPO1095" 

/gene="YPO1095" 

/note="No significant database 

matches" 

/codon-start=1

/transl-table=11

/product="hypothetical protein"

/protein-id="CAC89939.1"

/db-xref="GI:15979163"

/db-xref="SPTREMBL:Q8ZH18"

/translation="MQVIRIILILETLGAVIDSE

FTWGKLRQKLNS GKLLEAV I RKSN 

ERGATSLSNLCHLRIFPLASLPKRALHAYSA" 

/note="Low GC region" 

/gene="YPO1096" 



complement (123317 . 
05) 

complement (123317 . 
05) 



.1232 /gene="YPO1096" 

/note="No significant database 

matches" 

/codon-start=1

/transl-table=11

/product="putative phage protein"
/protein-id="CAC89940.1"
/db-xref="GI:15979164"
/db-xref="SPTREMBL:Q8ZH17"
/translation="MKKLILKKIMILSDSGKSAR
QFEFGEHLTLITADDDNSVGKSTL 
AKMIFWTFGCEPIFSEVWRTLDCTSIIEFEINNA 
PYIIHRYKNEIKIRHSNGKLHSFP 
KITGDYSKYFSELVNFNVLLPKKGQLTLETPPPA 
Y Y F I P F Y I DQKRTWAKP WD S FENL 
QQYSSWAKPVISYHSGLFIKAHFEVEKDIYVIKR 
ELEEVEKGVTELNNAVI ILRQNLI 
DTDNVLPSTEFIFSVIQESEKNKKNLLEERTNLR 
VEKIRLESQIKLAKGIISELDKDY 
IFSVENMEDGDIECPTCGTIHENSIAHRTSILID 
MELAKNQLESLESEVNGI INLLVI 
KDEEITEQSNKSERSYNNVIESDSNSLISFTNDK 
FEKRAHEINSKKNITIEIKKSEEK 
TAKKSQSDILSKEQKSEIKQSFADRLSKYITKLK 
VNVD I SKI KS PLD YKKI YEVGGAA 
EDARAVLGYYLAIYEQVADSCEEALPPLVIDTPN 
QQEQASGNYTNI IKS I SDGINNDR 
QYVICAMEHKALEQIKTNAQVIKLDARKILLQDQ 
YEKISKLRNEVIFD" 
.1244 /gene="YPO1097" 

.1244 /gene="YPO1097" 

/note="No significant database 
matches" 
/codon-start=1
/transl-table=11

/product="putative phage protein"

/protein-id="CAC89941.1"

/db-xref="GI:15979165"

/db-xref="SPTREMBL:Q8ZH16"

/translation="MDSEDTSDNPLSEPQRESAG

AQSYDRFEYQYHWALCKAFSSYKE 

SNDFAI FMEYHEDWYANS IDKNKVKFTFNQIKA 

NAKKTYTAKVLTKRENGTKPSLLG 

KLGSSVSDKKYFEKVEKLDFITTSGFNLTKNNKL 

NLASYKLSSLSSDEASEIIKCLSD 

ELANFGKFPQEIIHFTTSDIPLESYYEHTISRIT 

DSVEEVYPQHLLRTKDIYRILMDS 

LRIRGKNVFDYENWNDALTKKALTYKDIHTWTK 

IVSKGNSDLNFEAIKYILDDLNLK 

GTIKAKIIQKINSYSLTLLSPTLAIIKAQKEIQE 

LLKKNS KKI E I EMS YELLERL I ID 

LSKDTIGLFEDNISISSAIIYEMVEATN" 



83)"" 

complement(124484..125683) /gene="YPO1098"

/note="Similar to Escherichia coli
prophage CP4-57 integrase
SW:INTA-ECOLI (P32053) (413 aa)
fasta scores: E(): 0, 57.8% id in
389 aa, and to Vibrio cholerae
phage integrase VC0516 TR:Q9KUK2
(EMBL:AE004138) (413 aa) fasta
scores: E(): 0, 52.7% id in 393
aa"

/codon-start=1

/transl-table=11

/product="putative prophage

integrase"

/protein-id="CAC89942.1"

/db-xref="GI:15979166"

/db-xref="SPTREMBL:Q8ZH15"

/translation="MARTTRPLTHTEVQKAKATD

KDLTLHDGDGLFLLVKTTGKKIWR 

FRYQLPNSSKRTMI SLGAYPALSLADAREIRAEK 

LAMLVRGTDPQTRAGEEAEKLQIA 

EESIFVNVARKWFELKESHVSAAHAKDIWRSIEK 

DILPSIENIPIQELKARTLIQVLE 

PIKARGALETVRRLVQRINEIMIYAVNVGLIDAN 

PASGIGNAFERPKKQHMPTIRPEE 

LPKLMRTIAMSNLSLPTRCLLEWQLLTLIRPAEA 

SATAWI E I DLENKQWC I PAERMKA 

KRDHIVPLSEQALELLEIMRPISGNRQHVFPSRN 

DPRNHMNSQTANAALKRIGYGGKL 

VAHGLRSIASTAMNEAGFNADVIEAALAHSDKNE 

VRKAYNRSTYLIQRQELMNWWGLE lYTKRSI" 

complement(124535..125062) /gene="YPO1098"

/note="Pfam match to entry PF00589
Phage-integrase, Phage integrase
family, score 116.60, E-value
4.7e-31"

complement(125799..126510) /note="IS200-like insertion
sequence: IS1541. Contains an 'A'
at nucleotide position 315."
complement(125907..126365) /gene="tnp"

/note="synonym: YPO1099"
complement(125907..126365) /gene="tnp"

/note="Similar to Salmonella
typhimurium and Salmonella typhi
transposase for insertion sequence
IS200, TnpA, SW:TNPA-SALTY
(Q57334) (152 aa) fasta scores:
E(): 0, 93.4% id in 152 aa. Almost
identical to the Yersinia
pseudotuberculosis IS1541 element
transposase Tnp TR:Q9X9F5
(EMBL:AJ238014) (152 aa) fasta
scores: E(): 0, 99.3% id in 152
aa"

/codon-start=1

/transl-table=11

/product="transposase for the

IS1541 insertion element"

/protein-id="CAC89943.1"

/db-xref="GI:15979167"

/db-xref="SPTREMBL:Q56938"

/translation="MRDEKSLAHTRWNCKYHIVF

APKYRRQVFYREKRRAIGS ILRKL 

CEWKNWILEAEYCVDHIHMLLEIPPKMSVSGFM 

GYLKGKSSLMLYEQFGDLKFKYRN 

REFWCRGYYVDTVGKNTARIQEYIKHQLEEDKMG 

EQLS I P YPGS PFTGRK " 

complement(126000..126311) /gene="tnp"

/note="Pfam match to entry PF01797 



126588. .126951 
126588. .126951 



complement (127000 
82) 



complement (127000 . 
82) 



complement (127243 
49) 



complement (127351. 
89) 



127643 . . 128077 
127643. .128077 



128070. .128354 
128070. .128354 



like, score 230.20, E-value 3e-65"

/gene="ssrA" 

/gene="ssrA" 

/note= "Similar to Escherichia coli 
small stable RNA molecule, tmRNA 
[10Sa RNA] (D12501), 91% identity
in 333 nt overlap" 
1274 /gene="smpB" 

/note="synonyms: b2620, smqB,
YPO1101"
1274 /gene="smpB" 

/note= "Similar to Escherichia coli 
SsrA-binding protein SmpB 
SW:SMPB-ECOLI (P32052) (159 aa) 
fasta scores: E(): 0, 84.9% id in 
159 aa, and to Vibrio cholerae 
SsrA-binding protein small protein 
B TR:Q9KTQ3 (EMBL:AE004170) (161
aa) fasta scores: E(): 0, 67.7% id
in 161 aa"
/codon-start=1
/transl-table=11
/product="SsrA-binding protein
(small protein B)"
/protein-id="CAC89944.1"
/db-xref="GI:15979168"
/db-xref="SWISS-PROT:Q8ZH14"
/translation= "MTKKKAYKPGSATIAQNKRA 
RHEYFIEEEFEAGLALQGWEVKSL 
RAGKANISDSYVMFKNGEAFLFGATITPLNVAST 
HWCEPMRTRKLLLNKRELDSLFG 
RVNREGYTWALSMYWKNAWVKVKIGVAKGKKDN 
DKRDDIRDREWKLDKARIMKHANR " 
.1274 /gene="smpB" 

/note="Pfam match to entry PF01668 
SmpB, SmpB protein, score 161.50, 
E-value 1.4e-44" 
.1273 /gene="smpB" 

/note="PS01317 Protein smpB

signature."

/gene="YPO1102" 

/gene="YPO1102" 

/note="Similar to Escherichia coli
hypothetical protein YfjG
SW:YFJG-ECOLI (P52121) (158 aa)
fasta scores: E(): 0, 79.3% id in
145 aa, and to Vibrio cholerae
hypothetical protein VC0849
SW:Y849-VIBCH (P52122) (144 aa)
fasta scores: E(): 0, 69.0% id in
142 aa"

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89945.1"

/db-xref="GI:15979169"

/db-xref="SPTREMBL:Q8ZH13"

/translation="MPQISRSALVPFSVKQMYQL

VNDVRSYPEFLPGCTGSRVLDATE 

NEMI AAVDVAKAGI S KTFTTRNTLTDNQS INMQL 

VDGPFRKLMGGWHFTPLSADACKV 

ELHLDFEFTNKLIELAFGKIFKELAGNMVQAFTQ 

RAKEVYSA" 

/gene="YPO1103" 

/gene="YPO1103" 

/note= "Similar to Escherichia coli 
hypothetical protein YfjF 
SW:YFJF-ECOLI (P52119) (96 aa) 
fasta scores: E(): 5.8e-24, 76.4% 
id in 89 aa, and to Vibrio 



SW:YFJF-VIBCH (P52120) (101 aa)
fasta scores: E(): 2.4e-17, 60.9%
id in 92 aa"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89946.1"

/db-xref="GI:15979170"

/db-xref="SWISS-PROT:Q8ZH12"

/translation="MPDIRVEWYALSERQYLRT

VSLWGSTVEDAI KASGLLELRPD 

IDLEKNKVGIYSRPVKLGDKLNDGDRVEIYRPLI 

ADPKELRRQRAEQAKK " 

complement(128504..128854) /gene="YPO1104"

complement(128504..128854) /gene="YPO1104"

/note="Similar to Escherichia coli 
small protein A precursor SmpA 
SW:SMPA-ECOLI (P23089) (113 aa) 
fasta scores: E() : 0, 78.8% id in 
113 aa, and to Vibrio cholerae 
small protein A homolog precursor 
SmpA TR:Q9KTQ0 (EMBL:AE004170)
(136 aa) fasta scores: E():
3.1e-12, 36.8% id in 114 aa"
/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89947.1"

/db-xref="GI:15979171"

/db-xref="SPTREMBL:Q8ZH11"

/translation="MITMRCKMLTAAAVMLAMLT

AGCSTLEKWYRPDINQGNYLSPI 

DASKIHKGMTQQQVAYTLGTPMLQDPFGTQTWFY 

VFRQQPGHEKITQQTLTLTFDSSG 

VLTDIKNEPALTGS " 

complement(128786..128818) /gene="YPO1104"

/note="PS00013 Prokaryotic
membrane lipoprotein lipid
attachment site."

complement(128959..130638) /gene="YPO1105"

complement(128959..130638) /gene="YPO1105"

/note=" Similar to Escherichia coli 
DNA repair protein RecN 
SW:RECN-ECOLI (P05824) (553 aa) 
fasta scores: E(): 0, 74.3% id in 
553 aa, and to Vibrio cholerae DNA 
repair protein RecN SW:RECN-VIBCH 
(P52118) (554 aa) fasta scores: 
E(): 0, 53.7% id in 553 aa"
/codon-start=1
/transl-table=11

/product="DNA repair protein RecN"

/protein-id="CAC89948.1"

/db-xref="GI:15979172"

/db-xref="SPTREMBL:Q8ZH10"

/translation="MYRGFTMLVQLTISNFAIVR

ELEIDFQPGMTAITGETGAGKSIA 

VDALGLCLGNRSDGSMVRLGATRADICARFSLAD 

TPSARQWLENNHLDDNNECLLRRA 

IGADGRSRGFINGTPVPVSQLRELGQHLIQIHGQ 

HAHQLLLKPDHQKQLLDAYANQSS 

LLAEMKAAYQIWHQSCRDLALHQQQSLERTARQE 

LLQYQLKELNS FS PQAGE YEQ ID I 

EYKRLANSGQLLSLSQQTLQLLSDDEQNNILSQL 

YAAKHQLTELAS^©EQFNNLLNML 

EEAS IQ I SEASDELRHYAEQLDMDPNRLYELEKR 

LSRQLNLARKHHVAPEELPQFHQQ 

LLDEQELLSQQENDHEQLSNAVNTHYQQALAIAK 



mi sc- feature 

gene 
CDS 



complement (130513 
36) 



complement (130707 , 
88) 

complement (130707 . 
88) 



misc- feature 



gene 
CDS 



complement (130746 , 
22) 



131712. .132290 
131712. .132290 



misc- feature 



misc- feature 



131778 . . 132278 



132141. .132272 



SMPHGKFAIETIFEPEHLSAEGATRIEFCVTTNP 
GQPLQALVKVASGGELSRIALAIQ 
VITARKMDTPALIFDEVDVGISGPTAAIVGRLLR 
QLGESTQVMCVTHLPQVAGCGHQH 
FFVSKQTDGTETETHMHRLDKKARLQELARLLGG 
SEVTKNTLANAKELLAA" 
1305 /gene="YPO1105" 

/note="PS00017 ATP/GTP-binding
site motif A (P-loop)."
1315 /gene="YPO1106" 

1315 /gene="YPO1106" 

/note="Similar to Escherichia coli
hypothetical protein YfjB
SW:YFJB-ECOLI (P37768) (292 aa)
fasta scores: E(): 0, 82.9% id in
293 aa, and to Vibrio cholerae
hypothetical protein VC0853
TR:Q9KTP8 (EMBL:AE004170) (294 aa)
fasta scores: E(): 0, 59.3% id in
290 aa"

/codon-start=1
/transl-table=11

/product="conserved hypothetical

protein"

/protein-id="CAC89949.1"
/db-xref="GI:15979173"
/db-xref="SWISS-PROT:Q8ZH09"
/translation="MNNRRFDCIGIVGHPRHPAA
LATHEILYHWLKARGYAVMVEQQI 
AHDLNLTDAITGSLADIGQKADLAVWGGDGNML 
GAARVLARYDIKVIGVNRGNLGFL 
TDLDPDNALQQLSDVLEGEYLSEQRFLLETHVRR 
TNQQSRISTAINEWLHPGKVAHM 
lEFEVYIDDRFAFSQRSDGLIIATPTGSTAYSLS 
AGGPILTPTLDAIVLVPMFPHTLT 
ARPLVISSSSTIRLKFSHITSDLEISCDSQIALP 
IQEGEEVLIRRSDFHLNLIHPKDY 
S YFNTLSTKLGWSKKLF " 
1315 /gene="YPO1106" 

/note="Pfam match to entry PF01513 

DUF15, Domain of unknown function 

DUF15, score 397.30, E-value

1.5e-115" 

/gene="YPO1107" 

/gene="YPO1107" 

/note="Similar to Escherichia coli
GrpE protein SW:GRPE-ECOLI
(P09372) (197 aa) fasta scores:
E(): 0, 59.2% id in 179 aa, and to
Vibrio cholerae GrpE protein
TR:Q9KTP7 (EMBL:AE004170) (200 aa)
fasta scores: E(): 2.1e-28, 53.9%
id in 167 aa"
/codon-start=1
/transl-table=11

/product="heat shock protein GrpE"

/protein-id="CAC89950.1"

/db-xref="GI:15979174"

/db-xref="SPTREMBL:Q8ZH08"

/translation="MSSKEQKTPNEQVSEEMENT

AEQQVEATQETGECVDPRVAELEV 

QLSDALQRERESLLRAKAEVENIRRRTELDVEKA 

HKFALERFSSELLPVIDNLERALD 

TADKTNTELISMIEGVELTLKSLLDAVGKFGIEV 

VGETHVPFNPEVHQAMTMLESADH 

EPNHVMMVMQKGYTLNGRLLRPAMVAVSKAKS " 

/gene="YPO1107" 

/note="Pfam match to entry PF01025 
GrpE, GrpE, score 240.00, E-value 
3.2e-68"
/gene="YPO1107" 



complement (1323 98 . 
78) 



complement (1323 98 
78) 



complement (132455 . 
46) 



complement (132737 . 
75) 



134388..134777
134388..134777



134403. .134756 



signature. " 
1336 /gene="gltA" 

/note="synonyms: gluT, icdB,
YPO1108"
1336 /gene="gltA" 

/EC-number="4.1.3.7"
/note="Similar to Escherichia coli
citrate synthase GltA
SW:CISY-ECOLI (P00891) (427 aa)
fasta scores: E(): 0, 86.7% id in
427 aa, and to Vibrio cholerae
citrate synthase VC2092 TR:Q9KQA8
(EMBL:AE004283) (431 aa) fasta
scores: E(): 0, 78.0% id in 422
aa"

/codon-start=1
/transl-table=11

/product="citrate synthase GltA"

/protein-id="CAC89951.1"

/db-xref="GI:15979175"

/db-xref="SPTREMBL:Q8ZH07"

/translation="MADKKATLNLGEAAIELGVL

SPTLGTDVIDVRTLGSKGYFTFDP 

GFTSTASCESKITYIDGDQGILLHRGFPIAQLAK 

DSTYLEVCYILLYGETPTPEEYKT 

FKTTVTRHTMIHEQITRLFHGFRRDSHPMAVLCG 

VTGALAAFYHDALDVNNERHREIT 

AFRLLSKMPTVAAMCYKYSLGQPFVYPRNDLSYA 

GNFLHMMFSTPCEKYEVNPVLERA 

MDRILILHADHEQNASTSTVRTAGSSGANPFACI 

AAGIASLWGPAHGGANEACLKMLE 

EIKTVEHIPEFIRRAKDKNDSFRLMGFGHRVYKN 

YDPRATVMRETCHEVLEELKLNNS 

LLEVAMELEN I ALNDP YF I EKKL YPNVDFYSGI I 

LKAMGI PSSMFTVI FAIARTIGWI 

AHWNEMHNDGIKIARPRQLYTGYTERDFQSQLKK 
"

1335 /gene="gltA" 

/note="Pfam match to entry PF00285 
citrate-synt, Citrate synthase,
score 783.50, E-value 4.5e-246" 
1327 /gene="gltA" 

/note="PS00480 Citrate synthase 

signature." 

/gene="sdhC" 

/note="synonyms: cybA, YPO1109"
/gene="sdhC"

/note="Similar to Escherichia coli
succinate dehydrogenase cytochrome
b556 subunit SdhC SW:DHSC-ECOLI
(P10446) (129 aa) fasta scores:
E(): 0, 77.2% id in 127 aa, and to
Vibrio cholerae succinate
dehydrogenase, cytochrome b556
subunit VC2091 TR:Q9KQA9
(EMBL:AE004283) (130 aa) fasta
scores: E(): 0, 75.0% id in 124
aa"

/codon-start=1
/transl-table=11

/product="succinate dehydrogenase

cytochrome b-556 subunit"

/protein-id="CAC89952.1"

/db-xref="GI:15979176"

/db-xref="SPTREMBL:Q8ZH06"

/translation="MGKTVKKQRPVNLDLQTIRF

PVTAIASILHRVSGVITFVAVGIL 

LWLLGLSVSSQEGFMQAATIMNSFFVKFIFWGIL 

TALAYHICGGIRHLLMDFGYLEEN 

LAVGTRSAQVAMGLTLVLSVLAGVLVW"

/gene="sdhC" 



misc-feature 134412..134486



misc-feature 134472..134540



misc-feature 134601..134669



misc-feature 134637..134678



misc-f eature 

gene 
CDS 



134712. .134771 

134771. .135118 
134771. .135118 



misc-f eature 134819 . . 134887 



misc-f eature 134945 . . 135013 



misc-f eature 135041. .135109 



gene 
CDS 



135119. . 136885 
135119. .136885 



Sdh-cyt, Succinate dehydrogenase 
cytochrome b subunit, score 
165.60, E-value 8.3e-46"
/gene="sdhC" 

/note="PS01000 Succinate 
dehydrogenase cytochrome b subunit 
signature 1."
/gene="sdhC"

/note="one of 3 probable
transmembrane helices predicted
for YPO1109 by TMHMM2.0"
/gene="sdhC"

/note="one of 3 probable
transmembrane helices predicted
for YPO1109 by TMHMM2.0"
/gene="sdhC"

/note="PS01001 Succinate
dehydrogenase cytochrome b subunit
signature 2."
/gene="sdhC"

/note="one of 3 probable
transmembrane helices predicted
for YPO1109 by TMHMM2.0"
/gene="sdhD"

/note="synonym: YPO1110"
/gene="sdhD" 

/note=" Similar to Escherichia coli 
succinate dehydrogenase 
hydrophobic membrane anchor 
protein SdhD SW:DHSD-ECOLI 
(P10445) (115 aa) fasta scores: 
E(): 1.9e-32, 69.3% id in 114 aa,
and to Vibrio cholerae succinate
dehydrogenase, hydrophobic
membrane anchor protein VC2090
TR:Q9KQB0 (EMBL:AE004283) (114 aa)
fasta scores: E(): 1.1e-23, 57.4%
id in 115 aa"
/codon-start=1
/transl-table=11

/product="succinate dehydrogenase
hydrophobic membrane anchor
protein"

/protein-id="CAC89953.1"

/db-xref="GI:15979177"

/db-xref="SPTREMBL:Q8ZH05"

/translation="MVSNASALGRNGVHDWLLLR

ASAIVITLYVFYILGFWIVPDIT 

YEIWRGFFASHITKVFTLLTLLSILAHAWIGLWQ

VLTDYIKPLAIRLVLQLVWITLL 

VYLLYGTIWWGA" 

/gene="sdhD"

/note="one of 3 probable
transmembrane helices predicted
for YPO1110 by TMHMM2.0"
/gene="sdhD"

/note="one of 3 probable
transmembrane helices predicted
for YPO1110 by TMHMM2.0"

/gene="sdhD"

/note="one of 3 probable
transmembrane helices predicted
for YPO1110 by TMHMM2.0"
/gene="sdhA"

/note="synonym: YPO1111"

/gene="sdhA"

/EC-number="1.3.99.1"

/note=" Similar to Escherichia coli 

succinate dehydrogenase 

flavoprotein subunit SdhA 

SW:DHSA-ECOLI (P10444) (588 aa) 

fasta scores: E(): 0, 92.0% id in 

588 aa, and to Vibrio cholerae 

succinate dehydrogenase, 

flavoprotein subunit VC2089 



misc-feature 135245..135274



misc-feature 135440..136438



gene 
CDS 



136935. .137651 
136935. .137651 



misc-feature 136980..137177



misc-feature 137379..137414



fasta scores: E() : 0, 82.3% id in 
588 aa" 

/codon-start=1
/transl-table=11

/product="succinate dehydrogenase

flavoprotein subunit"

/protein-id="CAC89954.1"

/db-xref="GI:15979178"

/db-xref="SPTREMBL:Q8ZH04"

/translation="MNLPIREFDAVWGAGGAGM 

RAALQISQMGLSCALISKVFPTRS 

HTVSAQGGITVALGNTHEDNWEWHMYDTVKGSDY 

IGDQDAIEYMCKTGPEAVLELEHM 

GLPFSRLEDGSIYQRPFGGQSLNFGGGQAARTAA

AADRTGHALLHTLYQQNLKNHTTI 

FSEWYALDLVKNQDGAFVGCTAINIETGEWYFK 

ARATILATGGAGRIYQSTTNAHIN 

TGDGVGMALRAGVPVQDMEMWQFHPTGIAGAGVL 

VTEGCRGEGGYLLNKHGERFMERY 

APNAKDLAGRDWARSIMIEIREGRGCDGPWGPH 

AKLKLDHLGKDVLESRLPGILELS 

RTFAHVDPIKEPIPVIPTCHYMMGGIPTKVTGQA

ITVNEKGEDWIPGLFAVGEIACV

SVHGANRLGGNSLLDLWFGRAAGMHLQESLMEQ 

GASRDASESDIEASLARMNRWNNT

RSGEDPVEIRKALQACMQNNFSVFREGDAMAKGL

EELKTIRERLQNARLDDTSSEFNT

QRIECLELDNLMETAFSTAVSANFRTESRGAHSR 

FDFPDRDDVNWLCHSLYLPGTESM 

TRREVNMQPKLREAFPPKVRSY"

/gene="sdhA" 

/note="PS00504 Fumarate reductase 
/ succinate dehydrogenase 
FAD-binding site." 
/gene="sdhA" 

/note="Pfam match to entry PF00890 
FAD-binding-2, FAD binding domain,
score 733.90, E-value 7.1e-217" 
/gene="sdhB" 

/note="synonym: YPO1112"

/gene="sdhB" 

/EC-number="1.3.99.1"

/note="Similar to Escherichia coli

succinate dehydrogenase

iron-sulfur protein SdhB

SW:DHSB-ECOLI (P07014) (238 aa)

fasta scores: E(): 0, 90.8% id in

238 aa, and to Vibrio cholerae

succinate dehydrogenase,

iron-sulfur protein VC2088

TR:Q9KQB2 (EMBL:AE004283) (236 aa)

fasta scores: E(): 0, 80.2% id in

237 aa"

/codon-start=1
/transl-table=11

/product="succinate dehydrogenase

iron-sulfur protein"

/protein-id="CAC89955.1"

/db-xref="GI:15979179"

/db-xref="SPTREMBL:Q8ZH03"

/translation="MKLEFSIYRYNPDVDNAPHM

QDYTLDAEEGRDMMLLDALIQLKE 

KDPTLSFRRSCREGVCGSDGLNMNGKNGLACITP 

ISALQKGNKKIVIRPLPGLPWRD 

LWDMGQFYTQYEKIKPYLLNDGKNPPAREHLQS 

PEQRAKLDGLYECILCACCSTSCP 

SFWWNPDKFVGPAGLLAAYRFLIDSRDTETASRL

DDLDDAFSVFRCHSIMNCVSVCPK

GLNPTKAIGHIKSMLLQRSA" 

/gene="sdhB" 

/note="Pfam match to entry PF00111
fer2, 2Fe-2S iron-sulfur cluster 
binding domains, score 8.70, 
E-value 0.96" 
/gene="sdhB" 



gene 
CDS 



138007. .140814 
138007. .140814 



misc- feature 



gene 
CDS 



138625. .139599 

140844. .142067 
140844. .142067 



iron-sulfur binding region

signature."

/gene="sucA"

/note="synonym: YPO1113"
/gene="sucA"
/EC-number="1.2.4.2"
/note="Similar to Escherichia coli
2-oxoglutarate dehydrogenase E1
component SucA SW:ODO1-ECOLI
(P07015) (933 aa) fasta scores:
E(): 0, 87.8% id in 935 aa, and to
Vibrio cholerae 2-oxoglutarate
dehydrogenase, E1 component VC2087
TR:Q9KQB3 (EMBL:AE004282) (936 aa)
fasta scores: E(): 0, 73.8% id in
936 aa"

/codon-start=1

/transl-table=11

/product="2-oxoglutarate

dehydrogenase E1 component"

/protein-id="CAC89956.1"

/db-xref="GI:15979180"

/db-xref="SPTREMBL:Q8ZH02"

/translation="MQNGAMKAWLDSSYLAGANQ

SYIEQLYEDFLTDPGSVDDSWRSI 

FQQLPTTGVKPDQLHSQTREYFRRLAKDSTRYNS 

SINDPDIDAKQVKVLQLINAFRFR 

GHQHANLDPLGLWKQESVPDLDPSYHNLTEADFQ 

NTFNVGSFAIGKETMKLADLYAAL

KQTYCGSVGAEYMHITNTEEKRWIQQRIESWGK 

PTFSDAEKRRFLSELTAAEGLERY 

LGAKFPGAKRFSLEGGDSLVTMLKEMIRHAGKNG 

TREWLGMAHRGRLNVLINVLGKK 

PEDLFDEFAGKHKEHLGTGDVKYHQGFSSDVETE 

GGLVHLALAFNPSHLEIVSPWIG 

SVRARRDRLDEARSNMVLPITIHGDAAITGQGW

QETLNMSQARGYEVGGTVRIVINN 

QIGFTTSNPLDARSTQYCTDIAKMVQAPIFHVNA 

DDPEAVAFVTRLALDFRNTFKRDV 

MIDLVCYRRHGHNEADEPSATQPVMYQKIKKHPT 

PRKIYADKLIEQNIASLEDATEMV

NLYRDALDHGDCWEEWRPMNLQSFTWSPYLNHE

WDEAYPSKVEMKRLQELARRISSA 

PEAIEMQSRVAKIYSDRALMASGEKPFDWGGAET 

LAYATLVDEGIPIRLSGEDAGRGT

FFHRHAVIHNQKNGSVYVPLSNIHSGQGDFQVWD 

SVLSEEAVLAFEYGYATAEPRTLT 

IWEAQFGDFANGAQWIDQFISSGEQKWGRMCGL 

VMLLPHGYEGQGPEHSSARLERYL 

QLCAEQNMQVCIPSTPAQVYHMIRRQALRGMRRP

LIVMSPKSLLRHPLATSSLDELAN 

GSFLPAIGEIDELDPKGIKRWMCSGKVYYDLLE 

QRRKNGQTDVAIVRIEQLYPFPHQ 

AVQSVLEQYAHVHDFVWCQEEPLNQGAWYCSQHN 

FREVIPFGASLRYAGRPASASPAV

GYLSVHQKQQQALVNDALNVE"

/gene="sucA" 

/note="Pfam match to entry PF00676 
E1-dehydrog, Dehydrogenase E1
component, score 115.00, E-value 

1.5e-30" 
/gene="sucB" 

/note="synonym: YPO1114"

/gene="sucB" 

/EC-number="2.3.1.61"

/note=" Similar to Escherichia coli 

dihydrolipoamide 

succinyltransferase component of

2-oxoglutarate dehydrogenase 

complex SucB SW:ODO2-ECOLI

(P07016) (404 aa) fasta scores: 

E() : 0, 84.2% id in 406 aa, and to 

Vibrio cholerae 2-oxoglutarate 

dehydrogenase, E2 component, 

dihydrolipoamide



misc-feature 140853..141074

misc-feature 141402..142061

gene 142180..143346

CDS 142180..143346



misc-feature 142264..142773



TR:Q9KQB4 (EMBL:AE004282) (404 aa)
fasta scores: E(): 0, 75.1% id in
405 aa"

/codon-start=1
/transl-table=11
/product="dihydrolipoamide
succinyltransferase component of
2-oxoglutarate dehydrogenase
complex"

/protein-id="CAC89957.1"

/db-xref="GI:15979181"

/db-xref="SPTREMBL:Q8ZH01"

/translation="MSSVDINVPDLPESVADGSV

ATWHKKPGDSVKRDEVLVEIETDK 

VILEVPASQDGILDAILEDEGATVTSRQVLGRIR 

PSDSSGKPTEEKSQSTESTPAQRQ 

TASLEEESNETLSPAIRRLIAEHDLDATAIKGSG 

VGGRITREDVDSHLASRKSASAW

ADAKAVAAAAPVLAGRSEKRVPMSRLRKRVAERL 

LEAKNSTAMLTTFNEINMQPIMDL 

RKQYGEAFEKRHGVRLGFMSFYIKAWEALKRYP

EVNASIDGEDWYHNYFDVSIAVS

TPRGLVTPVLRDVDTLSMADIEKKIKELAVKGRD 

GKLKVEELTGGNFTITNGGVFGSL 

MSTPIINPPQSAILGMHAIKDRPMAVNGQWILP 

MMYLALSYDHRLIDGRESVGYLVT

VKEMLEDPARLLLDV"

/gene="sucB" 

/note="Pfam match to entry PF00364 
biotin-lipoyl, Biotin-requiring
enzymes, score 90.40, E-value
3.7e-23"
/gene="sucB"

/note="Pfam match to entry PF00198
2-oxoacid-dh, 2-oxo acid
dehydrogenases acyltransferase
(catalytic domain), score 364.60,
E-value 1e-105"
/gene="sucC" 

/note="synonyms: b0728, YPO1115"
/gene="sucC"
/EC-number="6.2.1.5"
/note="Similar to Escherichia coli
succinyl-CoA synthetase beta chain
SW:SUCC-ECOLI (P07460) (388 aa)
fasta scores: E(): 0, 92.8% id in
388 aa, and to Vibrio cholerae
succinyl-CoA synthase, beta
subunit VC2085 TR:Q9KQB5
(EMBL:AE004282) (388 aa) fasta
scores: E(): 0, 77.1% id in 388
aa"

/codon-start=1
/transl-table=11

/product="succinyl-CoA synthetase
beta chain"

/protein-id="CAC89958.1"

/db-xref="GI:15979182"

/db-xref="SWISS-PROT:Q8ZH00"

/translation="MNLHEYQAKQLFARYGMPAP

TGYACTTPREAEEAASKIGAGPWV 

VKCQVHAGGRGKAGGVKLVNSKEDIRAFAEQWLG 

KKLVTYQTDANGQPVHQILVEAAT 

DIDKELYLGAVIDRSSRRWFMASTEGGVEIEKV 

AEETPELIHKIALDPLTGPQPYQG 

RELAFKLGLTGKQVGQFTKIFMGLATLFLERDLA 

MVEINPLWTKQGDLICLDGKLGA 

DGNALFRQPELREMRDPSQEDAREAHAAQWELNY 

VALDGNIGCMVNGAGLAMGTMDIV

KLHGGEPANFLDVGGGATKERVTEAFKIILSDDK

VKAVFVNIFGGIVRCDLIADGIIG 

AVEEVGVNVPVWRLEGNNAELGAKKLADSGLNI 

IAATSLTDAAQQWAAVGAK"

/gene=" sucC" 

/note="Pfam match to entry PF02222 



misc-feature 142915..143343



misc-feature 142948..143022
CDS



143346..144218
143346..144218



misc-feature 143742..144185



misc-feature 143799..143888



misc-feature 144051..144092
CDS



144985..146553
144985..146553



213.20, E-value 4e-60"
/gene="sucC"

/note="Pfam match to entry PF00549
ligase-CoA, CoA-ligases, score
280.20, E-value 2.6e-80"
/gene="sucC"

/note="PS01217 ATP-citrate lyase /
succinyl-CoA ligases family
signature 3."
/gene="sucD"

/note="synonym: YPO1116"
/gene="sucD"
/EC-number="6.2.1.5"
/note=" Similar to Escherichia coli 
succinyl-CoA synthetase alpha 
chain SW: SUCD-ECOLI (P07459) (288 
aa) fasta scores: E(): 0, 90.9% id 
in 287 aa, and to Vibrio cholerae 
succinyl-CoA synthase, alpha 
subunit VC2084 TR:Q9KQB6 
(EMBL:AE004282) (290 aa) fasta 
scores: E() : 0, 85.4% id in 287 
aa" 

/codon-start=1
/transl-table=11

/product="succinyl-CoA synthetase
alpha chain"

/protein-id="CAC89959.1"

/db-xref="GI:15979183"

/db-xref="SPTREMBL:Q8ZGZ9"

/translation="MSILIDKNTKVICQGFTGSQ 

GTFHSEQAIAYGTKMVGGVTPGKG 

GTQHLGLPVFNTVREAVETTGATASVIYVPAPFC 

KDSILEAIDAGIKLIICITEGIPT 

LDMLWKVRLEQSDARMIGPNCPGVITPGECKIG 

IMPGHIHLPGKVGIVSRSGTLTYE 

AVKQTTDIGFGQSSCVGIGGDPIPGSNFIDILKL

FQEDPQTEVIVMIGEIGGNAEEEA 

AAYIKEHVTKPWGYIAGVTAPKGKRMGHAGAII

AGGKGTADDKFAALEAAGVKTVRS 

LAEIGNAVKAVLPQ" 

/gene=" sucD" 

/note="Pfam match to entry PF00549 
ligase-CoA, CoA-ligases, score
225.20, E-value 9.4e-64"
/gene="sucD"

/note="PS01216 ATP-citrate lyase /
succinyl-CoA ligases family
signature 1."
/gene="sucD"

/note="PS00399 ATP-citrate lyase /
succinyl-CoA ligases family active
site."

/gene="cydA"

/note="synonym: YPO1117"

/gene=" cydA" 

/EC-number= "1.10.3.-" 

/note=" Similar to Escherichia coli 

cytochrome D ubiquinol oxidase 

subunit I SW:CYDA-ECOLI (P11026) 

(522 aa) fasta scores: E() : 0, 

86.6% id in 522 aa" 

/codon-start=1

/transl-table=11

/product="cytochrome D ubiquinol

oxidase subunit I"

/protein-id="CAC89960.1"

/db-xref="GI:15979184"

/db-xref="SPTREMBL:Q8ZGZ8"

/translation="MFDIVELSRLQFALTAMYHF

LFVPLTLGLAFLLAIMESVYVLSG

KQIYKDMTKFWGKLFAINFALGVATGLTMEFQFG 

TNWSYFSHYVGDIFGAPLAIEGLM 

AFFLESTFVGLFFFGWDRLTKHQHLAVTWLVALG 

SNFSALWILVANGWMQNPIASDFN 



misc-feature 145003..146517



misc-feature 145042..145110



misc-feature 145144..145212



misc-feature 145270..145338



misc-feature 145357..145425



misc-feature 145537..145605



misc-feature 145639..145698



misc-feature 146143..146211



misc-feature 146245..146313



misc-feature 146380..146448
146568..147707
146568..147707



GAMFILGISSYYLLKGRDIPFAKR
SFAIAASFGLASVLSVIVLGDESGYEMGDVQKTK
LAAIEAEWETQPPPAAFTLFAIPN
QETMENRFAIQIPYALGLIATRSLDTPVIGLRDL
MSQHEVRIRNGIQAYSLLEQLRGG
NTDPAVRDAFNKAKQDLGYGMLLKRYTENVADAT
EEQIQLAAQDSIPRVLPLYFAFRI
MVACGFLMLLIIGLAFYNWRGRIGEKKWLLRAA
LYGIPLPWIAVEAGWFVAEYGRQP
WAIGEVLPTAVANSSVTAGEILFSMALICGLYTL
FLIAEMYLMFKFARLGPSSLKTGR
YHFEQPTAPVQEAR"
/gene="cydA"

/note="Pfam match to entry PF01654 
Bac-Ubq-Cox, Bacterial Cytochrome 
Ubiquinol Oxidase, score 1092.40, 
E-value 0" 
/gene="cydA"

/note="one of 9 probable
transmembrane helices predicted
for YPO1117 by TMHMM2.0"
/gene="cydA"

/note="one of 9 probable
transmembrane helices predicted
for YPO1117 by TMHMM2.0"
/gene="cydA"

/note="one of 9 probable
transmembrane helices predicted
for YPO1117 by TMHMM2.0"
/gene="cydA"

/note="one of 9 probable
transmembrane helices predicted
for YPO1117 by TMHMM2.0"
/gene="cydA"

/note="one of 9 probable
transmembrane helices predicted
for YPO1117 by TMHMM2.0"
/gene="cydA"

/note="one of 9 probable
transmembrane helices predicted
for YPO1117 by TMHMM2.0"
/gene="cydA"

/note="one of 9 probable
transmembrane helices predicted
for YPO1117 by TMHMM2.0"
/gene="cydA"

/note="one of 9 probable
transmembrane helices predicted
for YPO1117 by TMHMM2.0"
/gene="cydA"

/note="one of 9 probable
transmembrane helices predicted
for YPO1117 by TMHMM2.0"
/gene="cydB"

/note="synonym: YPO1118"

/gene="cydB"

/EC-number="1.10.3.-"

/note="Similar to Escherichia coli

cytochrome D ubiquinol oxidase

subunit II SW:CYDB-ECOLI (P11027)

(379 aa) fasta scores: E(): 0,

78.4% id in 379 aa"

/codon-start=1

/transl-table=11

/product="cytochrome D ubiquinol

oxidase subunit II"

/protein-id="CAC89961.1"

/db-xref="GI:15979185"

/db-xref="SPTREMBL:Q8ZGZ7"

/translation="MFDYEVLRFIWWLLIGVLLI

GFAVTDGFDMGVGILLRIIGKNDT

ERRIMINSIAPHWDGNQVWLITAGGALFAAWPMV

YAAAFSGFYIAMILVLAALFFRPV

GFDYRSKLENSRWRNMWDWGIFIGSFVPAWFGV

AFGNLLQGVPFHMNEYMRLFYTGN 



misc-feature 146580..147668



misc-feature 146586..146639



misc-feature 146799..146867



misc-feature 146928..146996



misc-feature 147054..147122



misc-feature 147183..147236



misc-feature 147354..147422



misc-feature 147441..147509



misc-f eature 



gene 
CDS 



147567..147635



147799..148131
147799..148131



misc-feature 147886..147939



misc-feature 147982..148041



misc-feature 148054..148122



ELHLRSRKAAQISALVMSVAFLLA

GIWLVKGIDGFVITSVLDTAAESNPMRKEVAHQA

GAWLINFNKYPILWALPALGVILP

LPTILLSRFEKGAWAFLFSSLTIACVILTAGVTM

FPFVMPSSTMPNVSLTMWDATSSL 

LTLKVMTIVAIIFVPIILLYTSWCYYKMFGRIDK 

EFIENNKHSLY" 

/gene="cydB"

/note="Pfam match to entry PF02322
Cyto-ox-2, , score 714.90, E-value
3.7e-211"
/gene="cydB"

/note="one of 8 probable
transmembrane helices predicted
for YPO1118 by TMHMM2.0"
/gene="cydB"

/note="one of 8 probable
transmembrane helices predicted
for YPO1118 by TMHMM2.0"
/gene="cydB"

/note="one of 8 probable
transmembrane helices predicted
for YPO1118 by TMHMM2.0"
/gene="cydB"

/note="one of 8 probable
transmembrane helices predicted
for YPO1118 by TMHMM2.0"
/gene="cydB"

/note="one of 8 probable
transmembrane helices predicted
for YPO1118 by TMHMM2.0"
/gene="cydB"

/note="one of 8 probable
transmembrane helices predicted
for YPO1118 by TMHMM2.0"
/gene="cydB"

/note="one of 8 probable
transmembrane helices predicted
for YPO1118 by TMHMM2.0"
/gene="cydB"

/note="one of 8 probable
transmembrane helices predicted
for YPO1118 by TMHMM2.0"
/gene="YPO1119"
/gene="YPO1119"

/note="Similar to Escherichia coli 
protein YbgE SW: YBGE-ECOLI 
(P37343) (97 aa) fasta scores: 
E() : 6e-22, 59.6% id in 89 aa, and 
to Vibrio cholerae hypothetical 
protein VC1841 TR:Q9KR06 
(EMBL:AE004260) (105 aa) fasta 
scores: E(): 4.3e-08, 34.5% id in 
84 aa" 

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89962.1"

/db-xref="GI:15979186"

/db-xref="SPTREMBL:Q8ZGZ6"

/translation="MKPTQQLKPLTMATTMLSDK

LYELMDKGPLRALSLVLAFAIiAFC 

VFWDPTRFAAATSSLEVWQEVFIVWAVCTGVIHG 

VGFRPKQVWLRAFFAPLPAIVILATGLFYFFA"

/gene="YPO1119"

/note="one of 3 probable

transmembrane helices predicted

for YPO1119 by TMHMM2.0"

/gene="YPO1119"

/note="one of 3 probable

transmembrane helices predicted

for YPO1119 by TMHMM2.0"

/gene="YPO1119"

/note="one of 3 probable 



gene 
CDS 



148263..148664
148263..148664



gene 
CDS 



148664..149350
148664..149350



misc-feature 148706..148774



misc-feature 148880..149296



misc-feature 149060..149128



misc-feature 149171..149239



gene 
CDS 



149363..149791
149363..149791



for YPO1119 by TMHMM2.0"

/gene="YPO1120"

/gene="YPO1120"

/note="Similar to Escherichia coli
hypothetical protein YbgC
SW:YBGC-ECOLI (P08999) (134 aa)
fasta scores: E(): 0, 72.2% id in
133 aa"

/codon-start=1
/transl-table=11

/product="conserved hypothetical
protein"

/protein-id="CAC89963.1"

/db-xref="GI:15979187"

/db-xref="SPTREMBL:Q8ZGZ5"

/translation="MSNTLFRWPVRVYFEDTDAG

GMVYHARYVAFYERARTEMLRQRN 

FHQQQLLSEHVAFAVRSMTVEYLAPARLDDMLEV 

QSEVTAMRGASLTFAQRILDSHGN 

LLSSAEVLIACIDPHQMKPRALPKSIVAEFK" 

/gene="tolQ" 

/note="synonym: YPO1121"
/gene="tolQ"

/note="Similar to Escherichia coli
TolQ protein SW:TOLQ-ECOLI
(P05828) (230 aa) fasta scores:
E(): 0, 89.4% id in 227 aa"
/codon-start=1
/transl-table=11
/product="TolQ colicin import
protein"

/protein-id="CAC89964.1"

/db-xref="GI:15979188"

/db-xref="SPTREMBL:Q8ZGZ4"

/translation="MADMNILDLFLQASLFVKLI 

MLVLMGFSIASWAIIIQRTRILNA

ATRDAEAFEDKFWSGIELSRLYQESQARRDTLSG 

SEQIFHSGFKEFARLHRANSHAPE 

AVIDGASRAMRISMNRELEALETHIPFLGTVGSI

SPYIGLFGTVWGIMHAFISLGAVK

QATLQMVAPGIAEALIATAIGLFAAIPAVMAYNR

LNQHVNKLEQNYDNFMEEFIAILH 

RQAFASTESK" 

/gene="tolQ" 

/note="one of 3 probable 
transmembrane helices predicted 
for YPO1121 by TMHMM2.0"
/gene="tolQ"

/note="Pfam match to entry PF01618
MotA-ExbB, MotA/TolQ/ExbB proton
channel family, score 220.60,
E-value 2.4e-62"
/gene="tolQ"

/note="one of 3 probable
transmembrane helices predicted
for YPO1121 by TMHMM2.0"
/gene="tolQ"

/note="one of 3 probable
transmembrane helices predicted
for YPO1121 by TMHMM2.0"
/gene="tolR"

/note="synonym: YPO1122"
/gene="tolR"

/note="Similar to Escherichia coli
TolR protein protein SW:TOLR-ECOLI
(P05829) (142 aa) fasta scores:
E(): 0, 83.1% id in 142 aa, and to
Vibrio cholerae TolR membrane
protein VC1838 TR:Q9KR09
(EMBL:AE004259) (146 aa) fasta
scores: E(): 5.5e-15, 39.4% id in
137 aa"

/codon-start=1
/transl-table=11
/product="TolR colicin import



misc-feature 149387..149788
149903..151069
149903..151069



misc-feature 149939..150007
151189..152481
151189..152481



/protein-id="CAC89965.1"

/db-xref="GI:15979189"

/db-xref="SPTREMBL:Q8ZGZ3"

/translation="MARVRGRKRRELKSEINIVP

LLDVLLVLLLIFMATAPIITQSVE 

VNLPDATDSKTVSSDDNPPVIVEVSGVGQYTVW 

DHQRMELLPSEQWAEAQARLKTN 

PKTVFLIGGAKEVPYDEIIKALNMLHQAGVTSVG

LMTQPI" 

/gene="tolR" 

/note="Pfam match to entry PF02472 
ExbD, , score 67.50, E-value 
2.8e-16" 
/gene="tolR" 

/note="1 probable transmembrane
helix predicted for YPO1122 by
TMHMM2.0"

/gene="tolA"

/note="synonym: YPO1123"
/gene="tolA" 

/note=" Similar to Escherichia coli 
TolA protein SW:TOLA-ECOLI
(P19934) (421 aa) fasta scores:
E(): 5.1e-27, 57.9% id in 423 aa,
and to Pseudomonas aeruginosa TolA
protein SW:TOLA-PSEAE (P50600)
(347 aa) fasta scores: E():
6.6e-09, 33.2% id in 319 aa"
/codon-start=1
/transl-table=11
/product="TolA colicin import
membrane protein"
/protein-id="CAC89966.1"
/db-xref="GI:15979190"
/db-xref="SPTREMBL:Q8ZGZ2"
/translation="MGKATEQNDKLNRAVIVSW
LHIILIALLIWGSLTQTTEMGGGG
AGGEVIDAVMVDPGAVTEQYNRQQQQQTDAKRAE 
QQRQKKAEQQAEELQQKQAAEQQR 
LKELEKERLQAQEDAKLAAEEQKKQVAEQQKQIA 
EQQKQAAEQQKIAAAAVAKAKEEQ 
KQAETAAAQAKAEADKIVKAQAEAQKKAEAEAKK 
EAAVAAAAKKQADADAKKAVEVAE 
KAAADAAEKKAAADAEKKAAAAKKVAAAAEAKKK 
AAAEAAASTDVDDLFGGLANAKNA 
PKSGSGAGAAAAGKGGGKKSGASGADISGYLGQI 
TGAIQSKFYDADLYKGRTCDLRIK
LAPDGLLIDVKAEGGDPALCQAAIAAAKQAKIPK
PPSTDVYEQFKNAPLVFKPQ"
/gene="tolA"

/note="1 probable transmembrane
helix predicted for YPO1123 by
TMHMM2.0"
/gene="tolB"

/note="synonym: YPO1124"
/gene="tolB" 

/note="Similar to Escherichia coli 
TolB protein precursor 
SW:TOLB-ECOLI (P19935) (430 aa)
fasta scores: E(): 0, 84.9% id in 
430 aa" 

/codon-start=1
/transl-table=11
/product="TolB colicin import
protein"

/protein-id="CAC89967.1"

/db-xref="GI:15979191"

/db-xref="SPTREMBL:Q8ZGZ1"

/translation="MKQAFRVALGFLVLWASVLH

AEVRIEITQGVDSARPIGWPFKW 

MGPGTPPEEIGAIVGADLRNSGKFNPIDAARMPQ 

QPSTAAEVTPAAWTALGIDAVWG 

QVQPSADGSYWSYQLVDTSGSAGSILAQNQYKV

TKQWLRYSAHTVSDEVFEKLTGIK 

GAFRTRIAYWKTNGGKFPHELRVSDYDGYNQFV 



gene 
CDS 



152532. .153038 
152532. .153038 



misc- feature 



misc- feature 



misc- feature 
CDS



152565..152597



152730..153017



152829..152963

153048..153857
153048..153857



misc-feature



153603..153704



FESGKSALVIQTLANGAIRQVASFPRHNGAPAFS 

PDGTKLAFALSKSGSLNLYVMDLA 

SGQISQVTDGRSNNTEPSWFPDSQNLAYTSDQGG 

RPQVYKVNINGGVPQRITWEGSQN 

QNADVSPDGKFLVLVSSNGGAQHIAKQDLETGAV 

QVLTDTLLDETPSIAPNGTMVIYS 

STQGLGSVLQLVSTDGRFKARLPATDGQVKFPAW 

SPYL" 

/gene="pal"

/note="synonyms: excC, YPO1125"
/gene="pal"

/note="Similar to Escherichia coli
peptidoglycan-associated
lipoprotein precursor Pal or ExcC
SW:PAL-ECOLI (P07176) (173 aa)
fasta scores: E(): 0, 84.4% id in
173 aa, and to Vibrio cholerae

peptidoglycan-associated
lipoprotein VC1835 TR:Q9KR12
(EMBL:AE004259) (172 aa) fasta
scores: E(): 0, 59.9% id in 172
aa"

/codon-start=1
/transl-table=11

/product="peptidoglycan-associated

lipoprotein Pal"

/protein-id="CAC89968.1"

/db-xref="GI:15979192"

/db-xref="SPTREMBL:Q8ZGZ0"

/translation="MQLNKVLKGLMLALPVLAVA

ACSSNKSANNDQSGMGAGTGTENG 

SNLSSEEQARLQMQELQKNNIVYFGFDKYDIGSD 

FAQMLDAHAAFLRSNPSDKVWEG 

HADERGTPEYNIALGERRASAVKMYLQGKGVSAD 

QISIVSYGKEKPAVLGHDEAAFAK NRRAVLVY" 

/gene="pal"

/note="PS00013 Prokaryotic
membrane lipoprotein lipid
attachment site."
/gene="pal"

/note="Pfam match to entry PF00691
OmpA, OmpA family, score 171.40,
E-value 1.5e-47"
/gene="pal"

/note="PS01068 OmpA-like domain."

/gene="YPO1126"

/gene="YPO1126"

/note=" Similar to Escherichia coli 
hypothetical 28.2 kDa protein in 
pal-lysT intergenic region 
precursor YbgF SW:YBGF-ECOLI
(P45955) (263 aa) fasta scores:
E(): 0, 70.0% id in 270 aa, and to
Pseudomonas aeruginosa periplasmic
protein TR:Q9RNV2 (EMBL:AF177774)
(274 aa) fasta scores: E():
6.6e-17, 34.2% id in 225 aa"
/codon-start=1
/transl-table=11
/product="putative exported
protein"

/protein-id="CAC89969.1"

/db-xref="GI:15979193"

/db-xref="SPTREMBL:Q8ZGY9"

/translation="MNSNFRRHLVGLSLLVGVAV

PWAATAQAPISNVGSGSVEDRVTQ

LERISNAHSQLLTQLQQQLSDSQRDVDSLRGQIQ 

ESQYQLNQWERQKQIYQQMESLS 

GGQGAQNSASAASGATADNTAAGSSGNADAGAAA 

STAAPAASTGDENSDYNVAVSLAL 

EKKQYDQAITVFQSFVKQYPKSTYQPNANYWLGQ 

LYYNKGKKDDAAYYYAVWKNYPK 

SPKSSEAMFKVGVIMQDKGQSDKAKAVYQQVIKQ 

YPNTDAAKQAQKRLSAL"

/gene="YPO1126"



misc-feature
tRNA
153714..153815

154048..154123

154151..154226

154255..154324
154693..155754
154693..155754
154777..155715

155876..156601
155876..156601



misc-feature 155933..155992



misc-feature 156011..156079



TPR, TPR Domain, score 8.70, 
E-value 5.6" 
/gene="YPO1126"

/note="Pfam match to entry PF00515

TPR, TPR Domain, score 11.80,

E-value 2.6"

/product="tRNA-Lys"

/note="tRNA Lys anticodon TTT,

Cove score 95.27"

/product="tRNA-Lys"

/note="tRNA Lys anticodon TTT, 

Cove score 95.27" 

/note="tRNA Pseudo anticodon TTT, 

Cove score 40.49" 

/gene="nadA" 

/note="synonyms: nicA, YPO1127"
/gene="nadA"

/note="Similar to Escherichia coli
quinolinate synthetase A NadA or
NicA SW:NADA-ECOLI (P11458) (347
aa) fasta scores: E(): 0, 82.1% id
in 346 aa"
/codon-start=1
/transl-table=11

/product="quinolinate synthetase
A"

/protein-id="CAC89970.1"

/db-xref="GI:15979194"

/db-xref="SWISS-PROT:Q8ZGY8"

/translation="MSEIFDVNAAIYPFPARPVP

LDTNEKAFYREKIKTLLKQRDAVL 

VAHYYTDPEIQALAEETGGCVADSLEMARFGNNH

PASTLLVAGVRFMGETAKILNPEK 

KVLMPTLNAECSLDLGCPVDEFTAFCDSHPDRTV 

WYANTSAAVKAKADWWTSSIAV

ELIEHLDSLGEKIIWAPDRHLGSYVQKKSGADVL

CWQGACIVHDEFKTQALARMKALY

PDAAVLVHPESPQAWDMADAVGSTSQLIQAAKT

LPQKTLIVATDRGIFYKMQQACPD

KELFEAPTAGEGATCRSCAHCPWMAMNGLRAIAE 

GLEQGGVMHEIHVDEELRQQALIP 

LNRMLDFANQLKLQVKGNA"

/gene="nadA" 

/note="Pfam match to entry PF02445 
NadA, , score 609.80, E-value 
1.6e-179" 
/gene="pnuC" 

/note="synonym: YPO1128"
/gene="pnuC"

/note="Similar to Escherichia coli
PnuC protein SW:PNUC-ECOLI
(P31215) (239 aa) fasta scores:
E(): 0, 76.1% id in 238 aa"
/codon-start=1
/transl-table=11

/product="integral membrane NMN

transport protein PnuC"

/protein-id="CAC89971.1"

/db-xref="GI:15979195"

/db-xref="SPTREMBL:Q8ZGY7"

/translation="MDFLSTGNILVHIPLGAGGY

DLSWIEAIGTLFGLLCIWFASKEK 

IINYLFGLINVTLFAVIFFQIQLYASLLLQLFFF 

GANIYGWYAWSKQTPDNQAELKIR 

WLSLPKALAWAAVCIAGIVLMTLHIDTVFAWLTR 

VAVTVMQSLGANVQMPELQPDAFP 

FWDSTMMVLSIAAMILMTRKYVENWLIWWIDVI

SVAIFAYQGVYAMALEYAFLTLIA

LNGSWLWIKSAARNHSRPLSTQG"

/gene="pnuC"

/note="one of 7 probable
transmembrane helices predicted
for YPO1128 by TMHMM2.0"
/gene="pnuC" 

/note="one of 7 probable 



156089. .156142 



156203 . .156271 



156347. .156406 



156425. .156484 



156494. .156553 



complement (156716 . 
54) 

complement (156716 . 
54) 



complement (156791. 
50) 



complement (157139, 
04) 



complement (157244 
09) 



complement (1573 55 , 
20) 



for YPO1128 by TMHMM2.0"
/gene="pnuC"

/note="one of 7 probable
transmembrane helices predicted
for YPO1128 by TMHMM2.0"
/gene="pnuC"

/note="one of 7 probable
transmembrane helices predicted
for YPO1128 by TMHMM2.0"
/gene="pnuC"

/note="one of 7 probable
transmembrane helices predicted
for YPO1128 by TMHMM2.0"
/gene="pnuC"

/note="one of 7 probable
transmembrane helices predicted
for YPO1128 by TMHMM2.0"
/gene="pnuC"

/note="one of 7 probable
transmembrane helices predicted
for YPO1128 by TMHMM2.0"
1576 /gene="YP01129" 

1576 /gene="YP01129" 

/note=" Similar to Escherichia coli 
hypothetical 34.7 kDa protein in 
pnuC-aroG intergenic region YbgR 
SW:YBGR-ECOLI (P75757) (313 aa)
fasta scores: E(): 0, 66.1% id in
307 aa, and to Bacillus subtilis
cation transport protein TR:O07084
(EMBL:U93876) (311 aa) fasta
scores: E(): 0, 41.4% id in 309
aa"

/codon-start=1

/transl-table=11

/product="putative cation

transport protein"

/protein-id="CAC89972.1"

/db-xref="GI:15979196"

/db-xref="SWISS-PROT:Q8ZGY6"

/translation= "MAVSTIFSQDSNSKRLLIAF 

AITTLFMVTEAIGGWLSGSLALLA 

DAGHMLTDSAALFIALMAVHFSQRKPDPRHTFGY 

LRLTTLAAFVNAAALLLIVILIVW 

EAVHRFFSPHEVMGTPMLIIAIAGLLANIFCFWI 

LHKGEEEKNINVRAAALHVLSDLL 

GSVGAMIAAIVILTTGWTPIDPILSVLVSVLILR 

SAWRLLKESFHELLEGAPQEIDIN 

KLRKDLCTNIYEVRNIHHVHLWQVGEQRLMTLHA 

QVIPPLDHDALLQRIQDYLLHHYR 

ISHATVQMEYQHCGTPDCGINQAAPADGHHRHHH

HE"

.1574 /gene="YP01129" 

/note="Pfam match to entry PF01545 
Cation-efflux, Cation efflux 
family, score 250.50, E-value 
2.3e-71" 
.1572 /gene="YP01129" 

/note="one of 5 probable 
transmembrane helices predicted 
for YP01129 by TMHMM2 . 0 " 
.1573 /gene="YP01129" 

/note="one of 5 probable 
transmembrane helices predicted 
for YP01129 by TMHMM2 . 0 " 
.1574 /gene="YP01129" 

/note="one of 5 probable 
transmembrane helices predicted 
for YP01129 by TMHMM2 . 0 " 
10)



complement(157541..157606)



join(158103..159062,
161022..161114)



join(158103..159062,
161022..161114)



misc- feature 



158112. .159092 



misc -feature 
misc- feature 
complement(159063..161016)

159063. .159090 



complement(159129..159911)

complement(159129..159911)



/note="one of 5 probable 
transmembrane helices predicted 
for YP01129 by TMHMM2.0" 
/gene="YP01129" 

/note="one of 5 probable 
transmembrane helices predicted 
for YPO1129 by TMHMM2.0"
/gene="aroG"

/note="synonym: YPO1130"

/pseudo

/gene="aroG"

/EC-number="4.1.2.15"
/note="Similar to Escherichia coli
phospho-2-dehydro-3-deoxyheptonate
aldolase, Phe-sensitive AroG
SW:AROG-ECOLI (P00886) (350 aa)
fasta scores: E(): 0, 86.6% id in
320 aa. The IS insertion occurred
near the C-terminal end (following
codon 320). This CDS is disrupted
by the insertion of IS100. The
insertion occurred near the
C-terminus. It is not clear
whether this insertion affects the
function of the protein."
/pseudo

/codon-start=1
/transl-table=11

/product="phospho-2-dehydro-3-deoxyheptonate
aldolase, phe-sensitive, AroG
(pseudogene)"
/db-xref="PSEUDO:CAC89973.1"
/db-xref="REMTREMBL:CAC89973"
/gene="aroG" 

/note="Pfam match to entry PF00793 
DAHP-synth-1, DAHP synthetase 
family, score 755.60, E-value 
2e-223" 
/pseudo 

/note="insertion sequence, IS100"
/gene="aroG"

/note="IS100 inverted repeat"
/pseudo

/gene="YPO1131"

/note="synonym: ypmt1.57c"
/gene="YPO1131"

/note="Similar to Escherichia coli
insertion sequence IS21 putative
ATP-binding protein, IstB
SW:ISTB-ECOLI (P15026) (265 aa)
fasta scores: E(): 0, 47.4% id in
249 aa. Also almost identical to
Yersinia pestis and Escherichia
coli putative IS100 transposase
ypmt1.57c TR:Q9R3L5
(EMBL:AL117211) (260 aa) fasta
scores: E(): 0, 99.6% id in 260
aa."

/codon-start=1

/transl-table=11

/product="insertion sequence

IS100, ATP-binding protein"

/protein-id="CAC89974.1"

/db-xref="GI:15979197"

/db-xref="SPTREMBL:Q9R3L5"

/translation="MMMELQHQRLMALAGQLQLE

SLISAAPALSQQAVDQEWSYMDFL 

EHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEY 



complement (159204 
43) 



complement (159561. 
84) 



complement (159908 , 
30) 

complement (159908 
30) 



complement (160052 . 
85) 



complement (160814 
79) 



complement ( 160823 , 
06) 



complement (16 0989 , 
16) 

complement (161296 
48) 



NENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRF 
TTAADLLLQLSTAQRQGRYKTTLQ 
RGVMAPRLLIIDEIGYLPFSQEEAKLFFQVI AKR 
YEKSAMILTSNLPFGQWDQTFAGD 
AALTSAMLDRILHHSHWQIKGESYRLRQKRKAG 
VIAEANPE" 
1597 /gene="YP01131" 

/note="Pfam match to entry PF01695 
IstB, IstB-like ATP binding 
protein, score 367.20, E-value 
1.7e-106" 
,1595 /gene="YP01131" 

/note="PS00017 ATP/GTP-binding
site motif A (P-loop)."
,1609 /gene="YP01132" 

/note= " synonyms : yl093, ypmtl" 
.1609 /gene="YP01132" 

/note="Similar to Escherichia coli
transposase for insertion sequence
element IS21 IstA SW:ISTA-ECOLI
(P15025) (390 aa) fasta scores:
E(): 3e-27, 33.1% id in 329 aa.
Identical to the previously
sequenced Yersinia pestis,
Yersinia pseudotuberculosis, and
Escherichia coli pesticin plasmid
insertion sequence transposase
Y1055 TR:P74993 (EMBL:U59875) (340
aa) fasta scores: E(): 0, 100.0%
id in 340 aa"
/codon-start=1
/transl-table=11
/product="transposase for
insertion sequence IS100"
/protein-id="CAC89975.1"
/db-xref="GI:15979198"
/db-xref="SPTREMBL:P74993"
/translation="MVTFETVMEIKILHKQGMSS
RAIARELGISRNTVKRYLQAKSEP 
PKYTPRPAVASLLDEYRDYIRQRIADAHPYKIPA 
TVI ARE IRDQGYRGGMT I LRAF IR 
SLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRS 
PLHVFVAVLGYSRMLYIEFTDNMR 
YDTLETCHRNAFRFFGGVPREVLYDNMKTWLQR 
DAYQTGQHRFHPSLWQFGKEMGFS 
PRLCRPFRAQTKGKVERMVQYTRNSFYI PLMTRL 
RPMGITVDVETANRHGLRWLHDVA 
NQRKHETIQARPCDRWLEEQQSMLALPPEKKEYD 
VHLDENLVNFDKHPLHHPLSIYDS FCRGVA" 
.1605 /gene="YP01132" 

/note="Pfam match to entry PF00665 
rve, Integrase core domain, score 
81.90, E-value 1.6e-22" 
.1608 /gene="YP01132" 

/note="Predicted helix-turn-helix
motif with score 2147 (+6.50 SD)
at aa 18-39, sequence
MSSRAIARELGISRNTVKRYLQ" 
.1609 /gene="YP01132" 

/note="Pfam match to entry PF00239 
recombinase, Site-specific
recombinases, score 25.70, E-value
4.7e-06"

.1610 /note="IS100 inverted repeat" 
. 1620 /gene="gpmA" 

/note="synonyms: gpm, YPO1133"



48)" 

/EC-number="5.4.2.1"
/note="Similar to Escherichia coli
phosphoglycerate mutase 1
SW:PMG1-ECOLI (P31217) (249 aa)
fasta scores: E(): 0, 88.0% id in
249 aa"
/codon-start=1
/transl-table=11
/product="phosphoglycerate mutase
1"
/protein-id="CAC89976.1"
/db-xref="GI:15979199"
/db-xref="SWISS-PROT:Q8ZGY5"
/translation="MAVTKLVLVRHGESQWNNEN
RFTGWYDVDLS EKGRS EAKAAGKL 
LKDEGFTFDFAYTSVLKRAIHTLWNILDELDQAW 
LPTEKTWKLNERH YGALQGLNKS E 
TAEKYGDEQVKQWRRGFAITPPALEKSDERFPGH 
DPRYAKLTDAELPTTESLALTIER 
VI P YWNDVI KPRIASGERVI I AAHGNSLRALVKY 
LDDLGEDEILELNIPTGVPLVYEF 
DENFKPIKHYYLGNADEIAAKAAAVANQGKAK " 

complement (161386 1620 /gene="gpmA" 

30) 

/note="Pfam match to entry PF00300 
PGAM, Phosphoglycerate mutase 
family, score 447.60, E-value 
1.1e-130"

complement (161998. .1620 /gene="gpmA" 
27) 

/note="PS00175 Phosphoglycerate
mutase family phosphohistidine
signature."

complement (162395 . .1627 /gene="psiF" 

30) 

/note="synonym: YPO1134"
complement (162395. .1627 /gene="psiF" 
30) 

/note="Similar to Escherichia coli
phosphate starvation-inducible
protein PsiF precursor psiF
SW:PSIF-ECOLI (P27295) (106 aa)
fasta scores: E(): 1.2e-15, 54.5%
id in 112 aa"
/codon-start=1
/transl-table=11
/product="putative
starvation-inducible protein"
/protein-id="CAC89977.1"
/db-xref="GI:15979200"
/db-xref="SPTREMBL:Q8ZGY4"
/translation="MRLFPLWLLSAGLLLSANVM

AADPPKTPS PAQAAQQQQMTDCNQ 

QASTQSLKGDERKNFMSQCLKAQTAPDGKALTPQ 

QQKMKSCNAEAAQKMLKGDERKTF 

MSTCLKKAA" 

complement (163023 . . 1640 /gene="galM" 

92) 

/note="synonym: YPO1135"

/pseudo 

complement (163 023 . .1640 /gene="galM" 
92) 

/EC-number="5.1.3.3"
/note="Similar to Escherichia coli
aldose 1-epimerase galM 
SW:GALM-ECOLI (P40681) (346 aa) 
fasta scores: E(): 0, 60.2% id in 
344 aa. There is a frameshift 
following codon 39. The frameshift 
occurs within a homopolymeric 
tract of 6G. The sequence has been 
checked and is believed to be 
correct" 
/pseudo 



misc-feature complement (163038 1639 

19) 



misc-feature complement (163533 . . 163 5 

62) 



gene complement (164 086 . .1652 

37) 

CDS complement (164 086. .1652 

37) 



misc-feature complement (164737 . . 1649 

64) 



misc-feature complement (164842 . . 1648 

77) 



misc-feature complement (165124 . .1651 

59) 



gene complement (165234 . . 1662 

86) 

CDS complement (165234 . .1662 

86) 



/transl-table=11
/product="aldose 1-epimerase
(pseudogene)"

/gene="galM" 

/note="Pfam match to entry PF01263 
Aldose-epim, Aldose 1-epimerase, 
score 377.60, E-value 1.3e-109" 
/pseudo 
/gene="galM" 

/note="PS00545 Aldose 1-epimerase 

putative active site." 

/pseudo 

/gene="galK" 

/note="synonyms: galA, YPO1137"
/gene="galK" 

/EC-number="2.7.1.6"
/note="Similar to Escherichia coli
galactokinase SW:GAL1-ECOLI
(P06976) (381 aa) fasta scores:
E(): 0, 72.5% id in 382 aa"
/codon-start=1
/transl-table=11
/product="galactokinase"
/protein-id="CAC89979.1"
/db-xref="GI:15979201"
/db-xref="SWISS-PROT:Q8ZGY3"
/translation="MSLKQHTQTIFRQQFDRESD
ITIKAPGRVNLIGEHTDYNDGFVL 
PCAINYETVISCGKRDDRQIRVIAADYENQQDIF 
SLDAPIVPHPEYRWADYVRGWKH 
LQMRNADFGGADLVI CGNVPQGAGLS S SASLEVA 
VGQALQSLYQLPLSGVELALNGQE 
AENQFVGCNCGIMDQLISALGKKDHALLIDCRTL 
ETRAVPMPENMAWI INSNIQRGL 
VDSEYNTRRQQCEAAARFFGVKALRDVEPSLFFS 
IQDELDPWAKRARHVI SENARTL 
AAADALAAGNLKLMGQLMQESHISMRDDFEITVP 
PIDRLVEIVKSVIGDQGGVRMTGG 
GFGGCI lALMPLELVEQVRTTVAQEYPAHSGGKK 
ETFYVCQASQGAGLC " 
/gene="galK" 

/note="Pfam match to entry PF00288 
GHMP-kinases, GHMP kinases 
putative ATP-binding proteins, 
score 83.90, E-value 2.5e-22" 
/gene="galK" 

/note="PS00627 GHMP kinases
putative ATP-binding domain." 
/gene="galK" 

/note="PS00106 Galactokinase
signature."

/gene="galT" 

/note="synonyms: galB, YPO1138"
/gene="galT" 

/EC-number="2.7.7.10"
/note="Similar to Escherichia coli
galactose-1-phosphate
uridylyltransferase GalT or GalB
SW:GAL7-ECOLI (P09148) (348 aa)
fasta scores: E(): 0, 82.4% id in
347 aa"
/codon-start=1
/transl-table=11
/product="galactose-1-phosphate
uridylyltransferase"
/protein-id="CAC89980.1"



misc- feature 



misc- feature 



gene 
CDS 



complement (165252 
83) 



complement (165783 , 
36) 



complement (166296 , 
12) 

complement (166296 . 
12) 



/db-xref="SPTREMBL:Q8ZGY2"
/translation="MTHFNPVDHPHRRYNPLKDQ 
WVLVSPHRAKRPWQGQQEAPATEN 
LPAHDPDCFLCPGNTRVTGDVNPDYSSTYVFTND 
FAALMPDTPDAPPSHDPLMRSQSA 
RGTSRVICFSPDHSKTLPQLTLPALEQWQTWQQ 
QSAELGKIYPWVQVFENKGAAMGC 
SNPHPHGQIWANSFLPNEAEQEDRLQQQYFQQHQ 
S PMLLD YVERERRDGQRTWETEH 
WLAWPYWAAWPFETLLLPKAAVLRLEDLSAEQR 
SDLAIALKKLTSRYDNLFSCSFPY 
SMGWHGAPYNANDNTHWQLHAHFYPPLLRSASVR 
KFMVGYELLAETQRDLTAEQAATUj 
LRAVSDVHYKEAGAKS " 
1662 /gene="galT" 

/note="Pfam match to entry PF01087
GalP-UDP-transf,
Galactose-1-phosphate uridyl
transferase, score 702.60, E-value
1.8e-207"
,1658 /gene="galT" 

/note="PS00117
Galactose-1-phosphate uridyl
transferase family 1 active site
signature."
.1673 /gene="galE" 



/note="synonyms: galD,
1673 /gene="galE" 



YPO1139"



misc- feature 



gene 
CDS 



complement (166314 
06) 



complement (167672 
90) 

complement (167672 
90) 



/EC-number="5.1.3.2"
/note="Similar to Escherichia coli
UDP-glucose 4-epimerase GalE or
GalD SW:GALE-ECOLI (P09147) (338
aa) fasta scores: E(): 0, 76.0% id
in 338 aa. Previously sequenced in
Yersinia pestis as TR:AAG22000
(EMBL:AF282311) (338 aa) fasta
scores: E(): 0, 100.0% id in 338
aa"
/codon-start=1
/transl-table=11
/product="UDP-glucose 4-epimerase"
/protein-id="CAC89981.1"
/db-xref="GI:15979203"
/db-xref="SWISS-PROT:Q9F7D4"
/translation="MYVLVTGGSGYIGSHTCVQL
lEAGYKPVILDNLCNSKSSVLARI 
HSLTGYTPELYAGDIRDRTLLDSIFAAHPIHAVI 
HFAGLKAVGESVNRPLEYYNNNVF 
GTLVLLEAMRAAQVKNL I FS S S ATVYGDQPQ I P Y 
VES FPTGS P S S P YGRSKLMVEQ I L 
QDVQLADPQWNMTILRYFNPVGAHPSGLMGEDPQ 
G I PNNLMP F I AQ VAVGRRE S LA I F 
GNGYPTPDGTGVRDY I HWDLADGHVAAMKTLHG 
KPGVHIFNLGAGVGHSVLQWAAF 
SKACGKPLAYHFAPRREGDLPAYWADATKAAEQL 
GWRVSRSLDEMAADTWHWQSKNPQ GYPD" 
1673 /gene="galE" 

/note="Pfam match to entry PF01370 
Epimerase, NAD dependent 
epimerase/dehydratase family, 
score 559.10, E-value 2.9e-164" 
.1684 /gene="YPO1140" 

.1684 /gene="YPO1140"

/note="Similar to Vibrio cholerae
hypothetical protein VC1074
TR:Q9KT31 (EMBL:AE004189) (276 aa)
fasta scores: E(): 0, 45.5% id in 
275 aa" 



misc- feature 

misc- feature 

misc- feature 

misc- feature 

misc- feature 

misc- feature 

misc- feature 

misc- feature 

misc -feature 

gene 
CDS 



complement (167681. 
46) 



complement (167768 . 
33) 



complement (16784 9 
99) 



complement (167939, 
95) 



complement (168011 , 
76) 



complement (168113 . 
78) 



complement (168239, 
04) 



complement (168326 . 
91) 



complement (168404 . 
69) 



complement (168707 . 
97) 

complement (168707 . 
97) 



/transl-table=11
/product="putative membrane
protein"
/protein-id="CAC89982.1"
/db-xref="GI:15979204"
/db-xref="SPTREMBL:Q8ZGY1"
/translation="MWGVLATSLLFLPLSQVLSL

LFLVS SLAMACYSGVLTFPATAFL 

LAI IILVLLLQKYRQRNGVAAGLELLLVLGVIAL 

FLHLVPGFHNLKVLDKVRTGPLSA 

PFTMYYNLDKALVPFILLACLPTLFKVKKHPSVG 

RMGWWLILSVPALLLLAVALGGL 

KIELHTPVWIGSFI I ANLFFVCLAEEALFRGYLQ 

QRLGQWLGSYPALVITALLFGSAH 

FAGGPLLMLFAALAGVIYGLAWLWSGRLWVAVAF 

HFALNLMHLLFFTYPLYLPH " 

1677 /gene="YPO1140" 

/note="one of 9 probable
transmembrane helices predicted
for YPO1140 by TMHMM2.0"

1678 /gene="YPO1140"

/note="one of 9 probable
transmembrane helices predicted
for YPO1140 by TMHMM2.0"
.1678 /gene="YPO1140"

/note="one of 9 probable
transmembrane helices predicted
for YPO1140 by TMHMM2.0"
.1679 /gene="YPO1140"

/note="one of 9 probable
transmembrane helices predicted
for YPO1140 by TMHMM2.0"
.1680 /gene="YPO1140"

/note="one of 9 probable
transmembrane helices predicted
for YPO1140 by TMHMM2.0"
.1681 /gene="YPO1140"

/note="one of 9 probable
transmembrane helices predicted
for YPO1140 by TMHMM2.0"
.1683 /gene="YPO1140"

/note="one of 9 probable
transmembrane helices predicted
for YPO1140 by TMHMM2.0"
.1683 /gene="YPO1140"

/note="one of 9 probable
transmembrane helices predicted
for YPO1140 by TMHMM2.0"
.1684 /gene="YPO1140"

/note="one of 9 probable
transmembrane helices predicted
for YPO1140 by TMHMM2.0"
.1701 /gene="modF" 

/note="synonyms: phrA, YPO1142"
.1701 /gene="modF" 

/note="Similar to Escherichia coli
putative molybdenum transport 
ATP-binding protein ModF or PhrA 
SW:MODF-ECOLI (P31060) (490 aa) 
fasta scores: E(): 0, 75.0% id in 
484 aa" 

/codon-start=1
/transl-table=11
/product="putative molybdenum



misc- feature 

misc- feature 
misc- feature 

misc -feature 

gene 
CDS 



complement (168761, 
42) 



complement (169298 , 
21) 



complement ( 169565 . 
13) 



complement (170069, 
92) 



complement (1703 07, 
98) 

c omp 1 ement (170307 
98) 



modF " 

/protein-id="CAC89983.1"
/db-xref="GI:15979205"
/db-xref="SPTREMBL:Q8ZGY0"
/translation="MSELQISQGCFRLSDTQTLI
LPALHIQAGDCWAFVGANGSGKSA 
LAKALSGELALLEGSRQSHFRHGVRLSFEQLQQL 
VNEEWQRNNTDLLSADEDDTGRTT 
AEI IQEAARDPQRCQQLAALFGISHLLSRRFKYL 
STGETRKTLLCQALMPQPDLLILD 
EPFDGLDVAARAQLAEMLSTLSAQGVTLVLVLNR 
FDDI PDFVQYVGVLADCHLTHMGP 
RQQILSEALIAQLAHSENLDGLSLPETEDPQHHL 
RIPKNEPLIILNDGWEYNDRPIL 
HNLSWQVNPGEHWQIIGQNGAGKSTLLSLITGDH 
PQGYSNDLTLFGRRRGSGETIWDI 
KRHIGYVSGSLHLDYRVSTRVRTVILSGFFDSIG 
LYQAASDRQQQLADQWLSLLGFST 
SVANQPFHSLSWGQQRLVLIARALVKHPALLILD 
EPLQGLDPLNRLLVRRFIDVMIGE 
GETQLLFVSHHAEDAPQCITHRLTFVPFGDIYHY 
QQEKSPSQPQVK" 
1693 /gene="modF"

/note="Pfam match to entry PF00005 
ABC-tran, ABC transporter, score 
155.20, E-value 1.1e-42"
,1693 /gene="modF" 

/note="PS00017 ATP/GTP-binding
site motif A (P-loop)."
.1701 /gene="modF" 

/note="Pfam match to entry PF00005 
ABC-tran, ABC transporter, score 
87.10, E-value 3.7e-22"
.1700 /gene="modF" 

/note="PS00017 ATP/GTP-binding
site motif A (P-loop)."
.1710 /gene="modE" 



/note="synonyms:
1710 /gene="modE" 



modR, YPO1143"



gene 
CDS 



171415. .171567 
171415. .171567 



/note="Similar to Escherichia coli
molybdenum transport protein
SW:MODE-ECOLI (P46930) (262 aa)
fasta scores: E(): 0, 70.5% id in
264 aa"
/codon-start=1
/transl-table=11
/product="molybdenum transport
protein ModE"
/protein-id="CAC89984.1"
/db-xref="GI:15979206"
/db-xref="SWISS-PROT:P58497"
/translation="MQAEILLTLKLQQRLFADPR

RIALLKQIQHTGSISQGAKLAGIS 

YKSAWDAINDMNTLSEEILVERATGGKGGGGAHL 

TRYGERLIQLYDLLATIQQKAFDT 

LKDDSLPLDSLLAAISRFSLQTSARNQFFGTLIE 

RDHQQVQQHVNILLSDGKTRLTAA 

ITQQSADRLQLSAGKEVLALIKAPWVKLVTDPAL 

AGAADNALPGTVAS I EPGNDHS EV 

IVTLTGGANLCSTQNNSELHALNLRVGSAVIAQF 

NADRVIIATLC" 

/gene="YPO1144"

/gene="YPO1144"

/note="Similar to Escherichia coli
hypothetical 5.3 kDa protein in
modE-modA intergenic region
precursor YbhT SW:YBHT-ECOLI
(P75759) (49 aa) fasta scores:
E(): 8.4e-11, 82.5% id in 40 aa.



171433. .171501 

171802. .172581 
171802. .172581 



171826. .171885 

172581. .173276 
172581. .173276 



172623. .172691 



many eukaryotic ATPase subunits 
eg. Mesembryanthemum crystallinum 
vacuolar H+ ATPase subunit C 
TR:Q9S953 (EMBL:UNKNOWN ACCESSION)
(76 aa) fasta scores: E(): 3.9,
54.5% id in 33 aa"
/codon-start=1
/transl-table=11
/product="putative membrane
protein"
/protein-id="CAC89985.1"
/db-xref="GI:15979207"
/db-xref="SPTREMBL:Q8ZGX9"
/translation="MFELLKSLVFAWMVPWMA

VILGLIYGLGEVFNVISKTGHPKE RNTLRS" 

/gene="YPO1144"

/note="1 probable transmembrane
helix predicted for YPO1144 by
TMHMM2.0"
/gene="modA"

/note="synonym: YPO1145"
/gene="modA"

/note="Similar to Escherichia coli
molybdate-binding periplasmic
protein precursor ModA
SW:MODA-ECOLI (P37329) (257 aa)
fasta scores: E(): 0, 71.1% id in
246 aa"
/codon-start=1
/transl-table=11
/product="molybdate-binding
periplasmic protein precursor"
/protein-id="CAC89986.1"
/db-xref="GI:15979208"
/db-xref="SPTREMBL:Q8ZGX8"
/translation="MKNQYGKVSYWVAGAVLLTA

FSGPAFAADKITVFAAASLTNALQ 

DIAVQYKQEKQVDWASYASSSTLARQIEQGAPA 

DLFISADQQWMDYAIDKQQIVANT 

RYTLLGNELVLIAPQDSQIDKVEIDKKTDWKKLL 

EGGRLAVGDPDHVPAGIYAKESLE 

NLGAWSTLAPEMARANNVRSAMALVERAEAPLGI 

VYGSDAVASKKVKWGIFPEASHK 

PVE YPMAI VKGHDNPTVTAFYDYLKS PAAAVI FK 

NYGFTPR" 

/gene="modA" 

/note="1 probable transmembrane
helix predicted for YPO1145 by
TMHMM2.0"
/gene="modB"

/note="synonyms: chlJ, YPO1146"
/gene="modB"

/note="Similar to Escherichia coli
molybdenum transport system
permease protein ModB or ChlJ
SW:MODB-ECOLI (P09834) (229 aa)
fasta scores: E(): 0, 76.3% id in
228 aa"
/codon-start=1
/transl-table=11
/product="molybdenum transport
system permease protein ModB"
/protein-id="CAC89987.1"
/db-xref="GI:15979209"
/db-xref="SPTREMBL:Q8ZGX7"
/translation="MILSEYEWQAIILSLKVSGV

AVACSLPLGILMAWVLVRCRFPGK 

SLLDSVIHLPLVLPPWIGYLLLISMGRRGFIGE 

WLYSWFGINFSFSWRGAALASAW 

AFPLMVRAIRLALEAVDTRLELAARTLGATPWRV 

FFTITLPLSLPGVIAGTVLSFARS 

LGEFGATITFVSNIPGETRTIPLAMYTLIETPGA 

EAAAARLCVIAI ILSLVSLLLSEW 

LANWGKKRMGAPC" 

/gene="modB" 



misc- feature 



misc- feature 



misc-f eature 



172725. . 172793 



172851. .172919 



172920. .173150 



misc-f eature 



misc-f eature 



gene 
CDS 



172980. .173048 

173169. .173237 

173270. .174349 
173270. .174349 



misc-f eature 

misc-f eature 

misc -feature 

gene 
CDS 



173339, .173884 



173360. .173383 



173654. .173698 



complement (174405 , 
26) 

complement (174405 , 
26) 



transmembrane helices predicted
for YPO1146 by TMHMM2.0"
/gene="modB"

/note="one of 5 probable
transmembrane helices predicted
for YPO1146 by TMHMM2.0"
/gene="modB"

/note="one of 5 probable
transmembrane helices predicted
for YPO1146 by TMHMM2.0"
/gene="modB"

/note="Pfam match to entry PF00528
BPD-transp,
Binding-protein-dependent
transport systems inner membrane
component, score 78.90, E-value
1e-19"

/gene="modB"

/note="one of 5 probable
transmembrane helices predicted
for YPO1146 by TMHMM2.0"
/gene="modB"

/note="one of 5 probable
transmembrane helices predicted
for YPO1146 by TMHMM2.0"
/gene="modC"

/note="synonyms: chlD, narD,
YPO1147"

/gene="modC" 

/note="Similar to Escherichia coli
molybdenum transport ATP-binding
protein ModC or ChlD or NarD
SW:MODC-ECOLI (P09833) (352 aa)
fasta scores: E(): 0, 69.1% id in
349 aa"
/codon-start=1
/transl-table=11
/product="molybdenum transport
ATP-binding protein ModC"
/protein-id="CAC89988.1"
/db-xref="GI:15979210"
/db-xref="SPTREMBL:Q8ZGX6"
/translation="MLELNFSQQLGDLHLQVATD

LPAQGITAIFGLSGAGKTSLINVI 

GGLTRPQQGRVILNGRVLVDAEKNIYLPPEKRRV 

GYVFQDARLFPHYRVRGNLQYGMA 

ASMRGQFDAIVGLLGIEPLLNRFPFTLSGGEKQR 

VAIGRALLTAPELLLMDEPIiASLD 

LPRKRELLPYLERLAQDVNTPILYVSHSMDEILR 

LADQWVMDAGKVRAVGGLEEVWA 

SSALRPWLQREEPSS ILRVSVIGHHDRYAMTALA 

LGDQRLWVGKLDAAEGNSMRIRIN 

AAD VS LALQ P PH S S S I RN I L P VKVAECLD VDGQ V 

DVKLAI GEQWLWAR I TPWARDELG 

LKPGQWVYAQIKSVSFNRQNGPVPD " 

/gene="modC"

/note="Pfam match to entry PF00005
ABC-tran, ABC transporter, score
203.30, E-value 3.7e-57"
/gene="modC"

/note="PS00017 ATP/GTP-binding
site motif A (P-loop)."
/gene="modC" 

/note="PS00211 ABC transporters 
family signature." 
1752 /gene="YPO1148"

1752 /gene="YPO1148"

/note="Similar to Escherichia coli
hypothetical protein YbhA 
SW:YBHA-ECOLI (P21829) (272 aa) 
fasta scores: E(): 0, 64.0% id in 
272 aa, and to Vibrio cholerae 
hypothetical protein VCA0243 



fasta scores: E(): 0, 39.2% id in
273 aa"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAC89989.1"
/db-xref="GI:15979211"
/db-xref="SPTREMBL:Q8ZGX5"
/translation="MTYRIIALDLDGTLLDHKKR
ILPESLSALAQARAEGVKVIWTG 
RHHVAIHPFYQALQLDTPAICCNGTYIYDYQNKK 
VLDSNPLTPQQAVQVLQLLEKTQI 
HGLMYVDDAMLYQQITGHVTRTLSWAESLPPAQR 
PTFLQVNSLLDAAHSATAIWKFAT 
SHPDTAQLKAFAARVEAEMGLACEWSWHDQVD I A 
QAGNSKGKRLQQWVESQGLSMQEV 
lAFGDNFNDLSMLEAAGLGVAMGNSDDAIKQRAD 
LVITDNEQPGIAAVIRQHVLA" 
misc-feature complement(174447..175214) /gene="YPO1148"

/note="Pfam match to entry PF00592
DUF3, Cof family DUF3, score
280.90, E-value 1.6e-80"
misc-feature complement(174498..174566) /gene="YPO1148"

/note="PS01229 Hypothetical cof
family signature 2."

gene 175517..176521 /gene="YPO1149"

CDS 175517..176521 /gene="YPO1149"

/note="Similar to Escherichia coli
hypothetical protein YbhE
SW:YBHE-ECOLI (P52697) (331 aa)
fasta scores: E(): 0, 56.8% id in
329 aa, and to Buchnera aphidicola
(subsp. Acyrthosiphon pisum)
hypothetical protein BU293
SW:Y293-BUCAI (P57380) (334 aa)
fasta scores: E(): 0, 35.8% id in
330 aa"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAC89990.1"
/db-xref="GI:15979212"
/db-xref="SPTREMBL:Q8ZGX4"
/translation="MKQAVYVASPDSQQIHVWQL
DSAGELTLLQTVDVPGQVQPMAIS 
PNQRHLYVGVRPDFGIVSYHIADDGTLTAAGMAP 
LPGSPTHIDTDRQGRFLFSASYSF 
NCVS I S P IDTHGWQAP I QQLDDL PAPHSANIDP 
TNQILLVPCLKEDKVRLFDLSAEG 
QLTPHAQADITVAAGAGPRHMAFHPNHQVAYCVN 
ELNSSVDVYQISNNGQEYHLVQSL 
DAMPADFTGTRWAADIHITPNGRYLYISDRTANL 
LGI FTVS EDGRVI SLVGHHLTEAQ 
PRGFNIDHSGNFLIASGQKSDHIEVYRIDQNTGE 
LTTLKRYPVGKGPMWVSIRGAQNS " 
gene complement(176706..177974) /gene="bioA"

/note="synonym: YPO1150"
CDS complement(176706..177974) /gene="bioA"

/EC-number="2.6.1.62"
/note="Similar to many eg.
Escherichia coli
adenosylmethionine-8-amino-7-oxononanoate
aminotransferase BioA
SW:BIOA-ECOLI (P12995) (429 aa)
fasta scores: E(): 0, 74.6% id in
417 aa"
/codon-start=1
/transl-table=11
/product="adenosylmethionine-8-ami



misc- feature 



misc- feature 



gene 
CDS 



complement (176766 
75) 



complement (177138 
51) 



178073. .179110 
178073 . . 179110 



misc-f eature 

gene 
CDS 



178085. .179005 

179110. .180261 
179110. .180261 



aminotransferase" 
/protein-id="CAC89991.1"
/db-xref="GI:15979213"
/db-xref="SPTREMBL:Q8ZGX3"
/translation="MTPSDLVFDQQHIWHPYTSM 
TEPLPCYPWGAEGVELQLADGRR 
LIDGMSSWWAAIHGYNHPVLNFAAHQQLDKMSHV 
MFGGITHPPAVKLCRQLVAMTPPP 
LECVFLADSGSVSVEVALKMALQYWQAKGERRQR 
I LTLRHGYHGDTFGAMS VCDPQNS 
MHSLYQGYLAENLFANAPQCGFDDPWDPQDIANF 
VALITQHANEIAAVILEPWQGAG 
GMRIYHPSYLREVRALCDKHQILLIADEIATGFG 
RTGKLFACEHAQ I VPD I LCLGKAL 
TGGYLTLS ATLTTRAVAET I S KGDAGCFMHGPTF 
lyiANPLACAVASANLSLLAENSWQQ 
QVSKIEDQLKRELLPLAQEDTVADVRVLGAIGW 
EMKKPVNVARLQRS FVEQGVW I RP 
FGKLIYLMPPYIISQHALTRLTAAWDRGS " 
.1778 /gene="bioA" 

/note="Pfam match to entry PF00202 
aminotran-3, Aminotransferases
class-III pyridoxal-phosphate,
score 527.10, E-value 8.3e-183"
.1772 /gene="bioA"

/note="PS00600 Aminotransferases
class-III pyridoxal-phosphate
attachment site." 
/gene="bioB" 

/note="synonym: YPO1151"

/gene="bioB"

/EC-number="2.8.1.6"
/note="Similar to many eg.
Escherichia coli biotin synthase
BioB SW:BIOB-ECOLI (P12996) (346
aa) fasta scores: E(): 0, 84.6% id
in 344 aa"
/codon-start=1
/transl-table=11
/product="biotin synthase"
/protein-id="CAC89992.1"
/db-xref="GI:15979214"
/db-xref="SPTREMBL:Q8ZGX2"
/translation="MATYHHWTVGQALALFDKPL

LELLFEAQQVHRQHFDPRQVQVST 

LLSIKTGACPEDCKYCPQSSRYKTGLESERLMQV 

EQVLESAKKAKAAGSTRFCMGAAW 

KNPHERDMPYLAKMVEGVKALGMETCMTLGSLSK 

QQAHRLADAGLDYYNHNLDTSPEF 

YGSIITTRSYQERLDTLNEVRDAGIKVCSGGIVG 

LGETVRDRAGLLVQLANLPKPPES 

VP I NMLVKVKGTPLENNAE VDAFE F IRT I AVARI 

MMPS S YVRLSAGREQMNEQTQAMC 

FMAGANS I FYGCKLLTTPNPDEDKDLQLFRKLGL 

NPQQTATSHGDREQQQALTEQLLH 

GDTAQFYNAAV" 

/gene="bioB" 

/note="Pfam match to entry PF01792 
Biotin-synth, Biotin synthase, 
score 689.90, E-value 1.2e-203" 
/gene="bioF" 

/note="synonym: YPO1152"

/gene="bioF"

/EC-number="2.3.1.47"
/note="Similar to many eg.
Escherichia coli
8-amino-7-oxononanoate synthase
BioF SW:BIOF-ECOLI (P12998) (384
aa) fasta scores: E(): 0, 65.8% id
in 383 aa"
/codon-start=1
/transl-table=11
/product="8-amino-7-oxononanoate



/protein-id="CAC89993.1"
/db-xref="GI:15979215"
/db-xref="SPTREMBL:Q8ZGX1"
/translation="MSWQDKIAQGLQRRRDAAAY

RTRQVNEGANGRWLQSGERQYLNF 

SSNDYLGLSQNDEVIAAWQQGARRYGVGSGGSGH 

VTGYSQPHARLEQQLADWLGYPRA 

LLFISGYAANQAVLTALTDADDRILADKLSHASL 

LEAAAHSPAQLRRFQHNQPEALQN 

LLIKPCQGQTLWTEGVFSMDGDSAPLAALQQQT 

SAAGGWLLVDDAHGIGVHGEGGRG 

SCWLQGVQPELLWTFGKAFGLSGAAVLCQEPVA 

EYLLQYARHLIYSTAMPPAQACAL 

QAALRQVQQGDALRQQLQQRIRQFRTAAAHLPLQ 

LGASKTAIQPLLVGDNQQSLIWAE 

QLRAAGLWVTAIRPPTVPPGSARLRITLSAAHQP 

EDIDRLLEVLYGLCH " 

/gene="bioF" 

/note="Pfam match to entry PF00222
aminotran-2, Aminotransferases
class-II, score 337.40, E-value
1.6e-97"
/gene="bioF"

/note="PS00599 Aminotransferases
class-II pyridoxal-phosphate
attachment site."
/gene="bioC"

/note="synonym: YPO1153"
/gene="bioC" 

/note="Similar to many eg.
Escherichia coli biotin synthesis
protein BioC SW:BIOC-ECOLI
(P12999) (251 aa) fasta scores:
E(): 0, 49.6% id in 250 aa"
/codon-start=1
/transl-table=11
/product="biotin synthesis protein
BioC"
/protein-id="CAC89994.1"
/db-xref="GI:15979216"
/db-xref="SPTREMBL:Q8ZGX0"

/translation= "MASVTEHAQWGVGCLPPLNV 

DKQAIAAAFSRAAESYDSAANLQR 

ETGHRLVQLGQQHTGFWLDAGCGTGHFSQHWRL 

LGKRVIALDLAAGMLDYARQQQVA 

DDYLLGDIEHIPLPDQSVDICFSNLAVQWCSDLG 

AALSEFYRVTRPGGI ILFSTLAEG 

SLDELGQAWQQVDGQRHVNDFLPLQHIQTACQYY 

RHHLTTALYQPRFPNVIALMRSLQ 

GIGATHLHHGRQAGLQGRQRLAALQRAYVMQSGG 

YPLS YHMVYGVI YRD " 

/gene="bioD" 

/note="synonym: YPO1154"
/gene="bioD"
/EC-number="6.3.3.3"
/note="Similar to many eg.
Escherichia coli dethiobiotin
synthetase BioD SW:BIOD-ECOLI
(P13000) (224 aa) fasta scores:
E(): 0, 71.2% id in 229 aa"
/codon-start=1
/transl-table=11
/product="dethiobiotin synthetase"
/protein-id="CAC89995.1"
/db-xref="GI:15979217"
/db-xref="SWISS-PROT:Q8ZGW9"

/translation= "MTKRWFITGTDTDVGKTVAS 

CALLQAATAQGYRTAGYKPVASGS 

QMTADGLRNSDALALQANS SQRLGYSQVNPFTFL 

EATSPHIASESEGRAIPLTALSQG 

LRQLEPSADWILIEGAGGWFTPLSPQATFADWVQ 

QEQLPVIMWGVKLGCINHALLTA 

QAIQHAGLTLAGWVANEVTPAGRRQAEYQATLTR 

MITAPLLGIIPYLSDIEENPVTTR 

RDLGH YLDLTVLRAAEREAVNM " 



25) 

complement(181915..182625) /gene="YPO1155"

/note="Similar to Synechocystis sp
high-affinity branched-chain amino
acid transport ATP-binding protein
TR:Q55753 (EMBL:D64002) (249 aa)
fasta scores: E(): 5.8e-31, 43.3%
id in 238 aa, and to Escherichia
coli high-affinity branched-chain
amino acid transport ATP-binding
protein LivF SW:LIVF-ECOLI
(P22731) (237 aa) fasta scores:
E(): 5.1e-22, 36.3% id in 237 aa"
/codon-start=1
/transl-table=11
/product="putative amino acid
transporter"
/protein-id="CAC89996.1"
/db-xref="GI:15979218"
/db-xref="SPTREMBL:Q8ZGW8"
/translation="MLSLTAVNQYYGNNHILWDI
NLELPRGQCTCLIGRNGVGKTTLI 
NCIMGHLPIKSGTMIWQPHHEPPQNLQQQPVERR 
NALGIGYVPQGQQI FSQLSVEDNL 
LVAVLAGRHKSHPIPGWVFELFPLLHDKRSQRGG 
ELTRNQQQQLAIARALVAEPELLI 
LDEPGSGTSPALSEDISTVLHQLSRNLGMTLLLV 
EHRLPFIQHIADRFCLMAGGRJSrVA 
QGTLDQLNEGLI SEHLAR " 

complement(181984..182547) /gene="YPO1155"

/note="Pfam match to entry PF00005
ABC-tran, ABC transporter, score
135.40, E-value 1.1e-36"

complement(182503..182526) /gene="YPO1155"

/note="PS00017 ATP/GTP-binding
site motif A (P-loop)."

183715..185730 /gene="uvrB"

/note="synonym: YPO1156"

183715..185730 /gene="uvrB"

/note="Similar to many eg.
Escherichia coli excinuclease ABC
subunit B UvrB SW:UVRB-ECOLI
(P07025) (673 aa) fasta scores:
E(): 0, 85.9% id in 673 aa"
/codon-start=1
/transl-table=11
/product="excinuclease ABC subunit
B"
/protein-id="CAC89997.1"
/db-xref="GI:15979219"
/db-xref="SWISS-PROT:Q8ZGW7"

/translation="MSKSFKLHSVFKPAGDQPEA 

IRKLEEGLENGLAHQTLLGVTGSG 

KTFTVANVIADLNRPTMILAPNKTLAAQLYGEMK 

EFFPDNAVEYFVSYYDYYQPEAYV 

PSSDTFIEKDASVNEHIEQMRLSATKALLERRDV 

VWASVSAIYGLGDPDLYLKMMLH 

LTRGMIIDQRSILRRLSELQYSRNDQVFQRGTFR 

VRGEVIDIFPAESDEWALRVELFD 

EEVERLS I FDPLTGQLQHEVPRFTVYPKTHYVTP 

RER I LQAMEE I KVELAERRQVLIiA 

NNKLLEEQRLSQRTQFDLEMMNELGYCSGIENYS 

RYLSGRGPGEAPPTLFDYLPADGL 

LI VDESHVTI PQIGGMYKGDRSRKETLVEYGFRL 

PSALDNRPMRFEEFEALAPQTIYV 

SATPGKYELEKSGGDI lEQWRPTGLLDPLIEVR 

PVATQVDDLLSEIRIRAAINERVL 

VTTLTKRMAEDLTDYLSEHGAKVRYLHSDIDTVE 

RVEI IRDLRLGEFDVLVGINLLRE 

GLDMPEVSLVAILDADKEGFLRSERSLIQTIGRA 

ARNLNGKAILYGDRITASMEKAIG 

ETERRRAKQQAYNEERRI IPQGLNKKIGDILQLG 



misc- feature 
misc- feature 

misc-f eature 



gene 
CDS 



183829. .183852 



185092. .185352 



185605. .185712 



186022. .186267 
186022. .186267 



gene 
CDS 



complement (186398 , 
21) 

complement (186398 
21) 



misc-f eature 



complement (186416 
88) 



LSPKALDQKIRELEAKMYTYAQNLEFEQAAELRD 
QVHQLRQQFIAIS" 
/gene="uvrB"

/note="PS00017 ATP/GTP-binding
site motif A (P-loop)."

/gene="uvrB"

/note="Pfam match to entry PF00271
helicase-C, Helicases conserved
C-terminal domain, score 78.20,
E-value 1.7e-19"
/gene="uvrB"

/note="Pfam match to entry PF02151
UVR, UvrB/uvrC motif, score 58.00, 
E-value 2e-13" 
/gene="YPO1157"
/gene="YPO1157"

/note="Similar to Vibrio cholerae
hypothetical protein VC1816
TR:Q9KR31 (EMBL:AE004257) (71 aa)
fasta scores: E(): 6.9e-17, 64.7%
id in 68 aa, and to Vibrio marinus
genes, complete cds, similar to
eicosapentaenoic acid synthesis
gene cluster TR:Q9RA23
(EMBL:AB025342) (72 aa) fasta
scores: E(): 1.2e-16, 68.3% id in
63 aa"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAC89998.1"
/db-xref="GI:15979220"
/db-xref="SPTREMBL:Q8ZGW6"

/translation="MSEHLSKDPLHGITLEQLLT 

KLVENYGWDGLADRIQINCFISDP 

S IKSSLKFLRRTPWARQKVEALYIEMSDKAEWLK 

GQS" 

.1873 /gene="YPO1158"

.1873 /gene="YPO1158"

/note="Similar to Escherichia coli
hypothetical protein YbhK
SW:YBHK-ECOLI (P75767) (302 aa)
fasta scores: E(): 0, 77.7% id in
301 aa, and to Vibrio cholerae
hypothetical protein VC1023
TR:Q9KT82 (EMBL:AE004184) (296 aa)
fasta scores: E(): 0, 53.9% id in
297 aa"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAC89999.1"
/db-xref="GI:15979221"
/db-xref="SWISS-PROT:P58589"
/translation="MRNRTLADLERWALGGGHG
LGRVMSSLSSLGSRLTGIVTTTDN 
GGSTGRIRRSEGGIAWGDTRNCLNQLITEPSVAS 
AMFEYRFTGNGELAGHNLGNLMLK 
ALDHLSIRPIEAINLVRSLLKVDALLIPMSEQPV 
DLMAIDHEGHPIYGEVNIDQLAQM 
PQEMLLSPPVHATREAVEAINQADVILIGPGSFL 
TSLMPLLLLDELTQALRRSSASMI 
YIGNLGRELSPAAAALSLQDKLTIMESKIGRKI I 
DAVIVSPTIDISGVKDRIIVQQPL 
EAKD I PHRHDRELLRQALENTLQQLNGADS V " 
.1872 /gene="YPO1158"

/note="Pfam match to entry PF01933 
UPF0052, Uncharacterised protein 
family UPF0052, score 345.20, 
E-value 7.4e-100" 



/note="synonyms: bisA, chlA,
chlA1, narA, YPO1159"

187853..188833 /gene="moaA"

/note="Similar to Escherichia coli
molybdenum cofactor biosynthesis
protein A MoaA SW:MOAA-ECOLI
(P30745) (329 aa) fasta scores:
E(): 0, 78.2% id in 326 aa, and to
Vibrio cholerae molybdenum
cofactor biosynthesis protein A
VC1024 TR:Q9KT81 (EMBL:AE004184)
(334 aa) fasta scores: E(): 0,
65.0% id in 329 aa"
/codon-start=1
/transl-table=11
/product="molybdenum cofactor
biosynthesis protein A"
/protein-id="CAC90000.1"
/db-xref="GI:15979222"
/db-xref="SWISS-PROT:Q8ZGW5"
/translation="MVQLTDAFARKFYYLRLSIT
DVCNFRCTYCLPEGYRPDGVKSFL 
SLDEINRVSRAFALLGTEKIRLTGGEPSMRRDFT 
D II AT I RQNPAIRTLAVTTNG YRL 
VRDVAQWRDAGLTAINVSVDSLDPRQFHAITGQD 
KFYQVMQGIDAAFDAGFDKVKVNA 
VLMRDVNDRQLSAFLDWI KPRP IQLRFIELMETG 
EGGNLFRKHHVSGGVIRQQLLEQG 
WQQQDRARSDGPAQVFHHSDYQGEIGLIMPYEKD 
FCASCNRLRVSALGNLHLCLFGEQ 
GITLRDLLGSDDQQDELIARIQSALQTKKQTHFL 
HQGNSGITQNLSFIGG" 

187907..187942 /gene="moaA"

/note="PS01305 moaA / nifB / pqqE
family signature."

188012..188656 /gene="moaA"

/note="Pfam match to entry PF01444
MoaA-NifB-PqqE, moaA / nifB / pqqE
family, score 258.80, E-value
7.5e-74"

188934..189413 /gene="moaC"

/note="synonyms: chlA3, YPO1160"

188934..189413 /gene="moaC"

/note="Similar to Escherichia coli
molybdenum cofactor biosynthesis
protein C MoaC SW:MOAC-ECOLI
(P30747) (160 aa) fasta scores:
E(): 0, 87.9% id in 157 aa, and to
Vibrio cholerae molybdenum
cofactor biosynthesis protein C
VC1026 TR:Q9KT79 (EMBL:AE004184)
(160 aa) fasta scores: E(): 0,
78.5% id in 158 aa"
/codon-start=1
/transl-table=11
/product="molybdenum cofactor
biosynthesis protein C"
/protein-id="CAC90001.1"
/db-xref="GI:15979223"
/db-xref="SWISS-PROT:Q8ZGW4"
/translation="MTQLTHINTAGEAHMVDVSA
KNETVREARAEAFVDMQAATLAMI 
IDGSHHKGDVFATARIAGIQAAKKTWELIPLCHP 
LLLTKVEVKLEAQPEHNRVRIETC 
CRLTGKTGVEMEALTAASVAALTIYDMCKAVQKD 
MI IGPVRLLTKSGGKSGDFKVDI " 

188976..189383 /gene="moaC"

/note="Pfam match to entry PF01967
MoaC, MoaC family, score 306.40,
E-value 3.3e-88"

189410..189655 /gene="moaD"

/note="synonyms: chlA4, chlM,
YPO1161"

189410..189655 /gene="moaD"

/note="Similar to Escherichia coli



gene 
CDS 



189655. .190110 
189655. .190110 



misc- feature 



gene 
CDS 



189748. .190023 



190253. .190963 
190253. .190963 



factor, subunit 1 MoaD
SW:MOAD-ECOLI (P30748) (81 aa)
fasta scores: E(): 2.9e-23, 71.6%
id in 81 aa, and to Vibrio
cholerae molybdenum cofactor
biosynthesis protein D VC1027
TR:Q9KT78 (EMBL:AE004184) (81 aa)
fasta scores: E(): 2.4e-17, 63.0%
id in 81 aa"
/codon-start=1
/transl-table=11
/product="molybdopterin [mpt]
converting factor, subunit 1"
/protein-id="CAC90002.1"
/db-xref="GI:15979224"
/db-xref="SPTREMBL:Q8ZGW3"
/translation="MIQILFFAQVRELVGVDKLQ
LAAEFPTVEALRQSLCQRGERWQL 
ALEEGKLLTAVNQSLVSAQHPLAAGDEVAFFPPV 
TGG" 

/gene="moaE" 

/note="synonyms: chlA5, YPO1162"
/gene="moaE" 

/note="Similar to Escherichia coli
molybdopterin [mpt] converting
factor, subunit 2 MoaE
SW:MOAE-ECOLI (P30749) (149 aa)
fasta scores: E(): 0, 81.7% id in
120 aa, and to Vibrio cholerae
molybdenum cofactor biosynthesis
protein E VC1028 TR:Q9KT77
(EMBL:AE004184) (150 aa) fasta
scores: E(): 0, 68.3% id in 120
aa"
/codon-start=1
/transl-table=11
/product="molybdopterin [mpt]
converting factor, subunit 2"
/protein-id="CAC90003.1"
/db-xref="GI:15979225"
/db-xref="SWISS-PROT:Q8ZGW2"
/translation="MMENTRIRVGAEAFSVGDEY
TWLSQCDEDGAWTFTGKVRNHNL
GASVSALTLEHYPGMTEKALTEIIADARSRWSLQ
RVSVIHRVGPLFPGDEIVFVGVTS
AHRSMAFEAAEFIMDYLKTRAPFWKREATVEGER
WVESRDSDHIAAKRW"

/gene="moaE" 

/note="Pfam match to entry PF02391
MoeA, , score 123.70, E-value
3.4e-33"

/gene="YPO1163"
/gene="YPO1163"

/note="Similar to Escherichia coli
hypothetical 25.9 kDa protein in
moaE-rhlE intergenic region
SW:YBHL-ECOLI (P75768) (234 aa)
fasta scores: E(): 0, 73.3% id in
236 aa, and to Borrelia
burgdorferi conserved hypothetical
integral membrane protein BB0539
TR:O51489 (EMBL:AE001155) (232 aa)
fasta scores: E(): 0, 49.3% id in
215 aa"
/codon-start=1
/transl-table=11
/product="putative membrane
protein"
/protein-id="CAC90004.1"
/db-xref="GI:15979226"
/db-xref="SPTREMBL:Q8ZGW1"
/translation="MDRYPRSNGSIVERAGSGIQ
AYMAQVYGWMTCGLLLTAWAWYA
ANTPSIIFALQSNQILFFGLIIAQLGLVFVISGM
VNRLSGTAATSLFMLYSALTGLTL



misc-feature 190313..190381
misc-feature 190373..190960
misc-feature 190424..190492
misc-feature 190511..190579
misc-feature 190592..190651
misc-feature 190670..190738
misc-feature 190751..190810
misc-feature 190883..190951
misc-feature 191053..191764
gene 191198..191656
CDS 191198..191656
misc-feature 191252..191563
gene complement (191885,
88)



RDLSGMGSMLFMGLIGIILASLVN 

IWLKSPALMWWTYIGVLVFVGLTAYDTQKLKNL 

GAQLDVNDKDSFRKYSIVGALTLY 

LDFINLFLMLLRIFGNRR" 

/gene="YPO1163"
/note="one of 7 probable
transmembrane helices predicted
for YPO1163 by TMHMM2.0"
/gene="YPO1163"
/note="Pfam match to entry PF01027
UPF0005, Uncharacterized protein
family UPF0005, score 190.90,
E-value 2e-53"
/gene="YPO1163"
/note="one of 7 probable
transmembrane helices predicted
for YPO1163 by TMHMM2.0"
/gene="YPO1163"
/note="one of 7 probable
transmembrane helices predicted
for YPO1163 by TMHMM2.0"
/gene="YPO1163"
/note="one of 7 probable
transmembrane helices predicted
for YPO1163 by TMHMM2.0"
/gene="YPO1163"
/note="one of 7 probable
transmembrane helices predicted
for YPO1163 by TMHMM2.0"
/gene="YPO1163"
/note="one of 7 probable
transmembrane helices predicted
for YPO1163 by TMHMM2.0"
/gene="YPO1163"
/note="one of 7 probable
transmembrane helices predicted
for YPO1163 by TMHMM2.0"
/note="IS200-like insertion 
sequence: IS1541. Contains a 'G' 
at nucleotide position 315" 
/gene="tnp" 

/note="synonym: YPO1164"
/gene="tnp"

/note="Similar to Salmonella
typhimurium, and Salmonella typhi
transposase for insertion sequence
IS200 TnpA SW:T200-SALTY (Q57334)
(152 aa) fasta scores: E(): 0,
94.1% id in 152 aa. Identical to
the Yersinia pseudotuberculosis
IS1541 element transposase Tnp
TR:Q9X9F5 (EMBL:AJ238014) (152 aa)
fasta scores: E(): 0, 100.0% id in
152 aa"
/codon-start=1
/transl-table=11
/product="transposase for the
IS1541 insertion element"
/protein-id="CAC90005.1"
/db-xref="GI:15979227"
/db-xref="SPTREMBL:Q9X9F5"
/translation="MRDEKSLAHTRWNCKYHIVF
APKYRRQVFYREKRRAIGSILRKL
CEWKNVNILEAECCVDHIHMLLEIPPKMSVSGFM
GYLKGKSSLMLYEQFGDLKFKYRN
REFWCRGYYVDTVGKNTARIQEYIKHQLEEDKMG
EQLSIPYPGSPFTGRK"
/gene="tnp"

/note="Pfam match to entry PF01797
Transposase-17, Transposase IS200
like, score 236.10, E-value
5.1e-67"
1935 /gene="betA" 

/note="synonym: YPO1165"



88)' 



misc- feature 

misc- feature 

misc -feature 

gene 
CDS 



complement (192008 
82) 



complement (192770, 
14) 



complement (193274 
45) 



complement (193611 , 
83) 

complement (193611 . 
83) 



/EC-number="1.1.99.1"
/note="Similar to many eg.
Escherichia coli choline
dehydrogenase BetA SW:BETA-ECOLI
(P17444) (556 aa) fasta scores:
E(): 0, 79.4% id in 553 aa, and to
Homo sapiens choline dehydrogenase
ChdH TR:Q9NY17 (EMBL:AJ272267)
(482 aa) fasta scores: E(): 0,
50.1% id in 467 aa"
/codon-start=1
/transl-table=11
/product="choline dehydrogenase"
/protein-id="CAC90006.1"
/db-xref="GI:15979228"
/db-xref="SPTREMBL:Q8ZGW0"
/translation="MEYDYIIIGAGSAGNVLAAR
LTEDADVTVLLLEAGGPDYRLDFR
TQMPAALAFPLQGKRYNWAYETDPEPHMNNRRME
CGRGKGLGGSSLINGMCYIRGNAM
DFDHWASLSGLEDWSYLDCLPYFRKAETRDIGPN
DFHGGEGPVSVTTPKIGNNPLFHA
MVAAGVQAGYPRTDDLNGYQQEGFGPMDRTVTPK
GRRASTARGYLDQARPRNNLTIIT
HALTDRILFEGKRATGVRYLKGDAGTGQTAYARR
EVLLCGGAIASPQILQRSGIGPAE
LLQRLDIPLVQALPGVGENLQDHLEMYLQYSCKQ
PVSLYPALLWFNQPKIGIEWLFNG
TGVGASNQFEAGGFIRSRDAFTWPNIQYHFLPVA
INYNGSNAVKEHGFQAHVGSMRSP
SRGRIQVKSKDPRQHPSILFNYMSSEQDWHEFRD
AIRITREIIAQPALDPYRGREISP
GANVQNDDELDAFIREHAETAYHPSCSCKMGDDK
MAWDGQGRVHGVQGLRWDASIM
PQIITGNLNATTIMIAEKIADRIRGCQPLAKSNA
AYFIAGDTPARTSPVRHSLPVTSYP"
1935 /gene="betA" 

/note="Pfam match to entry PF00732 
GMC-oxred, GMC oxidoreductases, 
score 963.20, E-value 6.6e-286" 
,1928 /gene="betA" 

/note="PS00624 GMC oxidoreductases
signature 2."
.1933 /gene="betA" 

/note="PS00623 GMC oxidoreductases 
signature 1." 
.1950 /gene="betB" 

/note="synonym: YPO1166"
.1950 /gene="betB" 

/EC-number="1.2.1.8"
/note="Similar to Escherichia coli
betaine aldehyde dehydrogenase
BetB SW:DHAB-ECOLI (P17445) (489
aa) fasta scores: E(): 0, 76.7% id
in 489 aa, and to Homo sapiens
aldehyde dehydrogenase, E3 isozyme
AldH9 or AldH7 or AldH9A1
SW:DHAG-HUMAN (P49189) (493 aa)
fasta scores: E(): 0, 51.3% id in
483 aa"
/codon-start=1
/transl-table=11
/product="betaine aldehyde
dehydrogenase"
/protein-id="CAC90007.1"
/db-xref="GI:15979229"
/db-xref="SPTREMBL:Q8ZGV9"

/translation="MSRYGLQKLYINGAYTDSTS 

GDTFDAVNPANGECIAQLQAANAQ 



LRDRNDELAAIETADTGKPLSETR 

SVDIVTGADVLEYYAGLIPALEGQQIPLRGSAFV 

YTRREPLGWAGIGAWNYPLQIAL 

WKSAPALAAGNAMIFKPSEVTSLTALKLAGIYTE 

AGLPAGVFNVLTGSGDQVGQMLTE 

HPGIAKVSFTGGIASGKKVMANAAGSTLKDVTME 

LGGKSPLIIFADADLDKAADIAMM

ANFYSSGQVCTNGTRVFVPQALQAAFEQKIVERV 

KRIHIGDPSDERTNFGPLVSFQHR 

DSVMRYIDSGKREGATLLIGGYSLTEGALAHGAY 

VAPTVFTHCRDDMQIVREEIFGPV 

MSILSYQSEEEVIRRANDTEYGLAAGWTQDLNR

AHRVIHQLQAGICWINTWGESAPE

MPVGGYKHSGVGRENGISTLEHYTQIKSIQVELG 

SFNSVF" 

complement(193635..195044) /gene="betB"

/note="Pfam match to entry PF00171
aldedh, Aldehyde dehydrogenase
family, score 804.40, E-value
4.1e-238"

complement(194214..194249) /gene="betB"

/note="PS00070 Aldehyde
dehydrogenases cysteine active
site."

complement(194310..194333) /gene="betB"

/note="PS00687 Aldehyde 
dehydrogenases glutamic acid 

complement(195141..195737) /gene="betI"

/note="synonym: YPO1167"
complement(195141..195737) /gene="betI"

/note="Similar to Escherichia coli
regulatory protein of the
osmoregulatory choline-glycine
betaine pathway BetI SW:BETI-ECOLI
(P17446) (195 aa) fasta scores:
E(): 0, 64.1% id in 192 aa"
/codon-start=1
/transl-table=11
/product="TetR-family regulatory
protein"
/protein-id="CAC90008.1"
/db-xref="GI:15979230"
/db-xref="SPTREMBL:Q8ZGV8"
/translation="MPKVGMQPIRRQQLIEATMA
AVNEVGMHEASIAQIAKRAGVSNG
IISHYFRDKNGLLEATMRYLIRHLGEAVKQHLAA
LSVNDPRARLRAIAEGNFDDSQIN
SAAMKTWLAFWASSMHSPQLYRLQQVNNRRLYSN
LCAEFKRCLPREQAQLAAKGMAGL
IDGLWLRSALSGEHFNRQEALLIIHNYIEQQLNI
KYKC"

complement(195537..195584) /gene="betI"

/note="PS00038 Myc-type,
'helix-loop-helix' dimerization
domain signature."

complement(195558..195698) /gene="betI"

/note="Pfam match to entry PF00440
tetR, Bacterial regulatory
proteins, tetR family, score
53.90, E-value 2.3e-12"

complement(195570..195662) /gene="betI"

/note="PS01081 Bacterial
regulatory proteins, tetR family
signature."

complement(195585..195650) /gene="betI"



gene 196102..198123
CDS 196102..198123
misc-feature 196144..196212
misc-feature 196156..197589
misc-feature 196255..196323
misc-feature 196384..196452
misc-feature 196534..196602
misc-feature 196687..196755
misc-feature 196798..196866



motif with score 1758 (+5.17 SD)
at aa 30-51, sequence
ASIAQIAKRAGVSNGIISHYFR"
/gene="betT"

/note="synonym: YPO1168"
/gene="betT"

/note="Similar to Escherichia coli
high-affinity choline transport
protein BetT SW:BETT-ECOLI
(P17447) (677 aa) fasta scores:
E(): 0, 77.6% id in 678 aa, and to
Erwinia amylovora high-affinity
choline transport protein BetT
TR:AAG31040 (EMBL:AF264948) (676
aa) fasta scores: E(): 0, 80.0% id
in 681 aa"
/codon-start=1
/transl-table=11
/product="high-affinity choline
transport protein"
/protein-id="CAC90009.1"
/db-xref="GI:15979231"
/db-xref="SPTREMBL:Q8ZGV7"
/translation="MVTSDIDTKPQKDTLNPWF
FTSAGLILAFCLMTIFYTDLSNRW
IGITLNWVSATFGWYYLLAATLYIVFVIYIATSR
YGSIKLGPEQSKPEFSLVSWAAML
FAAGIGIDLMFFSVAEPVTQYMMPPEGQGQTLEA
ARQAMVWTLFHYGLTGWSMYALMG
IALGYFSYRYNLPLTIRSALYPIFGKRIDGPIGH
SVDIAAVIGTIFGIATTLGIGWQ
LNYGLNVLFHIPESLAVQAGLILLSWMAAVSVT
SGVNKGIRILSEVNVLLALGLILF
LLNALVLNVGDYINRFLGMTLNSFAFDRPTQWMN
SWTLFFWAWWVAWSPFVGLFLARI
SRGRTIRQFWGTLIIPFVFTLLWLSIFGNSALY
QILHGNIAFANEVMEFPERGFYSL
LAQYPGFTLSASVATITGLLFYVTSADSGSLVLG
NFTSRLADINNDSPNWLRTFWSIT
IGLLTLGMLMTDGVAALQNATVIMGLPFSFVIFF
VMVGLYKSLKIEDYRKASSQQTPT
PTLISGNEVLNWKQRLSRVMNYPGTTYTAKILDT
ICRPAMEEVAQELAVRGVKVEFNE
LPPKEDEQLNRLNHLELLVHLGDEQNFVYQIWPQ
RYSVPAFTYRARSGKSHYYRLETY
LLEGTQGNDLMDYSKEQIINNILDQYERHMNFLH
IQSEAPGNIMTFPDQ"
/gene="betT" 

/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"
/gene="betT"

/note="Pfam match to entry PF02028
BCCT, BCCT family transporter,
score 909.00, E-value 1.3e-269"
/gene="betT"

/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"
/gene="betT"

/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"
/gene="betT"

/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"
/gene="betT"

/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"
/gene="betT"

/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"



misc- feature 

misc- feature 
misc -feature 

misc -feature 

misc -feature 

misc -feature 



gene 
CDS 



197026. .197094 



197032 
197128 



197061 
197187 



197278. .197346 



197434. .197502 



197512. .197571 



198355. 
198355. 



.199248 
.199248 



misc- feature 



198388. .198816 



misc- feature 

misc- feature 

gene 
CDS 



198427. .198492 



198430. .198522 



complement (199286 , 
67) 

complement (199286 , 
67) 



/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"
/gene="betT"

/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"
/gene="betT"

/note="PS01303 BCCT family of
transporters signature."
/gene="betT"

/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"

/gene="betT"

/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"
/gene="betT"

/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"

/gene="betT"

/note="one of 12 probable
transmembrane helices predicted
for YPO1168 by TMHMM2.0"

/gene="YPO1169"

/gene="YPO1169"

/note="Similar to many eg.
Escherichia coli xanthosine operon
regulatory protein XapR
SW:XAPR-ECOLI (P23841) (294 aa)
fasta scores: E(): 0, 38.3% id in
287 aa"
/codon-start=1
/transl-table=11
/product="putative LysR-family
regulatory protein"
/protein-id="CAC90010.1"
/db-xref="GI:15979232"
/db-xref="SPTREMBL:Q8ZGV6"

/translation="MQKLLFSGRINLKMIRYFLA
VSDELHFGKAAERLHISQPPLSLQ
IKELEDALGFPLFIRDSRNWLTLxAGEMMKAEMK
DVFDNIENSLSRVSYIARHEQTHL
NIGIIGSALWHQLLEKFKHYKTVNPKTTWSLHEL
PPSKQYEALLNKKLDIGFWRCADL
EQNPALIYRRVEKQRVAVAVSNESLLVNKKILTL
KDLSGQTLIFLTFNHSGYSKNLYN
SCLNAGCTPQAIYQFDEPQTQLAFVNSNLGIALV
PESMQEIPWPNIKFIPLKEDLSAD
LFAVYHPDSVTPALNKLLIALF"

/gene="YPO1169"

/note="Pfam match to entry PF00126
HTH-1, Bacterial regulatory
helix-turn-helix protein, lysR
family, score 123.40, E-value
4.3e-33"
/gene="YPO1169"

/note="Predicted helix-turn-helix
motif with score 1460 (+4.16 SD)
at aa 25-46, sequence
LHFGKAAERLHISQPPLSLQIK"
/gene="YPO1169"
/note="PS00044 Bacterial
regulatory proteins, lysR family
signature."
.2001 /gene="YPO1170" 

,2001 /gene="YPO1170" 

/note="Weakly similar to Bacillus
subtilis hypothetical 35.8 kDa
protein in lplD-pel intergenic
region SW:YETK-BACSU (O31540) (330



complement (199292 
42) 



complement (199298 . 
78) 



complement (199358 
23) 



complement (199463 
28) 



complement (199559 . 
24) 



complement (199652 . 
17) 



complement (19974 8 . 
19) 



complement (199748 . 
04) 



complement (199820. 
70) 



complement (199883 
48) 



22.5% id in 298 aa, and to Erwinia
chrysanthemi PecM protein pecM
SW:PECM-ERWCH (P42194) (297 aa)
fasta scores: E(): 0.00048, 20.9%
id in 277 aa"
/codon-start=1
/transl-table=11
/product="putative membrane
protein"
/protein-id="CAC90011.1"
/db-xref="GI:15979233"
/db-xref="SPTREMBL:Q8ZGV5"
/translation="MTTRVLKAHLQMLGFTALLG
GSFXASATISNALPPMVITWLRYA
IASLFFWLLVSQGLLTLPHYRDLGRYTLISLPP
LLYFACMIFSLQTTSAINSSALYT
TVPLMSMVMSWFLNAKSTWPWAALLLGILGAL
LIIFKGDLSQVLRLSLMPSDYLFL
FGCLGMALNPIWKKLHRGEHALVLTGWSLICAT
LLLTVIIAYQLPEIEWRNISVITW
SGVLYLATFATALSFFLFQKACIVLSPTKVSGYV
YLIPLSVIVTNMMLGQTINWQEIA
SGAILVIIAMAILVKAK"
.1993 /gene="YPO1170" 

/note="one of 10 probable
transmembrane helices predicted
for YPO1170 by TMHMM2.0"
1996 /gene="YPO1170"

/note="Pfam match to entry PF00892
DUF6, Integral membrane protein
DUF6, score 41.70, E-value
1.6e-08"
.1994 /gene="YPO1170"

/note="one of 10 probable
transmembrane helices predicted
for YPO1170 by TMHMM2.0"
.1995 /gene="YPO1170"

/note="one of 10 probable
transmembrane helices predicted
for YPO1170 by TMHMM2.0"
,1996 /gene="YPO1170"

/note="one of 10 probable
transmembrane helices predicted
for YPO1170 by TMHMM2.0"
.1997 /gene="YPO1170"

/note="one of 10 probable
transmembrane helices predicted
for YPO1170 by TMHMM2.0"
.2001 /gene="YPO1170"

/note="Pfam match to entry PF00892
DUF6, Integral membrane protein
DUF6, score 54.80, E-value
1.9e-12"
.1998 /gene="YPO1170"

/note="one of 10 probable
transmembrane helices predicted
for YPO1170 by TMHMM2.0"
.1998 /gene="YPO1170"

/note="one of 10 probable
transmembrane helices predicted
for YPO1170 by TMHMM2.0"
.1999 /gene="YPO1170"

/note="one of 10 probable
transmembrane helices predicted
for YPO1170 by TMHMM2.0"



53)" 

/note="one of 10 probable
transmembrane helices predicted
for YPO1170 by TMHMM2.0"

complement(200066..200131) /gene="YPO1170"

/note="one of 10 probable
transmembrane helices predicted
for YPO1170 by TMHMM2.0"

200480..201343 /gene="xapA"

/note="synonyms: pndA, YPO1171"

200480..201343 /gene="xapA"

/EC-number="2.4.2.-"
/note="Similar to Escherichia coli
xanthosine phosphorylase XapA or
PndA SW:XAPA-ECOLI (P45563) (277
aa) fasta scores: E(): 0, 70.2% id
in 272 aa, and to Mus musculus
purine nucleoside phosphorylase
SW:PNPH-MOUSE (P23492) (289 aa)
fasta scores: E(): 0, 42.6% id in
265 aa"
/codon-start=1
/transl-table=11
/product="xanthosine
phosphorylase"
/protein-id="CAC90012.1"
/db-xref="GI:15979234"
/db-xref="SPTREMBL:Q8ZGV4"

/translation="MTTVNSNINVDADFNELPFQ
AVKYIQKIKPGFKPQIAFILGSGL
GDLVDQITNDTTISYADIPGFPVSSVHGHAGELV
LGDLCGVPVMCMKGRGHFYEGKGM
SIMTNPVRTFKLMGCEFLFCTNAAGSLRPEVLPG
SWMLKDHINTMPGTPLVGPNDDR
FGPRFFSLANAYDKDLRADMAKIAQQLDIPLTEG
VFVSYLGPCFETPAEIRMMQIIGG
DWGMSWPEVLSAAHCGLKVIALTAITNLAEGL
SDWLSHEQTLKFAKVASVNFTKL
IEAFLKSKALR"

200585..201220 /gene="xapA"

/note="Pfam match to entry PF00896 
Mtap-PNP, phosphorylases family 2, 
score 362.50, E-value 4.6e-105" 

201482..202126 /gene="xapB'"

/note="synonym: YPO1172"
/pseudo

201482..202126 /gene="xapB'"

/note="Gene interrupted by IS1541.
Similar to Escherichia coli
xanthosine permease XapB
SW:XAPB-ECOLI (P45562) (418 aa)
fasta scores: E(): 0, 62.5% id in
216 aa, and to Escherichia coli
nucleoside permease NupG nupG
SW:NUPG-ECOLI (P09452) (418 aa)
fasta scores: E(): 0, 49.6% id in
248 aa"
/pseudo

/codon-start=1
/transl-table=11
/product="xanthosine permease
(pseudogene)"
201500..201568 /gene="xapB'"

/note="one of 6 probable
transmembrane helices predicted
for YPO1172 by
/pseudo

201596..201664 /gene="xapB'"

/note="one of 6 probable
transmembrane helices predicted
for YPO1172 by TMHMM2.0"
/pseudo

201683..201742 /gene="xapB'"

/note="one of 6 probable



misc-feature 201770..201838
misc-feature 201875..201931
misc-feature 201959..202027
misc-feature 202127..202838
gene 202272..202730
CDS 202272..202730
misc-feature 202326..202637
gene 203145..204074
CDS 203145..204074
gene 204368..205312
CDS 204368..205312



for YPO1172 by TMHMM2.0"
/pseudo
/gene="xapB'"
/note="one of 6 probable
transmembrane helices predicted
for YPO1172 by TMHMM2.0"
/pseudo
/gene="xapB'"
/note="one of 6 probable
transmembrane helices predicted
for YPO1172 by TMHMM2.0"
/pseudo
/gene="xapB'"
/note="one of 6 probable
transmembrane helices predicted
for YPO1172 by TMHMM2.0"
/pseudo

/note="IS200-like insertion
sequence: IS1541. Contains a 'G'
at nucleotide position 315"
/gene="tnp"

/note="synonym: YPO1173"
/gene="tnp"

/note="Similar to Salmonella
typhimurium, and Salmonella typhi
transposase for insertion sequence
IS200 TnpA SW:T200-SALTY (Q57334)
(152 aa) fasta scores: E(): 0,
94.1% id in 152 aa. Also identical
to the Yersinia pseudotuberculosis
IS1541-like elements transposase
Tnp TR:Q9X9F5 (EMBL:AJ238014) (152
aa) fasta scores: E(): 0, 100.0%
id in 152 aa"
/codon-start=1
/transl-table=11
/product="transposase for the
IS1541 insertion element"
/protein-id="CAC90014.1"
/db-xref="GI:15979235"
/db-xref="SPTREMBL:Q9X9F5"
/translation="MRDEKSIAHTRWNCKYHIVF
APKYRRQVFYREKRRAIGSILRKL
CEWKNVNILEAECCVDHIHMLLEIPPKMSVSGFM
GYLKGKSSLMLYEQFGDLKFKYRN
REFWCRGYYVDTVGKNTARIQEYIKHQLEEDKMG
EQLSIPYPGSPFTGRK"
/gene="tnp"

/note="Pfam match to entry PF01797
Transposase-17, Transposase IS200
like, score 236.10, E-value
5.1e-67"

/gene="YPO1174"

/gene="YPO1174"

/note="no significant database
hits"
/codon-start=1
/transl-table=11
/product="hypothetical protein"
/protein-id="CAC90015.1"
/db-xref="GI:15979236"
/db-xref="SPTREMBL:Q8ZGV3"
/translation="MPQVNNISTNNIHSAGFNNS
NSIQKYTGAVSSISDDLRINNEKC
KSDIGTISGDIKINRHSAVYGNVNSVSGDITVKN
SIVDKDITTVSGDVNAVNSTIGKN
IKTVSGSIEVEQSTVSGNLETTSGGIDIDTTKIN
GNVHTTSGSISMNDSTIDGSVTCK
AGSVTIVNSTIKESLNVTSEKIIVGTASCIGKIN
ISPPESWFNIMNFGNDSIVMGMR
NFCISGEVNFTITNGKVFVNEQRVGHTASQSTSK
KVEEVTINIAKNASVNDIVFYTKK
CHIILEGNAKYNGEKKDGMQFTHVNAPKSHAYA"

/gene="YPO1175"

/gene="YPO1175"



Escherichia coli hypothetical
protein YohI SW:YOHI-ECOLI
(P33371) (315 aa) fasta scores:
E(): 0, 83.3% id in 311 aa, and to
Vibrio cholerae hypothetical
protein VC1105 TR:Q9KT00
(EMBL:AE004191) (323 aa) fasta
scores: E(): 0, 58.9% id in 309
aa"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAC90016.1"
/db-xref="GI:15979237"
/db-xref="SPTREMBL:Q8ZGV2"
/translation="MSMRVILAPMEGVLDSLVRE
LLSEVNDYDLCITEFLRWDQLLP
AKSFYRLCPELHNQSRTQSGTLVRIQLLGQYPEW
LAENAARAVALGSYGVDLNCGCPS
KLVNGSGGGATLLKDPELIYQGAKAMRAAVPAHL
PVTVKIRLGWDSGDRQFEIADAVQ
QAGATELAVHGRTKEDGYQAERINWQAIGEIRQR
LTIPVIANGEIWDYQSAQECMKVT
GCDAVMLGRGALNVPNLSRWKYNEPRMPWLEW
KLLQKYVQLEKQGDTGLYHVARIK
QWLGYLRKEYTEATDLFGEIRALKNSKDIALAIQ
RINR"

204383..205297 /gene="YPO1175"

/note="Pfam match to entry PF01207
UPF0034, Uncharacterized protein
family UPF0034, score 428.70,
E-value 5.2e-125"

204647..204703 /gene="YPO1175"

/note="PS01136 Uncharacterized
protein family UPF0034 signature."

205904..206839 /gene="pbpG"

/note="synonym: YPO1176"

205904..206839 /gene="pbpG"

/EC-number="3.4.99.
/note="Similar to Escherichia coli
penicillin-binding protein 7
precursor PBPG SW:PBP7-ECOLI
(P33364) (313 aa) fasta scores:
E(): 0, 73.5% id in 309 aa, and to
Vibrio cholerae D-alanyl-d-alanine
endopeptidase vca0870 TR:Q9KL77
(EMBL:AE004414) (316 aa) fasta
scores: E(): 0, 61.1% id in 314
aa"
/codon-start=1
/transl-table=11
/product="penicillin-binding
protein 7 precursor"
/protein-id="CAC90017.1"
/db-xref="GI:15979238"
/db-xref="SPTREMBL:Q8ZGV1"
/translation="MYVKIRFALLSFLLLSTGIS
VAPLAIARGSAVEVKGTAPLELAS
GSAMWDLQTNKVIYANNADKWPIASITKLMTA
MWLDAKLPLDEILSVDIDQTKEL
KGVFSRVRVNSEISRKDMLLLTLMSSENRAAASL
AHHYPGGYNAFIKAMNAKAKSLGM
NSTHYVEPTGLSINNVSTARDLAKLLMATKQYPL
IGQLSTTTEKMATFREPNYTLPFR
NTNHLVYNDKWNIQLTKTGFTNQAGHCLVMRTVI
GKRPVALWLDAFGKYTHFADANR
LRSWIETGKAAPIPGAAKSYRQQKDSQGRLAQVS
E"

205994..206764 /gene="pbpG"

/note="Pfam match to entry PF00768
Peptidase-S11, D-alanyl-D-alanine
carboxypeptidase, score 380.50,
E-value 1.7e-110"

complement(206960..208675) /gene="dld"



/note="synonym: YPO1177"
complement(206960..208675) /gene="dld"

/EC-number="1.1.1.28"
/note="Similar to Escherichia coli
D-lactate dehydrogenase Dld
SW:DLD-ECOLI (P06149) (570 aa)
fasta scores: E(): 0, 66.1% id in
558 aa, and to Neisseria
meningitidis D-lactate
dehydrogenase NMB0997 TR:Q9JZL7
(EMBL:AE002450) (563 aa) fasta
scores: E(): 0, 60.9% id in 558
aa"
/codon-start=1
/transl-table=11
/product="D-lactate dehydrogenase"
/protein-id="CAC90018.1"
/db-xref="GI:15979239"
/db-xref="SPTREMBL:Q8ZGV0"

/translation="MMSQFNNEKIQLLLTQLQHI
VGTRYLLTGERQTERYRTGFRSGK
GVALAWFPSTLLQQWQLLQACVAADTIIIMQAA
NTGLTEGSTPSGDDYDRPIVILNT
LRLNNIQLLDDGKQVIGFPGSTLNQLEKRLKPYG
REPHSVIGSSCIGASWGGICNNS
GGSLVQRGPAYTEMALYAQIDAQGELQLINHLGI
SLGNTPEEILQRLEQGQYGAEDIE
QTGQQASDSEYATRVRDIDATTPSRFNADPRRLF
EASGCAGKLAIFAVRLDTFPSEKQ
QQVFYIGTNQTQVLTELRRIILRDFKHLPIAGEY
LHRDIFDIAETYGKDTFVMINSMG
TNNMPRFFTLKGKIDARLNKIPHLVDHLTDRVMQ
GFSQILPNHLPKRLKTYRNQYEHH
LLLKMSGDGITEAQQHLRAFFATAEGNFIACTAD
EGKKAFLHRFAAAGAAVRYHAVHA
DQVEDILALDIALRRNEKQWFETLPPEIEKCLIA
KLYYGHFLCHVFHQDYIVKKGVDV
KALKEKMLSLLNDKGAEYPAEHNVGHLYLAKPAL
KDFYQQIDPTNSFNPGIGKTSKRKRWQQD"

complement(208031..208633) /gene="dld"

/note="Pfam match to entry PF01565
FAD-binding-4, FAD binding domain,
score 104.80, E-value 1.7e-27"

208882..209421 /gene="cyaB"

/note="synonym: YPO1178"

208882..209421 /gene="cyaB"

/EC-number="4.6.1.1"
/note="Belongs to a family of
thermophilic adenylate cyclases.
Similar to Aeromonas hydrophila
adenylate cyclase 2 CyaB TR:O69199
(EMBL:AJ223730) (190 aa) fasta
scores: E(): 8.9e-33, 50.9% id in
175 aa"
/codon-start=1
/transl-table=11
/product="adenylate cyclase 2"
/protein-id="CAC90019.1"
/db-xref="GI:15979240"
/db-xref="SPTREMBL:Q8ZGU9"
/translation="MSEHFVGKYEVELKFRVMDL
TTLHEQLVAQKATAFTLNNHEKDI
YLDANGQDLADQQISMVLREMNPSGIRLWIVKGP
GAERCEASNIEDVSKVQSMLATLG
YHPAFTIEKQRSIYFVGKFHITVDHLTGLGDFAE
IAIMTDDATELDKLKAECRDFANT
FGLQVDQQEPRSYRQLLGF"
208912..209418 /gene="cyaB"

/note="Pfam match to entry PF01928
Adenylate-cyc-2, Adenylate
cyclase, score 136.60, E-value
4.6e-37"

complement(209447..209788) /gene="YPO1179"



CDS complement(209447..209788) /gene="YPO1179"

/note="Similar to Escherichia coli
hypothetical protein YajD
SW:YAJD-ECOLI (P19678) (109 aa)
fasta scores: E(): 0, 84.7% id in
98 aa"
/codon-start=1
/transl-table=11
/product="conserved hypothetical
protein"
/protein-id="CAC90020.1"
/db-xref="GI:15979241"
/db-xref="SPTREMBL:Q8ZGU8"
/translation="MAYIPKNYSRLESGYREKAL
KIFPWVCGKCSREFVYSNLRELTV
HHIDHDHGNNPEDGSNWEMLCLFCHDHEHSKYTE
VDLYGSTWAGDDAQKDQGVATHN
PFANLKSLMKK"

SEQUENCE (SEQ) : 

1 actaatcagg ccaataaccc atgagatcga taacccatga gatcgatagc ccatgagatc 
61 gataacctat gagatcgata gcgaatcagg tcaaaaatag ccagcaccag agtgatggtc 
121 ttcacaacaa gtggtgagtg tgatcgcatg tctcatcggt tggataacaa ataaaataaa 
181 cagtaactac attactgaat catcttgttg tttttgatga atttaataaa ttattgtggt 
241 gtctactttt ctttttataa gcaagatgct tcaattttca ttgaattctg catgattaca 
301 gccaatccca cactccatat aaataagata tttgcacatt atctgtagtc tatttcttgt 
361 atttttgtgg gatttttcct tttttatatt atacgtctta atatttaaaa ttgcatattc 
421 agtgaattat gatttttaat ccttatattt tatttattgt ggttttttat atcgaatgcg 
481 gtgctattca tggcttttat acgccggtct gattagatgt tgatctacat tgttttgttg
541 catcgttctt gtgatgaaat ttaaaagaac gtataagttg gtatggtaag ttaaattgtg 
601 cagtttttcc tattttgcga ttttttaggc gcataacaat ttctttaaca gcatacatct 
661 aagtaatggg ctatttttca gggccttact gcatatgaat cacttttttt aaaatcttaa 
721 tcgcttattc attactgagt tggccaatga cacatgtctg tcatctgact tatgtgatgc 
781 gatcattatc aacttaattg acaaattaaa aatttgttaa ggctatatat gaatacaata 
841 tttaaagtga tttggaatgc atctttgaat gtctgggttg tggtgagtga gctggcaaaa 
901 ggccggataa aaaccaagag cagtcgtaac ttgatatcag agggtgtatt accaaaattt 
961 gaacaaagta tggtatcgaa actgttcaga aaaaaccttt tggccctatc tctgggcagt 
1021 attgtttttc taagcaccgg ccctgtattt gctgctgata ttactgtcag cacgcaggca 
1081 gagctttctg ccgcgttatc caatggtact tatgacaaaa ttattttagg tgctgacatt 
1141 acgctcattg ggagcctgac cgtcaatatg acgagtaatc aggttgtgat tgatggccaa 
1201 ggaaagtttg gtctaaccgt taacaatacg acaaattatg gtctggtggt gtcatcgggt 
1261 tcagggacct taacgctaca aaatatgtcc aaaattgact cggccaacta ctacagcatg 
1321 gtcgtgttga acggtgccaa tactgccgtc aatgtgattt ataacaatat tgatttcctg 
1381 ggttcctctc aacttattta tatgggggcc tatggggcgg ccaccaatag tataatgacc 
1441 tttggcgata tcttgaacga tgtggtggtc aatgaccgtg ctcaggaaat tggtgaagtc 
1501 aataaactgg cgtttaccgg tagatttcat gttacgcata cggggtcctc tgtgacatct 
1561 tttgtcagca ctggcggggc aaataataca tcaaccatgg attttgccag tggtgctgat 
1621 gtcaagatag accggacggg gtcgactgga gacctgacca gtaccggtgt taatgccttt 
1681 gcctatactt ttgctgatgg tgctagcttt gagttaattg ctaatcagaa tgttttcagc 
1741 ggaaccacta ccaatagagg cttggaaatt ggtagttata atagtatcga tggctttggt 
1801 tccggggtga aaatagttct gcaatctcgc tccgatggtt caattatttc aggtaatggg 
1861 attgataacg ccaccactaa tgccgctggt atcaataata atgccagtgg tgatgctaat 
1921 gttatttaca atctgggtac tggctctatt ttaaaggcca cgaatactgg gattttagcg 
1981 acaaaaaatg cgaataatgc cagtgatatt tatattcgct ctgcgggtga tatcactgct 
2041 gcaacgggta tttctgcaac acataacgga accggcaccg ttaaaatcaa gaatgatggc 
2101 accatcacca gtaccaccgc cgggatcgcc atttcttccg catcaataaa agagatcagc 
2161 gttgataata ctgacggtac catcacggcg actgccggta caggcgttaa cgtactggcc 
2221 agtgccatat taaatctgtt cggtggcacc attaatacct cggcgacggc caacggtatt 
2281 acgtttgcgg gcacggaggg tggccatacc ctgactgatt tgacgattaa tctgttgggt 
2341 acgggtattg ctttgtcaaa cgtggctggc gtcaatttga cgttaagtaa cgtaacttta 
2401 aacacactta atgggacggc gctaaatagc ctgacagggt taacgttagt cgatagcctt 
2461 aatgggcgta acaccattaa tattgaaggt gccggcatcg gaattgcggc gacaaatacc 
2521 gaattgaata ccttcgatgc agaggcatta gatattaatg ttaatggtgc cgggatcggc 
2581 attcaagcca ctggcggtgg tgtcaatctc agcgcttcta acttgatcat taatgttgcc 
2641 aatacattgg gtactgcgct gcaaatcacg gatgggatag ataataccac caccattggc 
2701 aacgaaatcc aacttaatgc cgagaatgcg acggcaatta acttcctcgg ctcctccagt 
2761 aaaacgttaa acaacaatgg caccattaaa ggcagtgtga tattcgcggg tgttgctgac 
2821 catatcatta ataacaatgg aacactggat ggcacattaa ccaccggtgc cggtaatgac 
2881 acgttagtgc tagacagtag ctcgcaaagt aacgatgtga ttaacctggg ggatggcaat 
2941 aatagcgtga ccattcagaa tggggcgaca gtgtcctcca ttatcactgg gaatggtaat
3001 gatactttca cgatcaatgg gatgagcgtg gggagtacct acctcggttc gctggatgcg 
3061 ggcacggggc tcaataccct aaatttcaat gcctcaaccg acgaactagc ggcagccacg 
3121 tcacttcagg gcttcaccaa tattaatctt gtcgacagcc atatcactct ggtgtctgac 
3181 gataatatcg gcagcggtat ggtcaatatc gatagcagta gtgagctgct atttggtagc 



3301 aacagcgcca acgtgtcgtt agagcaggcc 
3361 cagggcgggg cgctaaccgc cagtaacagt 
3421 gacggtacgt tgaacctgga caacattgca 
3481 acactgaatg tggcgaagaa ccttgccacc 
3541 ggggcgttta gtgggatcgt caatctgaca 
3601 gcggcggcac tggccagtgc caccttaaag
3661 accactgacc gcaccctgca cgggctggat
3721 gcggtgccgc agtctcagac cagcggggtt
3781 gggacggtca atatcaccgg ctctggtagc
3841 gtgtcgatcc ttgaacagga tcgtgctggc
3901 gtgaccggag atattgatgc cttggattta
3961 acgcaagggg tgcagtctgc catccagcag
4021 aattatggcc tggccagcag taacagcaat
4081 ctgagtgcgc tggagttgtt agccgatggc
4141 ggtttgacgg ccaacagagt actgaatgcc
4201 gatgcccaaa atggcgcctt aaccttggct
4261 accgtcacgg cgggtgagtt gatcctcggg
4321 ctggatatcg ccagtggggc cagtgccaat
4381 gtgaccaacg tggggacggt gacactgggc
4441 accaatggcg ggatccttga tttaaccggg
4501 tctactgtgg caggcggctt gaccggtgcc
4561 tcggtcagtg ccgccaacag tggcttgagc
4621 gtgacactga ccgataccgg cacgttaggt
4681 aacctgaacg gtgccaatgc agccatgacc
4741 accaacgcgg cagtcacgct gagcgggaat
4801 accgatggcg aactgaccgt gggacaggcc
4861 aatctgggca ccctcacttc tcatctgatc
4921 gttctgagcg gtgtggcggg ttcaacagtc
4981 acggccaata acagcggctt cctcggccaa
5041 gttgcgtcaa cgaacaatct gggggcgtca
5101 actctgtcgc tgagcggctt taacggcacc
5161 ctgcaagtga ccgatgatgc cgaggtcacc
5221 gtaaccattg atatcgctga tgcgacgcta
5281 gtgttgaccg gtaacggctt gctgaatgtg
5341 tttggttcga cggtgggcgg ggcgtttagt
5401 gctttaagtg cggataacgc ggcggcactg
5461 agtgtgacca ccgtcggcgc cactgaccgc 
5521 acgctgatct ttgacggttc accgccgcaa 
5581 gatctggcac tgaacagcgg gacgatcagt 
5641 catccggtga cgccaccgaa cgtgtcgctc 
5701 gagctgatta atgccgcgaa tgtcaccggc 
5761 ggcacagcca ttacctctgg tacgcaaggg 
5821 acggtagcca atgccatcca caactatggc 
5881 ggcctgtatg tgaattacac cctgagtgcg 
5941 ttgttgctgg cgaccgaaag cggtttgacg 
6001 ggtgttggcg gtttggtggt ggatgcccaa 
6061 aaccgttacg agggaacgac caccgtcacg 
6121 gcattcggcc agacgtcatt actgaatatc 
6181 taccgccaga cggtgggggc ggtgaccaat 
6241 gtgttaacca gtggcttgct gaccaatggc 

6301 aacctggctg cgggcggatc gtctaccgtg
6361 aatattaacg gcggtgattt ggcggtcagt
6421 catattgccg atgtggcctc ggtgaccctg
6481 gtcgaggtgc tgggtacgct gaacctgaac
6541 agtggcggcg gggtgattaa taccaacgcg 
6601 agcggcgcac atcagatcgg taccgatggc 
6661 ggggccagca gcgctacggt taatctgggc 
6721 gtcagcgaga gtattgccaa cgttctgagc 
6781 ggcggggcag atactgcact gacggccaat 
6841 gcgggtaaca gcaaactgac ggttgcgtca 
6901 gcgctggccg gggcgggcga tactctgtcg 
6961 agcgtcaccg gcagcggtgt actgcaagtg 
7021 agcaacgggg taggcaatac ggtgaaggtc 
7081 gatattgcac tgttcgacca tgtgctgacc 
7141 cttgccacca ccgcgtttga ctttggttcg 
7201 aatctgacca ataccacttt tgctttaagt 
7261 accttaaaac tgtcggatga cagtgtgacc 
7321 gggctggatt taaatggcgg tacgctgatc 
7381 aatggggtcg tcacggttac tgatctggca
7441 gcgggtaact gggagaatga acatccggtg 
7501 gatcggggtg acattctgct gcaactgatt 
7561 gatctggagc tgatgatcaa tggtaccact 
7621 gtccagcagg gcgggtatac ggtggcgaat 
7681 ggcggcagtg gcctgtatgt gaattacacc 
7741 gctaatgcgt tgttgctggc gaccgaaagc 



agtatgtttg ctggcacgtg gcaagttaat 
aaccaattag gttcggccaa gattgggttg 
ctattcaatc atgtgctgac cgggaacggc 
accgcgtttg actttggttc gacggtgggc 
aagactactt ttgctttaag tgcggataac 
ctgtcggatg acagtgtgac caccgtgggc 
ttgagtggcg gtacgctgat ctttgatggt 
gtcacggtca ctgacctggc actgaacagc 
tgggataaca ccgatccgct ggcaacaaat 
tcgacgctgg aactgattaa tgccactaat 
ctggtcaatg gcacggccat tacttctggt 
ggcggcagta cggtagccaa tgccatccac 
ggtgacagcg gcctgtatgt gaattacacc 
gctgatgcgt tgttgctggc gaccgaaagc 
gagctattcg gtgtcggcgg tttggtggtg 
aacggcagta accgttacga gggaacgacc 
gcgaatgggg ccttcggcca gacgtcatta 
attaatggct atagccagac ggtgggggcg 
agcggcgggg tgttaaccag cggcctgctg 
ggcgcactga acctgactgc gggcggggcc 
ggaaccctga atattaatgg cggtaatttg 
ggccagaccc atattgccga tgtggcctcg 
accagcgccg tcgaggtgct gggtacgctg 
aacgtactca gtggtgacgg gacgattaat 
aacagtttta gcggcgcaca tcagatcggt 
agtaatctgg gggccagcag cgccacggtt 
ttgaatggtg tcagcgagag tattgccaac 
gatattatcg gcggggcaga tactgcactg 
tatgccttgg cgggtaacag caaactgacg 
tccagtgtgg cgctggcagg ggcgggcgat 
tttggtaaca gcgtcaccgg cagcggtgta
ctgaccagca gcaacggggt aagcaacgcg 
aatctggacg atattgctct gtttaatcat 
gcgaaaaacg atgccagcac cgcgtttgac 
gggatcgtca atctgaccaa taccaccttt 
gcgcgtgcca ccttaaaact gtcggatgac 
accctgcacg ggctggattt aaatggcggt 
tctcaggcca atggggtcgt cacggttact 
attaccgggg cgggtaactg ggagaatgaa 
cttgagcagg atcggggtga cattttactg 
aatgccaaca atttggattt gctggttgat 
gtggagtctg ccatccagca gggcggtagc 
ctgaccagca gtaatggcaa tggtggcagc 
ctggagttgt tagccaatgg cgcgaatgcg 
gccaacagag tactgaatgc cgagttattc 
aatggcgcct taaccttggc taacggcaat 
gcgggggaat tgatcctcgg ggcgaatggg 
gccagtgggg ccagtgccaa tattaatggc 
agcggtgcgg taacgttggg caacggtggg 
gggatccttg atttaaccgg gggcgcactg 
gcaggcggct tgaccggtgc cggcaccctg 
gccaccaaca gtggcttgag cggccagacc 
accgggacgg gtacgttggg taccagcgcc 
ggtgccaatg cagccatgac caacgtactc 
gcggtcacgc tgagcgggaa taacagcttt 
gaactgaccg tgggacaggc cagtaatctg 
accctcactt ctcatctgat cttgaatggt 
ggtgtggcgg gttcaacagt cgatattatc 
aacagcggct tcctcggcca atatgccttg 
acgaacaatc tgggggcgtc atccagcgtg 
ctgagcggtt ttaacggcac ctttggtaac 
accgatgatg ccgaagtcac cctgaccagc 
gatatcgctg atgcgacgct gaacctgaac 
gggaatggca cattgaatgt ggcgaagaac 
acggtgggcg gggcctttag tgggatcgtc 
gcggataacg cggcggcact ggcgcgtgcc 
accgtcggca ccactgaccg catcctgcac 
tttgacggct cgccgccgca atctcaggcc 
ctgaacagcg ggacgatcag cattaccggg 
acgccaccga acgtgtcgct ccttgagcag 
gatgccgata atgtgaccgg caatgccaat 
atttccgcag gacaaggggt gcagtctact 
gctacgcata actatggcat gaccagcaat 
ctgagtgcgc tggagttgtt agccgatggc 
ggtttgacgg ccaacagaga gctgaatgcc 



7861 aacggcaata accgttacga gggaacgacc 
7921 gcgaatgggg ctttcggcca gacgtcatta 
7981 attaatggct atcgccagac ggtgggggcg 
8041 aacggtgggg agttaaccag tactgacacc 
8101 gatggcatcc tgaatctgga gaatgggggg 
8161 aacggtatcc tgaatatcaa gggtggcgat 
8221 gcggggcaaa ccaatatttc cgatggtgca 
8281 atcggaaccg gtaatttggg cagcagcgtt 
8341 gcggataatt cactggctaa cgtgatcagt 
8401 gtgacgctga gcgggaatag cagctttagt 
8461 ctgaccgtgg gtcaggccag taatctgggg 
8521 cttacttctc atctgatctt gaatggcgtt 
8581 gtggcgggtt caacagtcga tattatcggc 
8641 agcggcttcc tcggccaata tgccttggcg 
8701 aacaatctgg gggcgtcatc cagcgtggcg 
8761 agcggcttta acggcacctt tggtaacagc 
8821 gatgatgccg aagtcaccct gaccagcagc 
8881 atcgctgatg cgacgctaaa tctggacgat 
8941 aacggcttgc tgaatgtggc gaaaaacgat 
9001 gtgggcgggg ccttcaccgg cacagtcaac 
9061 aataacacca cagtattggc ccaggctacg 
9121 gtgggtaacg gtgtgcagaa tattggcaca 
9181 gataatattg ttgataacgc aggcattatc 
9241 atcaatacca ccgggggcgg tgaggttcgg 
9301 ctggatgggc tctcggtgat ggaactggat
9361 ggggcggcga cagggacagg ccatgagttg
9421 agcgcggtca cttatcaggg ggtccataac 
9481 tcgtttaatt acggcatgac cactggcgag 
9541 ctgaccgcgt tagaactatt gagcacgggc 
9601 gcgaataacg ggactcaatc taacgatctt 
9661 gcctttgcgt cggctaatga tggcagcacg 
9721 actggtacca cttgggtctc ttcaggtaat 
9781 caaacctcct tgctggcgat gagtaccgcc 
9841 gtggtgggtg agttagccac cgaagggggc 
9901 accgtaacgg ggggcggcca aattgatggt 
9961 agtggtggct tgctgaatgt ttcttatgat 
10021 gccaatggcg cggtggcaca cctgtctcag 
10081 aacaacggca cacttcatct ggataatact 
10141 agtgatggcg aggtattgct aagtaacaat 
10201 ggttactcgg gtctgtttac taatcaggcc 
10261 catttaggtg gcagcagcat cgccaatagc 
10321 tgggagttaa ccaataccat cagcggcacg 
10381 gtaaaaattg aaggcgatac ggttagcgca 
10441 caattgggca gttcggcggt tacccagaca 
10501 gcattgctgg tatcattcgc atcgaatatg 
10561 gctaacggct ccttaggggg atatggtcag 
10621 ctgattatgc caaatgcctt aacgggcggg 
10681 tacaccggtg atgaggggat gatcaccttc 
10741 acggatagac tggttattac cggggatacc 
10801 attgggggtg tcggggcgcg cacctttgaa 
10861 tctgccgggc agtttaccct gaatgggcgc 
10921 tatcaaggtg gggccagcac tccagatgac 
10981 gaccgccgcc ctgaaccggc gagttacacc 
11041 gtcaccagtc tggcggatag aatgggtgag 
11101 cagaagacca ctagcctgtg gctgcgtaac 
11161 agcggcgagt tgaaaactca ggataaccgt 
11221 caatggagcc gcaatgcaca ggacctgtgg 
11281 agcagcagtt ctaccgtggc acaggttgct 
11341 tacagcgtgg ggatctatgg ctcatggctt 
11401 gtcgattctt gggtgcaata cagttggttt 
11461 actgagaaat atgactcaaa aggctttacc 
11521 gttggcgaaa gtgttaacca gagctacttt 
11581 ggcgtaaaag ccgatgacca tacagaaacc 
11641 ggcaatatcc agacgcgact cggggcgaag 
11701 gtcagtggcc cggcattcaa gccttttgtt 
11761 tttggcacga cattagacgg tgtcacggtg 
11821 ctgaaactgg gcgttgatgg gcaggtaaat 
11881 cagcaagtgg gtaacaaggg ctacagcgaa 
11941 ttctaatttt atcgtcaggt aattaaccac 
12001 caggacagaa tatgcgccgt tttttatggg 
12061 tagatattcc atccacagat tatctaataa 
12121 atgagttagg gccatatcga tctcccccat 
12181 gcggttggca tccacatcgc ccgtcatccc 
12241 tgggtaaata ttctccagtg cgtggtactg 
12301 ctcaatacgt tgcaccacgc ttttccacgg 



accgtcacgg cgggggaatt gatcctcggg 
ctgaatatcg ccagtggggc cagtgcgaat 
gtgaccaata ccggtacggt aacgttgggc 
ttgatcaata ccggaatgat taatgtgacc 
gcttctagca ttagcggcgg cttaacgggc 
ttcaccatca gcatcgataa caatggtctg 
tcagtcactc ttggtaatgg ggggaccatt 
attgatgtgc tgggggatct aaacctggtc 
ggtgacggga cgattaatac cacagcaaca 
ggtgcacatc agattggtac caatggcgaa 
gccagcagcg ccacggttaa tctgggcacc 
agcgagagta ttgccaacgt tctgagcggt 
ggagcagata ccgcactgac ggccaataac 
ggtaacagca aactgacggt tgcgtcaacg 
ctggcaggga cgggcgatac cctgtcgctg 
gtcaccggca gcggtgtact gcaagtgacc 
aacggggtaa gcaacgcggt aactattgat 
attgctctgt ttaatcatgc gttgaccggt 
gccagcaccg cgtttgactt tggtgctacc 
ctgaacaatt ctacttttga tttaagcggc 
ttgaaattat ccagcggtaa cctgacctcg 
ttggcgatga atggcggcac gttgctgttt 
acttcagatg ggacgattgc ggctaatagc 
gttaatttac cgagcaatct ggcaccaagt 
gaaggcgaaa tcattgtcac tctggcaact 
acgttgacgg atgagaatgg tgatccgatc 
gctggcagta cctcagccgc cgccaccggt 
gattatgatg gcctgtatgt caattacggt 
agcgaggcgt tggtattgac cgccatcttg 
tcagcccaaa tcacagggag cggtgatctg 
gcatccttat ctaatagcac caacagctat 
ttgcgtctgg atgctgattc agcactgggg 
actcatgttg atatcaacgg tacccagcag 
agtacactgg atctcaacga cggtaagtta 
gcattgactg gcggtggtga actggtactg 
aacgctggct ttactggcag tactgatatt 
gcgcaggggc tagggaacgg caccattaat 
atcgggacac tgtttaacgc tttgaccggt 
gccagtgttc aattggccgg tgataacagc 
gggagtattc tcattgccaa tagtgccgag 
ggtgcgttga tcctggatac gggttcagtc 
gggaccttgg ttaagcgtgg cagcggaacg 
ggtctgacta cgattgaaga gggtttactg 
ctttcgctgg aagagtccct gcaagagagg 
gccaatctga ccagtaacgt actgattacc 
gtgaccggta atgttgagaa ctatggcaac 
gattttggca ccttcaccat tgatggtaat 
aatactatcc tggctggcga tacatcggta 
gcagggcaaa gttatgtcac ggtaaacaat 
ggcatcaaaa ttattgatgt cggcggtgat 
gctgttggcg gtgcttatga gtacttcttg 
ggcaactggt atctgcgtac tgaggcagat 
gctaacctgg cggcggccaa caatatgttc 
acgctgtata ccgatgtctt taccggtgag 
gaaggtagcc ataatcgctc ccgcgatgat 
tatgtgatgc aacttggcgg cgatgtggcg 
cgtgttgggg tgatggcggg ctatgccaat 
ggctaccgtt ctactggctc ggtggatggc 
gccgataatg ccgatgatac tggcgcgtat 
gacaaccgcg ttagcgggca ggatttagcc 
gcatcagtcg aagggggcta tgccttcaaa 
attcagccaa aagcacaggt ggtgtggatg 
aatggtacgg ttatctctgg tgacggtaat 
gcctttatca atccaagtga taaagccaaa 
gaagccaatt ggatccataa caccaaagat 
aaacaagccg ggacggcgaa tatcgcggag 
agccaactga atctttgggg aaatatcggt 
accagcgtgg tgttaggcgt taaatataat 
taacgacatg atgatcaacg gcgcatattg 
ataacggacc gtacagtatc aaggcgagat 
99^99^99^^ cctttggcag agagaacctc 
gagctcatcg ttgattcttt tgccgacaat 
gttggctgcc aattcagccg ccagtttatc 
cttttcgacc gcagcagcat aacgttcagg 
agaccattgt gatagccagc gcgagaaatc 



12421 tgatacgcca aagaagcgca tttctttgtt taccgaggag agctgtaagg aattgagcag 
12481 tttgacctga taggcgagat agacttcgat atcatcgact gaatttacat tcagtgtttt 
12541 tagtattttt actttttcgc gggcaatatt ctcaagttgt tccagacgga acatctctcg 
12601 gccacgcgcc atcagttcag gtagtttttc atcatatttc cccttttcga cctcatggag 
12661 catgactgcc ttttgcatat cgttatgcgt caaggtaatg cggtcttcac aggtcgcgct 
12721 tgcttcctga gcaataagaa aggtctgctc acgtagagta ggtgagtcag ccaattgcaa 
12781 caaccatgcc gcaatttgtt gtttaaattc agggatatta tttgcgtttt gagtggtagc 

12841 caaacggtga agaaaaacgc taaaagcggc tgagttagct tctgtctcga tatttgccca
12901 atctcctgcc aacctatttt gttgttccgg cggcaaccag gtggcaacaa tgttacggag 
12961 actcgctggt ggtagattgt ctaattcgct ccagttaatc cgtggcccct gataatacgg 

13021 gttaatacgg agatgttgta gtacctgcag tgtgctggta gacagagggt taccttctaa
13081 gagaatttca cagttttggg gcagacgggc caggtcggga agttgggata attgattata
13141 ttcagcgctg atacacctta aactaccggg taaattttct ggcagttttg tgagcctatt 
13201 gctcaatacc attaaaaatg acagcgaggc gggcaatgtc tcggggagtt gtgtgagttg 
13261 attactggtc acacttagat ttaatagcga tattggcaat gtatcgggca atgtgcgcag 
13321 ttgattacta tccgcatcca gtcttttcaa actgttcggc agtgtaggca gttcagttaa 
13381 gccacagctt gcaataccaa gagactccag attcgggggc aagcgatttg ggagttgcag
13441 tgactcatta tcattaacgt ctaattcttt tagcccttca ggtagcgact cagggagccg 
13501 ttctagttca tttccgtaca catttagaga taacagcgag gccggtaatg tatccggtaa 
13561 tgtacgcagt tgattaaaag aagcattcag tttttgcagg ttgtcgggca gtgttgctgg 
13621 caactcagtt aacttattac ccatgagatc cagttcgtta cacggcggca gaatgtctgg
13681 taatgaggcc aaattaagat cacggagaca gaggtgatag gtattttttt ccaggcaatc
13741 tttcatgcat tccacggcga agcgccgttg ttcgtcagta ccaggtattg cttcgctttc
13801 ccatttttcc catattgcat aatagtcagc cggagttaag gctgctgttg atgtccgtgc 
13861 agaatggatt tcacggtctg gctcgatgtt cggcattgac acgttcgacg ttatatttga
13921 tagattcatt gtattaaaac ctcttcctta ataagcctta taggtattat tttggctgat

13981 aaataacagg gggtggataa ggtctactac tggtattttt agcttactgg cttgaataag

14041 ccaaaaaagc gggtggtgtg gagggagaag aatgacctat ctacggcgca tattgcagga
14101 cagaatatgc gccgtttttt atgagataac gggttgtgta gtattaaaac gagatgagtt 
14161 cactacacac cgaggttatt gacaaagagc ctgcgctgga gaaaacaggc agatcgtaaa 
14221 gataccgtaa acccttcctt ggggggcgat ccgcaccatc cctggcgcgg acgctttact 
14281 cttctgccta ccctcaccgt taaaagtcaa ttcatcgggg tttgtcagcc gtctgacacc 
14341 tcgtgttttg acttttcgca ggcaattttt tcgagttgtt ccagacggaa catctcccgg
14401 ccacgcgcca tcagttcagg tgatttttca tcatactttc ccttctagcc cccatgattc
14461 atgactgctt ttgcatattg tttaagtcga ttcagctaat ccgtggcccc tgataattcg 
14521 gcttagtact gagacgcagt agtgccagca gcgtgctggg agaaagaggg ttaccgtcta 
14581 agcaaatgtt acaattttgg cgcagatggg ccaggtcggg aagctgggat aattgattat 
14641 attcagtata tatacacctt aaactaccgg gtaaattttc cggtaatgct gtgagttgat 
14701 tataggctac atttaacctt aacagcgata tcggcaatgt atcgggcaat atgcgcagtt 
14761 gattaccatg cgcatccagt tctttcaaac tgttcggcag tgtaggcagt tcagttaagt 
14821 tacagtttga aatatcaaga gactccagaa ccgggggcaa gcggcttggg agttgcagtg
14881 acgtattaca accaacgtct aatgttttca acccttcagg tagtggctca gggagccgct 
14941 ccagttcatt catgtgcacc tttagagata acagcgaggc cggtaatgta ttcggtaatg
15001 tacgcagctg attataagaa gccttcagtg tttgcagatt gtctggcagt gttgttggta 
15061 actcagttaa attattgcat ataatattca gttcattaca cggtggcagg gtatcgggta 
15121 atgaggtcaa atcaagagaa gagagattga ggcgctcggc attgttttcc aaacactctt 
15181 tcattctcgc tacggcttgg ccccgttgtt caccagcaac agttcttgga tcattttccc 
15241 acttttccca tattgcatga tagtcagccg gagttaaggc tgttgccgct ggccggtcag 
15301 catggatttc acggtctggt cctatgttcg gcattgacac gttcgaggtt atatttgata 
15361 gatacattgt attacgctcc cttaataagt tttagaagca ttattttggc tgataaataa 
15421 caggggatac taaggtctac tactggtatt tttagctaac tggtggtctg aattagccaa 
15481 acgagcgggg gggagggaga agaatgacct atctgcggcg catattgcag gacagaatat 
15541 gcgccgtttt ttatgggata cgagctgtgt agtattaaaa cgagattaga gattccatcc 
15601 acagattatc taataaggag gaggctcctt tggcagagag aacctcatga gtcagggcca 
15661 tatcgatctc ctccattatc tcatcgttga ttcttttgcc gacaatgcgg ttggcatcca 
15721 catcgcccgt catcccgttg gctgccaatt cagccgccag tttatctggg taaatattct 
15781 ccagtgcgtg gtactgcttt tcgaccgcag cagcataacg ttcaggctca atacgttgca 
15841 ccacgctttt ccacggagac cattgtgata gccagcgcgg gaaatcttgg ttttccgcag 
15901 ttttcacccg ggtttctgcg ctcagtaaat cgtctgttgt cacatccgat acgccaaaga 
15961 agcgcatttc tttgtttacc gaggagagct ctaaggaccc acgtagttta acctgatagg 
16021 cgagatagac ttcgatgtcg tcgacagaat gtacattcag tgtttttagt gtttttactt 
16081 tttcgcgggc aatattctca agttgttcca gacggaacat ctctcggcca cgcgctatca 
16141 gttcaggtag tttttcatca tatttcccct tttcgacctc atggagcatg actgcctttt 
16201 gcatatcgtt atgcgtcaag gtaatgcggt cttcacaggt cgcgctagct tcctgagcga 
16261 taagaaaggt ctgctcacgt agagtaaatg agccagccag ttgcaacaac caggctgcaa 
16321 tttgttgttt aaattcaggg atattcttta cgttttgagt ggtagccaag cggcaaagaa 
16381 aaacgttaaa atcgcctgag ttagtttctt tttcgatctt tgcccaatct tctgccaacg 
16441 atttttgttg ttccggcggc acccaggtgg caacaatgtt acggagactc actggtggta 
16501 gattgtctaa ttcgcggcag ctaatccgtg gcccctgata atccaggtta atatagagat 
16561 gctgtagtgc ctgcagcgta ctggtagaaa aagggttacc gtctaagcga atgctacaat 
16621 tttggggcag gtgggctagg tcgggaagtt gggataattg attatcttta gcatatatat 
16681 cccttaaact accgggtaaa ttttccggta gtgctgtgag tctattgctc gataccagta 
16741 aagatgacag aagggcgggc aatgtatcgg gcaatgtgtg cagttgatta ttatccacct 
16801 ccagtttttc taaactgttc ggcaatgtag gcagttcagt taagcgacag tttgccatac 
16861 caagagactc cagattcggg ggcaagcggt ttgggcgttg cagtgactca ttacgaccaa 



16981 acacatttag agataacagc gaggccggta 
17041 aagcagcatt cagtgtttgc aggttgtctg 
17101 tttcaataat attcagttta ttacacggtg 
17161 gctcatcgag atcgaggcgt tcggtattgt 
17221 cctggccccg ttgttcacca gcaacagttc 
17281 cataatagtc agccggagtt aaggctgctg 
17341 ctggctcgat gttcggcatt gactcgttcg 
17401 aacctctccc ttaataagcc ttataggtat
17461 taaggtctac tactggtatt tttaactcat 
17521 attgtgagtt gccgacgagg tcaatgaagt 
17581 gcagtaataa atccaattgt tttttagtct 
17641 ctatatctgc atatgccaat tcttatggat 
17701 atttaatatt attctataat gataaggtta 
17761 attataaaaa ttatcgataa aatattatat 
17821 tcgagatgcg ggtaggcggc aaccgagcaa 
17881 gggatgagta agggcagcca acatccctgt 
17941 attcgcgaga tattgctctt cattgacgga 
18001 ttctatcttc tcttacatcc acgcagtacc
18061 ctctcctgct cgctttagcc atgacatttt 
18121 taaaaccgca gaacgatcgg cagaccggca 
18181 ccatgccaga catcaccatg ctatttattt 
18241 ttggattatc gtttatcaac tttgattatt 
18301 ttaccaatat gttggcaccg gttgaactgg
18361 ttattaactt cccaccgctg gcgatcacca 
18421 agggcagtgg ctttattcag gtgatgttga 
18481 ttttaacgcc agcggtcgtc tttatcggtg 
18541 atgtgatcct gatgcctgtc gccgcgatga 
18601 ccggtattgc cgccgctttc gctgggttag 
18661 cgattatcga cccgattatg cagggcttca 
18721 gttataacgt caatgtgctg tgtaactact 
18781 ttttggtgtg ctggtttatc accgataaaa 
18841 cactgaatag cgatctcagt gggttgcaga 
18901 aggagagtaa agcgtttaaa cgggcagggt 
18961 ccgcagcact ctatccggaa aactccttat
19021 ctgatgcacc gattatgcaa gcgattgtgc 
19081 gcctgatcta tggtttcagt tctggcacct 
19141 tggaaaacat tacccgctca ttggtctcgt 
19201 tcctctattc attcagccac tctaatattg 
19261 tcctgcgtac gctggcgatg cccgctggtt 
19321 gtgcgctgga tattattatt acgtcagcgt 
19381 tggttccaat gctgatggcc gtagggattt
19441 ttagctcaac agtgaatatc tcgacaccaa 
19501 actgtcgcca gtattgttcg aagaccggcg 
19561 atacactagg gttaaccgtg gtactgaccc 
19621 ttccattagg tttccagagt gggtatacgt 
19681 tgacgctgta ggggtgttag ctgcgcgcac 
19741 aaaaatgact atcgcactct cagaacaact 
19801 tagcagccag agcgatgccg cctctccaac 
19861 ggcgcagatg ttagctgaag aactgcgtca 
19921 acatgccacg gtgacagccc gtaaacccgg 
19981 tattacacat attgatacgg tcgacgttgg 
20041 acgctttacc ggtagcgatc tttgcctaaa 
20101 agaacatccg gaaatcctgc gttatcaggg 
20161 tgtactaggc gcagataata aagcggcagt 
20221 gacggctgac gattgccatg gcgatatcgt 
20281 tctgcgcggt gctaaagcgc tggatctggc 
20341 tgactgctgc gagctgggcg aagtggtgta 
20401 cgatatcatc ggggtgacgg ctcacccgat 
20461 ccgtgtcgcc tatgacatta tcagtgagtt 
20521 agggcgggaa ggctatgtct ggtttaccga 
20581 gaaaatagca atccgtgatt ttgataatgt 
20641 tgaagtggtc gcgaaagttt cagcacaata 
20701 cgatgtgtac agcaatatca gtaactcgat 
20761 cttctcttct atggcggaac tgaatatcga 
20821 cgatggtgcg gcgttatcaa cccaagggtt 
20881 taatttccac tcaccgttcg aatttttacc 
20941 aacccggacg atctgtttgt ctgcggccaa 
21001 aatgccttat ctccatgtag ggtactggcg 
21061 tgacttctgc tgtacgcctg ccactggcga 
21121 ttcttactga atgcaggtca ggcaatagga 
21181 aatgataatc gtttgcaatt agtataacaa 
21241 tatagtaagt gtcataataa tgtttttgat 
21301 agtaaatatc atctaattaa cagggttatt 
21361 attaatgatg gtattttatc ggaagttttt 
21421 cctggcatgg tcacaaagtc ataataacag 



atgtattcgg taatgtacgc agttgattaa 
gcagtgttgt cggcaactca gttaacttat 
gcagggtatc gggtaatgag gtcaaaccaa 
tttccaaaca ctctttcatt ctcgctacgg 
ttggatcatt ttcccatttt tcccatattg 
ttgatgtccg tgcagaatgg atttcacggt 
acgttatatt tgatagattc attgtattaa 
tattttggct gataaataac agggggtgga 
tgacggcctg agtcagttta tctgaacggg 
gaagggattg gcagtgatgg ctcccagata 
aatatatcga tattaagtat taaatggttt 
aaaaataaat attatcagtg agaatggaag 
attaatttaa attgacggtt taggattttt 
tattcattgg gttactatac ccaagttacg 
actcccaagc gcttagataa ctaagtgata 
agcttgaaag gcgacggata taaatgggtt 
tttatcagct gattttccat tattcattgg 
tctactgagg caggaaatca tcagtatctc 
tacaagggaa aggacagaat atgtcagatg 
tactgaacgc gattgaacgt attggtaatg 
acgcgctggg ggtgtgctgg gcactctcct 
tccaccctct gaccggtgat cagattaaag 
tgacctttat taccactatg gtcaaaaact 
ttgttgctac tttggggatt ggtattgccg 
aaaaaatgct caacgtcact ccacagcgta 
tggttagcca tgtcgcctct gattcagctt 
tgttctatgc cagtgggcgt cacccattag 
cagggggatt ctctgccagc tatacgccat 
cccaaggagc cgcgcacatt attgacccaa 
tcctcagttt cggtagcacc tttggcgtga 
ttgtggagcc tcgtttgctt aaatcgatgc 
gcaataatga gttacagaag gttactccgc 
tggttatgct gctgatgttg attggcttgg 
tgcgttcacc agatggcagc ctgacccgcc 
cactgctgtt ctttatgttt gcgattcctg 
ttaagagcac caaagatgtt accgccagca 
ttttggtctt tagcttcttc tgcgcccagt 
gcacattaat ggcgatttct ggcgcggact 
tcacgatttt aggtgtgatc ctgttatctt 
catctaaatg gtccattctg gcaccgatat 
cacccgagtt gacccaagcc agtttccgcg 
tgtttgcctt ctatccgctg atcattatgt 
tcgggacgtt atgcgccatg atgctgcctt 
tgatgttgta tctgttctgg ggtctggata 
atccagcaac tcaatgaccc ccttcggcct 
tcagtggttt tagctataaa taaaggaata 
gacaaaccgc tttttccgtt atctggcggt 
actgccgagt accgaagggc agcacaaaat 
gttggggttg gaagatatcc tgattgatga 
caatcagcct accgcaccac gcattggttt
cttatcgccc gatattcatc cgcaacgctt 
cgcagagcag ggcatctatc tacgtaccgc 
cgaagagatt attttcagcg atggcaccag 
gaccgtagtc atgacgttgt tggaaaacct 
ggtggcgttt gttcctgacg aagagattgg 
gcgttttgat gttgatttcg cttacactat 
tgaaaacttt aacgccgcct cggcagagat 
gtcagctaaa aatgtgctga tcaaccctat 
ctcgccacaa gagacgccag aacacactga 
tatgactgcc aaccctaaca gcgcaaaact 
gtcattcgcc gcgcgtaaag cctatatcgg 
tccacgagcg aagatcagct attcagtcac 
tggtgaagat aagcgggcta tcgacctgat 
acctaaagtg atccctatgc gcggcggtac 
gctgacccca aattatttta ccggggcgca 
gatcagctcc tttgtgaaat catatcaatt 
agcgtaaaag ctggaaaggt attgatatta 
tgttagcttg ccagtaccct attaatgttg 
gtaacaggtt gagaatcggt acccatgtcc 
ttgtataaaa tgtgatccaa cccacttgaa 
atgaaaggta acagcataaa ataaaaataa 
aaaaccttct tatcgattta tataaatgtg 
taatcttgtt atctcacttc attatggaaa 
tactgtaacg gttttaattt cattaccctg 
taaggatgaa ttagatacta ttacggtagt 



21541 ctttgattta gaaagagaaa atatagaaaa 
21601 tgtctatatg gttaaagcgg gcaatccatc 
21661 gacccccgga atggaaggga ttcaatcagt 
21721 tacttttgat acggaattat tggatgttga 
21781 gacgctatat ggtcgtaata ctgaatctgg 
21841 attttcacct gagtataaaa taggtctatc 
21901 cacggtttta ggtggaagta ttaatgacag 
21961 atatctttat gggaatggct actttaagcg 
22021 tctcaatgat tttagtggtc ggtttaaact 
22081 tgtgatgacg acctttgata tccaaaatag 
22141 agataaaata aagtcgggtc aaaaatatgt 
22201 tgatgcttat aaaggtcaag taaatgcagt 
22261 ggttagtgct tatgtagacg aaagaaaagt 
22321 atcaattagt gaactattga tgaatcgaaa 
22381 gaactctaaa tacagcgggc catttaattg 
22441 tgatgaaaat gaaattgatt tccgctatct 
22501 cgatatcaag accaataact acgcagtgtt 
22561 tgttgagctt gtggcgggtg caaggtatga 
22621 ggataatggg tttaacccat atcagccgta 
22681 ggcttttttg cctaaggtag ggttgaatta 
22741 cagcatagca cgggggtata aaagtggcgg 
22801 cgcttataat gcagaatata tgacgactta
22861 tagaacggtt cgttggaata cgtctctatt 
22921 agtggcgtat tatcctattt cttattcggt 
22981 ggagtctgaa cttgcgtggc gtattacgcg 
23041 tactgatgcg tattttaaaa atttcccaac 
23101 tgttaactat aaaggtaatc gaccggctaa 
23161 cgattataac ttccttaacg gttattttgt 
23221 ttatcttgat aatgcgaata gtaaaaaaca 
23281 cggttatgag aataaagatt atggtgtcaa 
23341 ttatgtgact cgtgctttca aaatggatga 
23401 tattaatttt ggggttaatt tcaatgttaa 
23461 tgtgactatt agatacagcc acatattaat
23521 actttatgtt tatgcataga gtcgttgatt 
23581 taatggtatg aatttattaa ataatgaaac 

23641 ggcaacaaag tccaaatatg atgtcgcatt
23701 ttactcttat agccaactga tatcactggg 
23761 aaacaatccg cctgggcctg ctattctgct 
23821 ctttatcgct tgtattatta ggggggtgcc 
23881 taaattgggt cgcagtattg ataccttgtc 
23941 tatattcacc ttaagttcgt tatccgacga 
24001 tcagattgtt tttgttgatc aacaactcac 
24061 aaattatgat gcactttatt atcccagtca 
24121 ttccacagca aagccaaaag cggtatgtaa 
24181 tatcttgctt gaattacatc tgcattgtca 
24241 tttttatcat gatttaggca tgttcagcgg 
24301 ctgtaatttc atgccttctg tgcactttat 
24361 taatgattat caggctaact ccggggctgc 
24421 aatggtgacg gacgaacaca tttgccagtt 
24481 tggggctgaa cccattcgcc cagagacaat 
24541 aggttttgat gcccacagtt tcacgcccgg 
24601 aagccataag cctatcgata ctgattactg 
24661 ggcaggtaaa gccattttag atccgcaagg 
24721 tagagcatgg caattgcaga ttgttgatcc 
24781 cattggggaa gtctgggttc gaggcggttc 
24841 actcacaaaa gccactttcc acaatacact 
24901 tacgggggat cttgcctttg tgtatcaagg 

24961 tctcattatt gtggctgggc aaaattatat
25021 cgcctgcgat gatatccaca cggggggagt 
25081 atttattgtg atggccgagg tctatcgtca 
25141 tcagcagatt gtggcgactg tagcccgcca 
25201 actcgcccgt ggaaagctct taaaaacatc 
25261 ttctctctat ttcacgcaac aattgagcgc 
25321 tcaagatgaa atgttatttg atcaagaaag 
25381 tgattggctc cataaaacag ggtgtgaatt 
25441 tgggctacat caggcagggt taaccagttt 
25501 tcattttctg ggctgtcatc tcaatgtcgc 
25561 actgacctta tttttatctc agcaccaaga 
25621 aggagatgat cgggatgtcg cgattgtggc 
25681 ggtgagctgg gcggactatt gcacagaatt 
25741 aaaaaatcga ctgcgacact gtgttttacc 
25801 cgacgctgct ttctttaata tcagcgaacg 
25861 tcatatttta gagctgagtt ggcacctgtt 
25921 gaaacaaggc gatattggtt tttacattgg 
25981 gttacagaaa aatgaccccg attatgccaa 



tatctacgaa tcaataatgc gcatccctaa 
tgacgcaggt ttttttacca tgcgaggtac 
aggtttcttt attgatggtg tttatgctaa 
tagaatcgaa gtattaagag gcccgcaagc 
ggtaatcaat gtcattacta aagatcctga 
ttatgggaat tataatcgta cacaggtaac 
tgaacagttt tcttatcgtg cagcattgaa 
tgattatgac ggtaaaaata atgtagataa 
acgttggcaa cctatggatg atggttggga 
gcgtaacgga aacacttctt ttaccgcatt 
tgactcaaat tatattggca aatctgatgt 
ttataccttt gatgatattg attttacctc 
tgataatcaa gacctcgatt ttacgcgatt 
aacaaaacaa ttttcacaag agtttcgttt 
gttgataggg agctattatt tttatcaaga 
gccctataac ttagcacaac tccggaaatc 
tggcaatgtt aattattacc tgctgaatga 
ctatgagaaa aagaaactta actttttgat 
ttctcatgat aataatagta attcatttgg 
ttatattact ggcgacgcca tgctatatac 
gtttaacacg ttagggcctc aatctagtcg 
tgaggccggt gttaaaaccg aatggtttga 
ctggaatgat atgaaagatc agcaagtcga 
taacagcggt aagtcattaa gccgtgggtt 
tggcctgact gtatctgcaa atgtaggcta 
agaaataaaa gtagataata actatattcc 
ttcgccaggg tacacttata gtattggtgc 
taatgctact tataatgtta aaggaagtac 
accagcttat gggctattag atcttactgc 
tgtttggata aagaatattc ttgatgaaac 
tgggatatgg tatgggcgcg cgggtgaacc 
attttaataa atattaataa tacatatggc 
tttggtgtaa aagagctatg tcctgtgggc 
tatcaaaagt gatctatatg caactacagg 
tcgttgggtg agggaggcgt tatttgaaca 
gcgttattgt gataaagatg atgtagattc 
ctattcgtta gctacacaga ttgaaataga 
catgccagga ggctcgcttt ttgtcataag 
ggctgtccct gtgcatctaa ccaatcattt 
ccatattttg gctgatagcc atggtgaata 
aattaaaaaa caaggttggc atcgtacaca 
cggtagtcgt cgctatttcc ccacgaaaaa 
tttacctgtg tatctgcaat attcttctgg 
ttacgatcat aatatgcggg tgcaacatca 
gccaaaaatc atcaccgcga actggctgcc 
tttgttattg ccattgttat ccggtggttg 
tacagagcca tttcgttggc tgaaaatgat 
ccccgatttt gcgtgggatt tgtgtaccac 
ggatctctcc tccatcaaga tggcaatgaa 
ggcaaatttt gcggcccgtt ttgcatccac 
ttatgggatg gccgaagcaa ctctgaccgt 
ctgtttggca tttaacccaa tggcattagc 
gtgtcggttg gtaagctcag ggcatgtggt 
ccagacatgt cagcccttgc ctgaggcgaa 
tgtggcgggg gggtattggc agcagcccga 
ggcaggtaac tctcaccatg attatttacg 
ggaactcttt atttgtgggc gattaaaaga 
gcccaaggat attgaaacag caattgagta 
gtgtgtgtgt cagcattttg acagcggcga 
tttaacatca acggcacttc agcgtattgc 
ttttcaactg aaatctgaca aggtcatctt 
gagtggcaag atccgccgca gtcatatgct 
gctttatgta caggagtcac agagaaagac 
cgtgttattt gagcaagagg ccatactgtt 
actcggttgc gaggcgatca atcctgatag 
gttagcgata caattttgtc agagtattaa 
agatctcttc gcctatccta cattacggca 
agatatcgcc tcctcaagtt caacggaagc 
tatcagttgt cgcttaccgg ggcaggtggg 
atccgcaggt aacagcgcgg tacgtcatga 
gataggcgca ttagaagata ttgatcagtt 
agaagccatc ttactcgatc cacaacagcg 
tgagcaggcg ggttggcaac ctgaacaatt 
tcaaagtggc agcgaatatg ggcaaatgtt 
aagttatctg gcgacgggga tttgtagcag 



26101 taccgcttgt tcttcttcgc tggtcgcact ggatagcgcg atgttgtcac tacagcaaaa 
26161 acagtgcagc gcggctgtcg tggggggcgt gaatttactc ctatcggcac aaatagaaca 
26221 aactcttatc aatgcgggta tgttatcacc caaggggcac tgtgctacgt tcagtgatga 
26281 tgctgatggc tatgcccgag gagaaggggc ggtattattc ctgttaaaac cttatcgtca 
26341 ggccgtgata gatggcgatc ctgtcttagc gatcattgag ggcagtgtcg ttgcgcagga 
26401 tggtgagagc agcagcctga cggcaccgaa tccacacgct caagcggcta tgatgtcgca 
26461 gcttttacag cgagcatcat tggctcccga tgatatctcg ttattggaaa cccatggaac 
26521 gggtacttcg ttaggggatc ccgttgagct aagcgccatt gatcgtgtct atggtcaacg 
26581 aaaaacgcca ttacgtttag gggcgagtaa aagtcaggtt gggcacttgg aggccgctgc 
26641 ggggggattt gcattagcca gggctgtcgc gcagatacaa caacagaagg cctttggtca 
26701 ccccacgcta aatagctata acccactgat agcagaaaaa ttatcacgtt attatttcga 
26761 taaagatacc tgcccttggg taatcaaacg tgtcgccatc aatgctttta gctttaccgg 
26821 aacgatggct gcggccgtgt tgagaaagcc tacggcagtt gctagatcaa taaagtcaat 
26881 tacatcaata aagtcagcca catcaataaa atcagttaca tcaataaatg taacgaacgg 
26941 tgcaatagtg atccacaccc aacgcctgaa atcagatcca gggggtaatc tgctctcctg 
27001 attcaggaga gtttatggtc acttttgaga cagttatgga aattaaaatc ctgcacaagc 
27061 agggaatgag tagccgggcg attgccagag aactggggat ctcccgcaat accgttaaac 
27121 gttatttgca ggcaaaatct gagccgccaa aatatacgcc gcgacctgct gttgcttcac 
27181 tcctggatga ataccgggat tatattcgtc aacgcatcgc cgatgctcat ccttacaaaa 
27241 tcccggcaac ggtaatcgct cgcgagatca gagaccaggg atatcgtggc ggaatgacca 
27301 ttctcagggc attcattcgt tctctctcgg ttcctcagga gcaggagcct gccgttcggt
27361 tcgaaactga acccggacga cagatgcagg ttgactgggg cactatgcgt aatggtcgct 
27421 caccgcttca cgtgttcgtt gctgttctcg gatacagccg aatgctgtac atcgaattca 
27481 ctgacaatat gcgttatgac acgctggaga cctgccatcg taatgcgttc cgcttctttg 
27541 gtggtgtgcc gcgcgaagtg ttgtatgaca atatgaaaac tgtggttctg caacgtgacg 
27601 catatcagac cggtcagcac cggttccatc cttcgctgtg gcagttcggc aaggagatgg 
27661 gcttctctcc ccgactgtgt cgccccttca gggcacagac taaaggtaag gtggaacgga 
27721 tggtgcagta cacccgtaac agtttttaca tcccactaat gactcgcctg cgcccgatgg 
27781 ggatcactgt cgatgttgaa acagccaacc gccacggtct gcgctggctg cacgatgtcg 
27841 ctaaccaacg aaagcatgaa acaatccagg cccgtccctg cgatcgctgg ctcgaagagc 
27901 agcagtccat gctggcactg cctccggaga aaaaagagta tgacgtgcat cttgatgaaa 
27961 atctggtgaa cttcgacaaa caccccctgc atcatccact ctccatctac gactcattct 
28021 gcagaggagt ggcgtgatga tggaactgca acatcaacga ctgatggcgc tcgccgggca 
28081 gttgcaactg gaaagcctta taagcgcagc gcctgcgctg tcacaacagg cagtagacca 
28141 ggaatggagt tatatggact tcctggagca tctgcttcat gaagaaaaac tggcacgtca
28201 tcaacgtaaa caggcgatgt atacccgaat ggcagccttc ccggcggtga aaacgttcga 
28261 agagtatgac ttcacattcg ccaccggagc accgcagaag caactccagt cgttacgctc 
28321 actcagcttc atagaacgta atgaaaatat cgtattactg gggccatcag gtgtggggaa 
28381 aacccatctg gcaatagcga tgggctatga agcagtccgt gcaggtatca aagttcgctt 
28441 cacaacagca gcagatctgt tacttcagtt atctacggca caacgtcagg gccgttataa 
28501 aacgacgctt cagcgtggag taatggcccc ccgcctgctc atcattgatg aaataggcta 
28561 tctgccgttc agtcaggaag aagcaaagct gttcttccag gtcatcgcta aacgttacga 
28621 aaagagcgca atgatcctga catccaatct gccgttcggg cagtgggatc aaacgttcgc 
28681 cggtgatgca gcactgacct cagcgatgct ggaccgtatc ttacaccact cacatgtcgt 
28741 tcaaatcaaa ggagaaagct atcgactcag acagaaacga aaggccgggg ttatagcaga
28801 agctaatcct gagtaaaacg gtggatcaat attgggccgt tggtggagat ataagtggat 
28861 cacttttcat ccgtcgttga caaataacgc tcccgctaca aaaagagatg ggtttttacg 
28921 gtgttcgaaa tacagcccaa gcgggtaaca tcatactcag tagccccgca ggacgtattc 
28981 gtctggttat ctcaagccgt gggcgtataa ggttatgtag tgaacaacaa tccatggcag 
29041 gcatccattt atgcctttag tggtcaatgt cttgtctacc ccactcaacg cgcagccaac 
29101 acgaaccgcg ggttttacct tgcctgaaat gatgctggcg ctaagtttcg gtagcctgat
29161 agcgctgagc acagcacagg tcttacccaa attaagccag caaatttcgg tgttgcaaca
29221 gcattatcgc ctggaattag taatgaatca ggctatgggg gctatggaga aagatctgcg 
29281 cagagcgggc ttttgccacg ggaaatgtca gggggaggcc ataacgatag aacattatct 
29341 cggtgaaacg gctcactctt gcctgattgt tgcttatgat cttaattgca atgggagatg 
29401 ggaaggggcg aaacatcaag agtcagaata ttttggctat cggctacgca ataaagcgtt 
29461 agagagtcag cgtggcgaac tgaactgcca tggccgtggt tgggaaaaac tgtttgatcc 
29521 acgagatgtt actgtgactc acttttccgt aacccctctg tctgggcaac ttttcaaatt 
29581 gcggcttgtg gggcataaaa cgggtaatcc agctatccac catcaggtga cttaccttat 
29641 ccgtgggaat aacgtatgac gcaggtattt caacgaggca gcagtaccct ggcggcggtt 
29701 atgacacttt tttctttggg attattttgg ctatcagcga tacatcggca attagacaat 

29761 atccagcaaa ttaccggaga ggaacaacgc tatttacggg cttacaatca ggctgaatca
29821 tcactcaact ggggtgtcag ccagcgctgg gcgctcagaa taccttggcg ggtaggttca 
29881 gcatggcatt gcatggcaca tcaggagctc ggcttaaaag cttgcgttaa gcggtcatca 
29941 ctggcgggtt tttttattct caaaggagaa agtcttccgc ttgggtcgct ccctccgctg 

30001 atgctttacc aacgtgttaa actgaaggca gttacaggaa gttcggggaa ttatcagttg
30061 attgacacgc ctcatggttg gttggatttt tgccctgata aggatgcaca gttttgcctc 
30121 gattaataac caaacacgct tatggacaaa ctatttcagt ggagacaggc ggtttcaacc
30181 tacgtcacct gcgtggattc agcttactgg aagttttgat tgccgcatta tttttctcaa
30241 tctcactact ggggctactg caataccatc aggcactttt acagggcttt gcctcctcat
30301 ggcaacaacg gcaagcctgg tcgtggttac atcaatatat ggagagtcag agcggtacag
30361 accccgaaga acggtgggca cctgaagtga aaccaggctg gcactatcgc caatttatcg
30421 atcgcattga gggggattgc cgggaattta gcgttaagtt gataacccgc caagactcgc
30481 cagtagaatt aagccgatgg ttttgttcaa ctgagatgat ttagggtatg tattaccgat 
30541 acctgttgcg cagactcatt atcaagttag agtattcaac gtcaggaact atccatcagc 



30661 gttcacggtt tatcattcta atcaactgga tttactcaaa gcgctcacca ctgcgctgat
30721 agaaagagaa ccgttggata atcctttcca gcaagaagtg gtgttagtac aaagccccgg
30781 catggcccaa tggttacaaa tgcaactggc tcaacagttc agtattgctg ccaatatagt
30841 gtttcctcta cccgctactt ttatctggga tatgtttacc cgcgttctgc ctgatattcc
30901 taaagaaagt gcattcagca aagaggctat gacatggaaa cttatgtggc tactgccaga
30961 tttattggag aatccactat tttctcccat gaagcgctat ttaagtgatg atggggatcg
31021 gcgtaaaatt caccaactcg cggcacgggt agccgatctt tttgaccaat atctggtgta 
31081 tcgtccggaa tggctggaaa gttgggaacg tgggcaattg attgaggggc ttgatgatgc 
31141 acagcaatgg caagcattac tgtgggttga actgacccgc tatactcgcc aattggaaca 
31201 accggaatgg caccgggcaa acctctatca gcgttttatc caccaactac tcaagtcgga 
31261 cgtttgcccg caagggttac cgaaacgggt atttatatgt ggtatatcgg cgttgccacc 
31321 tatctattta caggccctac aagcattagg taaacacatt gatatccatt tgatgtttac 
31381 taacccctgc cgttacttct ggggagatat tcaggattac actttcctcg ctaaattaca
31441 aagccgtaaa cgtcggcatt accgtgaatc gattgaacta agtctgtttc gccacccgca 
31501 gcaggcagag caactattta ataccgatgg tgagcagaac ctcagtaacc cactcttggc 
31561 ctcatggggg cgattaggga gggatcacat gtatttgctc tctcaaattg atgaaattca 
31621 agaggttcat gcttttgtcg atatcgagcc agataatctt ttgcatggca ttcaacacga 
31681 tatgctggag ttggaagatc acgcggttat tggcacaacc ccagaaacct tagcgcgtag 
31741 tgatcaaaaa cgccgattag atctggatga tcgttcactt agtttccatg tctgccacag 
31801 cccacaacga gaagttgagg tgctgcaaga tcatttactt gggctattag ccgcagaccc 
31861 tgaattgacc ccgcgcgaca tcattgtgat ggtggtggat atcgacagtt atacaccgta 
31921 tattcaagct gcgtttggta atgctccgtc tgagcgttat ctgccgtttg cgatttctga 
31981 tcgcaaagcc agtcaggctc atcctgcgtt gcacgccttt attaccctac ttgatttacc 
32041 ccagagtcgc tttacggctg aacaagtctt agcactgttg gaagtccctg ctttggcgac 
32101 caaatttggt atcactgaag atggtttacg gcgcctgcga caatgggttg gtgagtcagg
32161 catccgttgg ggattggatg atgataatgt gcgggagcta tccttacctg ctacgggtca 
32221 acacacctgg cgttttggtt tgacccgcat gctactgggt tatgcgatgg acagtaccgc 
32281 aggggattgg caaggtattc tgccttacga tgaatccagc gggctggcag cggaactcgc 
32341 cgggcagttg gccgatatgc tgatgcacct gagtcaatgg cggcaacaat taggtcagcc 
32401 ccgagaactt agtgaatggt taccaatatg tcgtcaatta ttggatacgt tcttcgatca 
32461 agacaatgac actgaggccg ctctggtgct gattgaacaa cagtggcaga aagtgatcgg 
32521 ctacggtatt gcagcccaat atccagacgt tgtgccactt aatctgctac gagacgaatt 
32581 ggcggcacgt ttagataatg aacgtattag ccaacgtttt ctcgccgggc ctatcaattt 
32641 ctgtacctta atgccaatgc gctccatccc atttaaagtg gtttgtctat tgggcatgaa 
32701 tgatggggta tacccgagaa cattgccccc ccagggattt gatctaatgg ccaagaaggt 
32761 tcgtcggggt gaccgtagcc gccgcgacga tgatcgctat ttgttcctag aggctctgct 
32821 atctgctcag caacagcttt atatcagcta cattgggcgt tctattcaag acaacagtaa 
32881 gcgttatcct tccgtgctgg taagcgagtt aattgaatat gtcgcacaaa gctatcattt 
32941 accgggcgat gaaaaactca gtgctgatga cagtgcacaa cgggtgacgc agcacttatt 
33001 atgttggcat gcccgtatgc cattctccgc tgaaaatttt attaaaaata gcgaattaca 
33061 gagctatgcc gctgaatggc tgccttctgc cgagtcgaag gggcatgcac acccaaattt 
33121 taaccaaccg ctacaggctg agccgctggc tgaaattaca ctggatgaat tagtgcgttt 
33181 ttaccgccat ccagtgagag catttttcca actgcggcta ggggttaatt ttgttattga 
33241 agaaacagaa ttacctgacg aagagccatt tactttggat aaccttagcc gttatcaatt 
33301 taatacccag ctacttaatg cgctgataaa tgaggacgat attaatagcg tctttgccag 
33361 agccagagct gccggcgtac tgccatatgg atcttttggt gaactttatt gggagagtca 
33421 acaagatgag atggtaccct tggctgaaca aatccgcagt gaacgtaaag agaatcacag 
33481 tattgaactc aacattgaat tcgctgacat cacggttacg ggatggattc atcaggttca 
33541 ggacgatggt ttggtccgtt ggcggccatc aatattaacc gccgtcgatg gccttttatt 
33601 gtggcttgaa catctcgtgt actgttcggc gggtggtgag ggagaaagcc ggatatatgg 
33661 acgaaaaggg actgcctggc gttatgcccc tatggcggct gatgaagccc ggccctattt 
33721 acaacaactc attaaaggtt atcaacaagg tttatgtgag ccattgatgc tcttgagtaa 
33781 aagtggatgg gcgtggttga gccaatgctt tgatcgtgag tcaggccaaa ttttgtggga 
33841 cgaagagacg caaggtaaag cacggatgaa acttttacag gtttggcaag gtgatcagag 
33901 agtgacaggc gaaggtgaag accactatat acaaagggtt tgcagacgaa tggacaacca 
33961 acatttggac ataattttac atgaaactga acgttatttg ttaccgatag ctcgccataa 
34021 taaggcgtga agagcgcttg ttgtcagttc aatacaataa agccaatacg tgctttagtg 
34081 accattctta tgaagctagt ggagaaatgg aatgcataaa cagcttgccc gcattgtcgg 
34141 tatcgttttc tttttaggtt tactggctcc atcgagttgg gctgcaattc ctgcgtggca 
34201 acctttggcg gaaaccattt acaaaagtga acatgactta cgtaaatatc aggcaataaa 
34261 attatccaat ggtatgacgg tactactggt ttccgataca caagccccta aatctttggc 
34321 ggccttagcg ctaccggtag ggtcactgga agacccagat aatcagctcg gcttggccca 
34381 ctacttggag catatggtgc tcatggggtc caaacatttc cctgaacctg gcagtttttc 
34441 cgaattccta aaaaagcatg gggggagcca taatgcgagc acggcttctt accgaaccgc 
34501 tttctatctt gaaattgaaa atgatgcatt ggcacctgcg gttgagcggt tggcggatgc 
34561 cattgctgaa cccttgttag accccattaa tgcggatcgt gagcgtaatg ctgtcaatgc 
34621 tgaattgacg atggcccgtt cacgcgatgg tatgcgtatg gcgcaagtta atgcggaaac 
34681 gcttaatccg gcacatccaa gcgcacgttt ttctggtggt aaccttgaaa ctcttaggga 
34741 caagcccgat ggcaagctac atgatgaatt ggtgagcttc tatcaccgct actattctgc 
34801 caatctcatg gtcggggtgt tgtacagcaa ccaatcatta gagcagctag cacaattagc 
34861 cgcagacacc ttcggccgta tccctaatag ggatgcaaaa gtacccacga ttacggtccc 
34921 agtcgtgaca cctgatcaaa cgggaatcat cattcactat gttcctgctc aacctcgtaa 
34981 acagataaaa gtcgatttcc gtattgcgaa caatagtgcg gattttcgta gtaaaacaga 
35041 tacctatatt agctacttga tcagtaatcg cagtaaaaac accttatctg actggctaca 
35101 aaaacagggg ctggcagatg caatcaatgc gggtgctgac ccaatgttgg atagaaatgg 



35221 cgtcgcggca atttttgatt acatcaatat gctgcataaa gaggggatta aaaaaagcta 
35281 tttcgatgaa attgcacatg tattgaatct cgatttccgt tatccctcta tcacccgtga 
35341 tatggactat atcgaatggc tcgtggatat gatgttacga gttcctgttg cacatacgct 
35401 tgatgcgcct tatctggctg atcagtatga tcccaaagca attgcatccc gattggccga 
35461 gatgacgcct gaaaatgccc gcatctggtt tgtcagccct gaagaaccgc ataacaaagt 
35521 agcttatttt gtcgatgcac cttatcaggt cgataaaata ggcgtacagc ggatgaaaga 
35581 atggcagcaa ctggggcaaa agattgcgct aagtttgcca gcactgaacc cgtacattcc 
35641 tgataatttc actctgatca aagcagacaa gaacattacc cgcccacaga acgtggcaga 
35701 ccagccagga ttgcgggtgt tttacatgcc aagtcagtat tttgctgacg aacccaaagc 
35761 cgacattacg gttgctttcc gcaaccctca tgcattgaac tcagctcgcc atcaggtgct 
35821 ttttgccctg acggattacc ttgcgggtct ctcacttgat caactgagtt atcaggcctc 
35881 tattggtggg atcagtttct ctaccgcacc gaacaatggc ttgtatgtta atgctggtgg 
35941 ttttacacag cgtatgccgc agttactgac atctttggtt tcaggctatg ccagttttac 
36001 tccgacagaa gagcaattgg tacaggctaa atcttggtat cgtgagcaat tagacgtggc 
36061 agagaaaggc aaagcttatg agttggctat tcagcccgct aagctgctat cgaatgtgcc 
36121 ttattcggag cgaagtgagc gacgtaaact acttgatagc atcagtgtgc aggatgtgct 
36181 gacctataga gatgatttac tgaaacagtc tgcgatagaa gttttggccg tgggcaatat 
36241 gacagccgaa caagtcactg aactcactga atcgctgaaa aaacagctga acttaatcgg 
36301 aaccacgtgg tgggtcggtg aagatgtcat tattgagaaa acacagttgg ccaatatgga 
36361 acgggtcggc agtagttctg acgcggcttt ggcagcagtt tatgtcccta ctggctatac 
36421 agaaattgct ggtatggcac gtagtgcctt gttagggcag atcattcaac catggttcta 
36481 tgatcaatta cggacagaag aacagcttgg ttatgctgta ttttctttcc caatgtcggt 
36541 tggtcatcaa tggggcatcg gcttcttact gcaaagtaat agtaaggaac ctaattatct 
36601 ttaccagcga tacctcgcat tttatccgca agctgaaaaa cgcctgcgcg aaatgaagcc 
36661 cgatgatttt gaacaatata agcaggggct ggtcaatcaa ctattgcaaa ggccacagac 
36721 attagatgaa gaggcagagc gctaccgtaa agacttcaat ctcaataatt ttgcatttga 
36781 tagccgtgag aagatgattg ctcaagtgaa acagcttacg gctaatgaac tggcggattt 
36841 cttccagcaa gcggtcatta aaccacaagg tctggcactg ctttctcaag ttaaaggtca 
36901 ggggcaggcc ggaggattcg cagtaccgga gggatggacg acttatccaa ccacttccgc 
36961 tttacaggct acgctgccgc agaaggtatt ggcaccatga cacccacgac tccccagcgg 
37021 cttgagccgc tggcgcttcc cctctatggc gagaggctga tagaagcctc tgccggtact 
37081 ggtaagacat ttaccatcgg ggtcctttat cttagattgt tgcttgggtt agggggcgat 
37141 gctgcattcc gtcgcccttt gatggtggag gaaatcctag tggtgacctt tactgaagcc 
37201 gcgaccgaag agctcagggg gcgaattcgc gataatattc atgggttgcg tatcgcttgc 
37261 gttcggggag tcagtgatga cccaatgtac caggctctat tagccgaaat caccgattgg 
37321 gccgtagccg ccgcgcaatt attggcggct gagcgtcaga tggatgaagc cgcgatttac 
37381 actattcatg gtttctgcca aagaatgctg gcaaataacg cttttgaatc cggcatttta 
37441 ttcgaacaaa cgttagttca ggatgaactc ccgttatggc ggcaggcctg tgctgacttc 
37501 tggcggcgtc attgttatcc gcttccctta gtcatcgcac gggcggtcag tcaggagtgg 
37561 agcgggccgg aggcattact cagtgatctg agtgcttatt tacaaggtga aacgccaaag 
37621 tttcggcaag cgcccggtga cgatgaaacg atccttagcc gccatcagca cgttgtgaca 
37681 cagattgatg cacttaaagc agagtggcgc ttagccgccc ctgaattaga agcattaatt 
37741 agtcgttctg gcgttgataa acgcagttat agcagcaagc acttacccaa ctggctggaa 
37801 aaagtaggat tatgggcgga gcaggaaacg gaagattatc aattacccaa agaactggaa 
37861 aaatttcgcc agtcggtgtt attggaaaag accaagaagg gagccgcccc acaacatact 
37921 ttatttagtg cgatagactc gatatttgaa cagcccctaa cgttgaggga tcttatttta 
37981 gcccgagcca tcagcgaaat acggttttct gtacagcagg aaaaacggca gcgtgcagag 
38041 ctggggtttg atgatttact cagtaagtta gatgccgcat tacaacaatc ggggggtgag 
38101 ttgctcgccc aatctattcg tacccgttat cccgttgcaa tgattgatga atttcaggat 
38161 accgacccgc aacaataccg cattttccac accgtttatg gcgggcaaga agcgtgcgct 
38221 ttactgttga tcggtgatcc taaacaggct atttatgctt ttcgtggcgc agatatattt 
38281 acctatattc gcgcccgttc agaagtaagc gcccactaca cattggaaac caactggcgc 
38341 tcttcttttc ccatggtgca atcagtgaac aggctattta gtctggttaa tacgcctttc 
38401 ttatttgagc aaatcccctt tattaatgtc gccgccgcca aaaagaatca tgatttatcc 
38461 tttgaaatta aaggagaaaa gcaacctgcg ctacattttt ggctgcaacc cggtgacggt 
38521 gttggtgtaa gtgagtacca acaagtaatg gctagacaat gtgcagcgca aattcgtgat 
38581 tggttgactg ccggccagca ggggcaagca gaactactga ccacgaaagg accaaaaccg 
38641 gttcaggcat cagatataac catactggtg cgtagccggg cagaggccgc attgatacgt 
38701 gatgcactta gtgcgctggc gatcccatcg gtctatttat ctaatcgtga cagtgtcttt 
38761 gagactacag aagccaaaga tctcttgtgg ttactgcaag ccgtgttagc accagagcag 
38821 gagcgcgcac tacgtagtgc tatggctacg gggatcttgg ggttggatgc cagaatgttg 
38881 gatgcgttaa atcacgatga gcgagcttgg gatgcgctgg tcgatgagtt tgatcactat 
38941 cgccagcatt ggcagcgccg tggggtatta cccatgctgc gggagatgat ggcccgccgc 
39001 catttagctg aaaacttgtt ggcaacaccc ggtggggagc gtcggctgac agatttacta 
39061 catcttgggg aattactgca agaggccgcc tcgcaactgg atagtgagca cgcactaatt 
39121 cgctggttgg cccagcaaat tgctcaaccc aatccccagt cagatagcca acaattacgc 
39181 ttggaaagtg atcgccattt ggttcgagtg gttactatcc ataaatcgaa agggttggaa 
39241 tatccattag tttggctgcc ttttatcggt aattttcggc aacaacagaa tgtgctttac 
39301 catgaccgac atagctttga ggctttactc gatcttaatg ctgatgaaga aagccaggct 
39361 ttagctgaag aagaacgtct ggctgaggat ttacggctat tgtatgtcgc gttgacgcgt 
39421 tcggtttatc attgcagcgt gggcattgca ccgctgatta aaggggggag gaaaaagcaa 
39481 ggcgagagtg atatgcatcg cagtgcgtta ggttatttgg ttcagcatgg tcaacccggt 
39541 gatgctcatg ttctaagcga aaagttaact gagttagcgc gctcatcgaa tggtgatatt 
39601 tgcgtttcgc ccgttgagcc tcctgatgat aaaccttggc aaccacagcc cgaagccttg 
39661 cctgaactta ccgcccgcca ttttactcga aaaatgcaag attattggcg cgtaaccagc 



39781 gtatcgcagg ctctgggggc accattacag gtcttgttgc cacggttgga cattgatgcc 
39841 gtaggtgagc aggcggtggt gacagaagac gatcatttaa cacctcatac cttccctcgg 
39901 ggagcggccc cagggacttt cttacatgat ttgcttgagc cattagattt tagtcaaccg 
39961 attgaattgg cgtggttgtc tgagcggcta cagcaacagg gttttagcga acagtgggca 
40021 ccggtgttac accagtggct aaatgatata gtgcaaacgc cgcttaccga taccggggcc 
40081 actttggctg ggttgtcgcc agagcgcaaa caggccgagc tacaatttta tctacccatc 
40141 gatagcctgc tacgggccga ggtattagac cgtttaatca aacagtatga tccactttcc 
40201 cggcagtgcc ctgtattgga ttttcaacag gttagaggga tgctcaaggg gtttattgac 
40261 ttggtttttt gctggcaggg gcaatattac ttgctggatt acaaatccaa ttggttgggt 
40321 gaggacagtc gtgcttatac cgttgaggcg atgacacaag cgatggctga acatcgctat 
40381 gatctgcaat accaacttta tactttggct ttacaccgct atttacgcca tcgtttggcc 
40441 gattatgact atcaacgcca ctttggtggg gttatttacc tcttcttgcg cggcgttgat 
40501 cagcaacatt cagggcacgg tatttttaat tgccggcctg aacagaaact gattgaagaa 
40561 atggatcgct tattcagtgg cacaattggc agtaacgcaa ccgcgagtga caaaagcggt 
40621 agcaatgtca ttcacagcgg cgagagggca tcacgatgat gacgctattg gctcaggctg 
40681 ctcgagaccg cttattgcgc ccactggatg tgcagttttc acgaatgatt gccggcgatg 
40741 atgacccgcg attacaactg gccgcagcga tccttagtgc agaagtaggg gccggtcatg 
40801 tttgcctgcc attacgctat ttacagccag aactgctatt cggtgggcgg caacccgatt 
40861 tatcactggc gctatggcag gcggcgggct cgccagataa agcacagtgg ctacaggcat 
40921 tacaaaacgc cccagtagtc agtgatggct ctcaaccaac accattagtg ctacagcagg 
40981 aacgtttata tctgcaacga atgtggcagt acgaaggaga tgtcgtccaa tttattgcca 
41041 gtgactctgt ttttataaac agagatagtg attttatgac cagtaaaggt attgccactt 
41101 ctgttgatga atcgctatta cgggagacgc tagacgcttt gtttggttgt gctggtagtg 
41161 aggttgattg gcaaaaagtg gcggcagcgg ttgcggctac ccgtagaatt tcagttattt 
41221 ccggcgggcc gggcacgggg aaaaccacca cagtagccaa acttttgaca gcattgatac 
41281 ggcttagcca ggggcaacgt ttacgtatta aacttgcggc cccaaccggt aaagccgctg 
41341 cacgccttac tgaatcactg ggcaaagcga ttcggcaatt ctttcttact gatgatgaac 
41401 gtaaattatt ccctgatcag gcttcgacct tgcaccgttt actgggtgtt cagcccaata 
41461 gccagcgcct gcgttaccac cgtggtaatc cattaaatct tgatgtactg gtcgttgatg 
41521 aagcatcaat ggttgattta cctatgatgg cgcgcttaat tgcggcatta cctgccaagg 
41581 ccaaggtgat attccttggt gaccgtgacc aactggcctc ggttgaggcg ggggcggttc 
41641 tgggggatat ctgccgtttt gccgaattag gttacagcga gcaacgggca cagcaattaa 
41701 ctcagttaac gggctattta ttaactaata atgcattaac cagcaatgta ctaaccaata 
41761 aggcattagc caataaagcg cagccagatg aaacctattc tgattcagcg aatgtacgcg 
41821 atagcctgtg tttactgcgt aaaagttacc gcttcgatga aaagtcaggg attgggcagc 
41881 ttgctctggc cgttaatgca ggggagtatc ggcaggcttt atccgtatta aacagtgcgt 
41941 attctgatgt tgaacgcttc ccgctggccg atgaggaaga ttatcaggta ctgctagagg 
42001 cttgtgctgt ggggtatcag cattacctcg aacgggtggc tgcgttagcc cctgcggcag 
42061 aggtattggc agcatttggc cgctaccaac tgctgtgtgc attacgttcc gggccattcg 
42121 gtgtcagtgg cttaaacgaa cgcattgaac aggttttaca tcgtaaagga ttcattatcc 
42181 gtccatcagg gccttctggc cgttggtatg ttgggcggcc agtgatgata gaacttaatg 
42241 atagcgcact tggcttattc aacggggata ttggtattgc cttacatgat aatgaaggcg 
42301 agctacgggt atattttcaa ttgcccgatg gcaatataaa gtcagtacaa cctagtcgcc 
42361 ttcccagcca tgaaaccgct tatgctatga cggttcataa atcgcagggt tcagagtttg 
42421 aacacacagc attggtatta cctaacacgt tcatgcccgt gttaactcgt gaattggtct 
42481 ataccgccat tacccgcgcc cgtcagcatc tgacgcttta ttgcagcgat gccgtgctaa 
42541 gtcatgccat acgtacaccg actctccggc tgagtgggct agttgatcgc ttgaatacat 
42601 taaatcgcca ataatctcag ggcataagcg ccctgagtta tctgtttcaa ataagtattt 
42661 caaataagtg tttcaaataa gtacttccaa taagtgattc gagtgcgtga gcgtagccgc 
42721 cgccctgacc cccaatttca tttacgcatg tataaagggg atagggggct atagatcggc 
42781 cagtaagatc ttcgaacggc gctgatagtt atacagttcc tgtttttgaa taggtaaaac 
42841 atctacctct gcaggtgtaa aaccgcgttc ctggaaccag tgaatgctgc gtgtggtcag 
42901 cacaaacagc tttttaaggc ccatttgtcg cgcctgattg gttatcctat tcagcagcat 
42961 ttctccccgg gaggagctgc gataatctgg atggaccgcg acacaagcca tttcaccaat 
43021 atgttcatca gggaacggat agagggcggc gcaagcgata gttaaattat cccgttcaat 
43081 aatagtaaac ttatcgatct ccatctctaa ttgttcacgt gaacggcgca ccaagatccc 
43141 ttgttgttcc aatgggcgaa tcagttccaa gataccgccg atatcgttga tggtggctcg 
43201 gcggacttgc tccgcacttt ccatgacaat ctgggtaccg ataccgtcac gggagaagag 
43261 ttcttgaatc agagcgccat cttcctgata gctgagtaaa tggctacggc gcacaccgct 
43321 gcggcaggct ttaacggcac cacgcaagaa acgtactgtg cccgaattgt aatccccatc 
43381 ctgttccagg tcttcaatcc gcttttgcgc atcgtttggg aacaattctg aaataatatt 
43441 accttcgcta tcagtcaccc cttgtgatga gcagaatcca atcattttct ccgctttcag 
43501 tttgatggct aactgagttg caacttcctc tgatgtgagg ttaaagcttt cacctgtgac 
43561 cgataccgca actggcccca gcaaaacgat agcgccatta tccaattggc gatgaatagc 
43621 ttcttcatca atgcgccgga tacgcccgct atggcaataa tcaacaccat catccacacc 
43681 tagcggttga gcaatgataa agttaccgct aactacgttg atatgagcac cttgcagtgg 
43741 ggtgttgttc aaactcattg aaagcctggc cgtgatatcc agttgtaata aaccggccgc 
43801 ttgcttgacc atctccagag tccgggcgtc agtcactcgg gtatgtttat gataaatagg 
43861 ttcgtagtta tgatctgcca gattactgtc aatctgtggt cgggcaccgt ataccacgac 
43921 taatctgatc ccaaggctat gtaataagcc tatatcgttg acgatactag agaagttttc 
43981 atgttcaatc gcttcgccac ctagcatgac aacaaatgtc ttgccgcggt gcgcattgat 
44041 atagggaact gagtggcgaa atccttgaac cagttcagta ctacgttcct tcacggcaga 
44101 tcctcttttg aatttttatt cggtattttt gtatttttat tcttttttga ccaagatggc 
44161 aagcacgaaa ttgatactaa ctgcatatat ataaagtgta tgtattaata aactgagtca 
44221 cataaagatg tttttctgat gacagcgtta aattgattcg ttaaagtttt tcctcataat 



44341 aaatcataat tctggtcgtc gccgtctatt acaaggtgcc gcagcagctt ggatgctgag 
44401 tataagccgg gttggatttg ccgcatcttc acatattgtt gctgtccggg tgtggccctc 
44461 ttctacttac acccgagtca cattagagtc aaatacacca ctgaagtacc gccaattcgc 
44521 cttaactcat cccgaccgta ttgttgttga tattgaaggc gtacagctaa atagcgtatt 
44581 gaaagagatc agccggcaag ttcaatccgc tgacccttat ctcaaacagg cgcgggtagg 
44641 gcagttcgat aaaaatactg ttcgtttggt attggaactg aagcagagta tcagccctca 
44701 attatttacc ctgaaacctt tcgctgaatt ccgtaatcgc ttagtggtcg atctgtaccc 
44761 tgcggagggg ggcacatcgg cagaagatga tccgctgctg gccctattgg aagattacaa 
44821 taaaggtaat gtggatcgaa ccttgccacc agaaacaccc aaagcaggta aagcagggcg 
44881 tgacagaccc attattatca tgttggaccc aggacatggc ggggaagatc ctggggctat 
44941 cggcaaaaat aaaaccaagg aaaaagacat tgtgctgcaa attgctcgtc ggttacgggc 
45001 gcttattcag aaagaagcca atatgcgggt gtttatgact cgtaatgaag atgtctttat 
45061 cccgctgaaa gttcgggtgg caaaagcacg taagcttcgt gcggatttat tcatttcgat 
45121 tcatgccgat gcttttacca atcgggcggc tagcggctcc tcggtctttg ctttatcgac 
45181 caaaggagcc accagtactg ccgctcgttt tctggctcag acacaaaatg aggccgatca 
45241 gattggtggg gtcagcaaaa gtggtgatcc ttatcttgac cataccatta ttgatttatt 
45301 gcaaaccgca actatcaacg atagcttgaa gtttggtaaa gaagtactga gtcggatgag 
45361 taaaatcaac aaattgcaca aaaatcgtgt cgatcaagca ggatttgcgg tactaaaagc 
45421 acctgatatc ccgtcaattc tggtagaaac tgcgttcatc agtaatttgg aagaagagcg 
45481 gaaattacgt accagccgtt ttcagcaaca gatagcggaa tctatttttg ctggtatcaa 
45541 agcgtacttt gctaatggcg gggcaatggc gcgattgtga tggatgtttc tgtaaaataa 
45601 tgggtgtttc tgtcaaaaat agagtgacta taaatagcgt ggttatagtt caatacacga 
45661 ggacgaggca tcctaccagg gcaaagcaat tatgcggcag gcgcagataa gcacttgcgc 
45721 ttaggggagt accaaggacg gtatagaacg ccagaaacaa aaaaacaccc gatagggtgt 
45781 ctaatgatga tatcaatatg attaagaatt ggttgcgggg gccggatttg aaccgacgac 
45841 cttcgggtta tgagcccgac gagctaccag gctgctccac cccgcgtccg tcactacact 
45901 attaactcag tatcacttct caactgagat gcatcgctat tttactgaat acatcttaaa 
45961 tttcaacacc atcgaactac atttgatact gctggtaatc agtatcggtt ggttgcgggg 
46021 gccggatttg aaccgacgac cttcgggtta tgagcccgac gagctaccag gctgctccac 
46081 cccgcgtccg attgagcgca ctatactctg gatgagatct gttgcaagta gttttaccat 
46141 aaaattcctg taaatatgtt ttttcgatta atttatcgac tatctgtttt ctttttctcc 
46201 ataaccgtca tcgagatttt tttctgcctg ctgattcctt tatccatagg cgtttgttat 
46261 cgttcgccgg tagataaaat gagtcaaaga gaaggcgtga aatgacaagt cgttggggca 
46321 aatacctact gagtggaata atgattgctg ttctggccgg ttgccaatca cggccaaccg 
46381 atcgtgggca acaatataag gatggtcgcc tggagcaatc actggaactg gtgaacgaac 
46441 ccaatgcggc aggtaagcca gtcaacgcca aagattattc tgatcaggtg aaagtgatta 
46501 atcagtcttc accgggtctg tataaccgca acagcgatac attcaatgcg gtacaaaact 
46561 ggatgctggc cggtgctgat accagtaagt tgagcttgtt tggtttgaat gcttatcaga 
46621 tggaaggtgt cgataacttc ggtaacgtac aatttactgg ttactacact cctgtgcttc 
46681 aggcccgcta cacgccacaa ggcgaatttc gtcatccact gtatcgtatg cctgctaaag 
46741 gaaaacgccg tttaccggat cgggcggcaa tttatgcggg tgcgctggat aaccgtaatt 
46801 taatcatcgc gtataccaat tcgctggtag acaatttcat gatggaagtt cagggcagcg 
46861 gctatgttga ttatggtgat gggcggcctc taacattctt cgggtacgcg ggtaaaaatg 
46921 gtcacgctta ccgcagcatc ggtaaagtac ttattgatcg cggtgaagtc gcaagggctg 
46981 atatgtcgat gcaggctatc cgtcaatggg ctgaaaacca cagtgaagct gaagtcaggg 
47041 agttgttgga gcagaatccc tctttcgttt tctttaaacc ggtaatgtat gcgccagtta 
47101 aaggggccag tgcggtccca ttgattgcta aagcctctgt ggcatcagac aaatctctga 
47161 ttccgccggg taccaccctg ttggcagaag tgcctttact tgatgaccaa gggaaattta 
47221 ccggcaaata tcaaatgcgt ctgatggtgg cattggatgt tgggggcgca ataaaaggcc 
47281 aacattttga tatttatcaa gggattggtc atgaggcagg tcaggctgct gggttctaca 
47341 accattatgg ccgggtctgg gtactgaaaa atgcacaaag cagtggtccg ttgttcactg 
47401 cttataaagg tggcacgcag agcgagccta ccagtaatga ttcgtcatta ctggtgaata 
47461 atcaggatcg ataagtcagc ggatgtgcag ggtaaacatt tgttttgatt aagcggccag 
47521 ttggccgctt ttgcatgagt acttagaatc ttattgagta cgaagcgtct tttttgaagc 
47581 catttattat ttctacgtta ttaaggtaac aagtctatga gcacagccta ttccgaagct 
47641 taccaacagc ggtttggtgg tattgcacgc ctctatggcc aacaggcatt ggcgcttttt 
47701 tcacaggcgc atgtttgtgt catcggtatt ggcggcgtag gctcatgggc ggcggaagca 
47761 ttggcgcgta ctggtatcgg tgcgattaca ctaattgata tggatgatgt ttgtgtcacg 
47821 aatactaacc gtcaaatcca tgctctgcgc cacaacatcg ggcaggctaa aacagaagtc 
47881 atggccgagc gtattctggc catcaatcct gaatgtcatg tgacttgtat tgatgatttt 
47941 attactgcgg ataatgtcgc tgagctttta aataagaatt tcagctatgt tattgatgca 
48001 attgatagtg tgcgcccaaa agctgcgctg ttgtcttact gccgtcgcta taaaatccca 
48061 gtggtgacta ccggtggagc gggtgggcaa attgatccaa cccgtattgc cgtggttgat 
48121 ctggctaaaa ccatccaaga tcctttagct gcaaaactgc gcgaaagact gaaaagtgat 
48181 tttaacgtgg tgaaaaacag caaaggtaag ctaggtattg actgtgtttt ctccagtgag 
48241 cctttggtct atccacaagc cgatggctct gtgtgtgcat cccgcagcac tgccgaagga 
48301 cctaaaaaaa tggattgcac ctctggattc ggctcggcaa cgatggttac tgccacgttt 
48361 ggctttgttg ctgtgtctca tgtattgaag aagatgatgg ctaaagcggc ccgtcagcac 
48421 gagtcagccg ccactgcgtc ataatcgatt gttactctgc aacttattgt tctgaataat 
48481 tatcgttctg cgtatttggc cgcgatggtt tgtacacctt gagccagtgc ttgtaatcca 
48541 ttagcacgcg aggtgctgag ttgttggcgc agccccagtt gctcaaacag ggccaagggg 
48601 tctgcggcca gcacctgttg tggtgtttta ccctccacgg cggttagaat caccgctagc 
48661 agcccgcgta ctatgcggcc ttcactgtcg ccatagaaat gcaaagtgcc atctggcaaa 
48721 tgttggtgcc cgagccagac tctgttttca caacccgtca attccagttc attttttttc 
48781 catgcctcct gcaacggtgg gagttgcttg gccagcagaa ttaactgacg gtagcgatcc 



48901 aatgggtgtg gggcaatcat catattctca tcttgtttgt atattgatac ccgccattct 
48961 tcaagctggc ttgtgttgat ggctttcgtt cacttgattg ctgctctctt gcaactggaa 
49021 tcatttaggg tatatcgaaa attatcattg gcagcagtta gtctcgaagc agttccaatg 
49081 ccttaccaag cgcagagcaa agcatttcaa catcttgggg ggtattataa ggcgcaaaag 
49141 aagcccgcag actgccattg actcccagag ccgccatcag tggctgtgcg cagtgctgcc 
49201 cagcacgtag tgcgatacct tgctcggcca ataacgccac taaatcactg tgatgaacac 
49261 catcgaaagt aaatgctaat aaactggatt gctgacaacg gtaactgtga aaacccggta 
49321 attgtgcgag cttattttct gccattgtag ctaaaccttg gctataaact tcggcctctt 
49381 ccagatcaat atgttccagc catttgagta ccgctgataa accaataacg ccagcaatat 
49441 tgggtgtacc cgcttcaaag cgataaggca cttcatgagg tgtaaagccc ccgaatgatg 
49501 catgggtaag cattttaccg ccgccttgcc aggcgggcat ctcttctaat aattcagtct 
49561 tcccatacag cacgccaatg ccggttgggc catacaattt atgggaagag aatgcataaa 
49621 aatcaatatc caatgcctga acatcggctg ggtaatgaac aattccctga gcaccgtcaa 
49681 ccacgacaac acagtcatat tggtgagcca gcctaatagc ctgagccaga tcagggctac 
49741 cgcctgttac gttagacatc tgccccagcg ctaatatgcg tgttttttca ttaagtagtt 
49801 gagggagttg ctgtaaatct ggcagatgat caaggccaag aggtaatttg actatttttg 
49861 caccggtctg ctccgctacc atcaaccagg gaattaaatt cgcatgatgt tcagcttcgc 
49921 tgacaataat ttcatcttca ggttgtaaac gggggcgggc ataactttgc gcgatcagat 
49981 tgatggcttc agttgttcct cgcgtccaga taatattttc tgctgtagat gagttaataa 
50041 aatcagccac ttgttggcgg gtgttttcaa agcgaaccgt taatgacagc gccgattgat 
50101 gttggctgcg gtgtaccgtt gctgaatcct gctgataaaa ttgctgtgtc gcgtcaatca 
50161 ttgcacgtgg tttcaaggcg gtcgccgcac tgtccagata ggttaattta tcactgagcg 
50221 cagggaattc ccgacgaaaa tccattggat taaaaacttt catactgttg ttggcatcct 
50281 gtgagtgacc ccgaaatttt tttgctttta tcatggtcgc gtgtttgagc aatgatgtta 
50341 tgcatattga cgttatacac attagtgatg gtcgcactat cgttactgat ggtaaaccga 
50401 aagtagacga tgagatagag atgatcagtc acccctgtgt gaatgggggc cgagaatagg 
50461 taaatcgatc agacgtaaaa gaaatggtta aagataacga gtttaatgct aaaccgaggg 
50521 agagaagaga ccggatacaa cattaaggtg gtagcaacaa caagttgaat ttcatctcga 
50581 aaataggata aaaaaaagca ccgccaaaat aacggtgctg cataaaatca ctatggacag 
50641 acagggtaaa tgtacaggaa gtgaaaaaaa gcagtagctt agctactatg tctggcatac 
50701 cagaccattt gcaaacacaa catcaccacc acaaagccaa aagcatttca gtacctggga 
50761 taccaaacct acttttcgtt ccggctaggg gaagtgcggc cactataggt atttactggc 
50821 gtatcctcaa tggacaattt ataatggctc ggattataaa aactaataag taaacaaagg 
50881 gtttcatctg atgatgggcc gttataaaaa agtgtccaat tcatgtcaaa acgattacca 
50941 cccctgaatg ccttacgggc ttttgatgcc gccgcccgtc acctcagttt cactaaagca 
51001 gccgaggagt tatttgtcac tcaagccgct gttagccacc agatcaaatc gctggaggat 
51061 tttctcgggc tgaaattgtt ccgccgacgt aatcgctctt tgttgttgac cgaagaaggg 
51121 cagagctatt acctcgatat caaagagatt ttcacgtcta ttaatgaagc gactcgtaag 
51181 ttgcaagcgc gcagtgcgaa gggagcgtta accgtcagtt tgccgcccag ttttgccatt 
51241 caatggctag ttcctcgttt gtctgggttt aatgcggctt atccaggcat tgatgtgagg 
51301 atccaagcgg tggatcggga agaggataag ctcgctgatg atgtggatgt cgcgatattc 
51361 tacggccgcg gtaattggag tgggttgcgt acagagcgct tgtatgctga attcctatta 
51421 cccgtctgtg cacctagctt actaactggt gagaatggat taaaagtacc atcagatctg 
51481 gctaatcaca ccttgttgca tgatacttca cgccgtgact ggttagcgta tacccgccaa 
51541 ctgggggtac cgcagattaa tgtgcagcaa ggcccgatat ttagccacag tgccatggtg 
51601 gttcaggctg cggttcatgg tcaagggatt gctctggtga ataatgtcat ggcccagtct 
51661 gagattgaag cgggccgatt ggtttgcccg tttaacgatg tattggtgag taagaatgct 
51721 ttttatttgg tttgtcatga cagtcaggca gaactgggta aaatagccgc cttccgtcag 
51781 tggatactgg caagagctgc cagcgagcag gagaaattac gttttcgtta tgaaaactga 
51841 tcaatcataa tgtttggtta attgtaatgc ttggttaatc actgcacttg attggttaca 
51901 gtatctgcgc agtaatgaca acccattcat gatggtaacc tgccacgaat agggcgatta 
51961 ccgggcgata taacgataac tttgtctaat ttaaggttag taacgatgaa cagtcgttta 
52021 atgctgatat tttccgctct gagcggtttt ttctatgttg ctttcggtgc cttcggggct 
52081 catgtattaa gcacctctct ggggctgaat gaaatggcct ggattcgtac cgggctggag 
52141 tatcaggggt tccatacgtt ggttattctg atattagctg ttgcgatgca acggcaggtc 
52201 agtctttggt tttactggag tggggccttg ttagcgctcg gaacgctact ctttagcggc 
52261 agtttatact gtctggctct atcacacctg agactgtgga tatatatcac gccagtaggc 
52321 ggagtctgtt ttctggcggg atggattttg atgttgattg gcgcactgcg tctaagaaaa 
52381 agggcagaac gccatgaata ataaaattgc tttatattgc cgctccggct ttgagaaaga 
52441 gtgtgcggca gaaataacag agaaggctgc acagctcgaa atctttggtt ttgcacgggt 
52501 aaaagaaaac agtggctatg tattgtttga atgctaccag ttagaggatg cggatcgact 
52561 catccgtgaa ataccgttcc gcgagtttat tttcgcccgc caaatgatgg tggtggggga 
52621 gttattgaaa gacttacccc cagaagaccg tgtgtcaccg atcgtgggaa tgttggttgg 
52681 cgttattgaa aaagctggcg agttacgtgt tgaggtggcc gacaccaatg aaagcaaaga 
52741 gttactcaaa ttttgccgta agctgacggt acctttgcgt agtgctttgc gtgaacagaa 
52801 aattctgtcc gctcgtgaga atgcacatcg gccagtggtg catgtgttct ttattgcgcc 
52861 gggttgttgt tatgttggtt attcttacag caataataac tctccattct atatggggat 
52921 ccctcgactt aaattcccct ccgatgcacc aagtcgctca accttaaaac tggaagaggc 
52981 attccatgta tttatccccg cggatgaatg ggaggagcgt ttagccagtg gtatgcatgc 
53041 tgtcgattta ggcgcttgtc ccggtggctg gacttatcaa ttagttcaac gcagcatgat 
53101 gatacaagcg gttgataatg gtttgatggc acaaagcctg atggatacgg ggcaggtcac 
53161 acatcaccgt gctgatggtt ttaagtatga gcctacacgc agcaatattt attggttagt 
53221 gtgtgacatg gtggaaaaac caacaaaagt cacgcagctt atcaccaaat ggctagtcaa 
53281 tggttggtgc cgtgaagcga tcttcaacct gaaattacca atgaaaaagc gttatgaaga 
53341 agttgtacaa aatttagcaa tgatggatga gcagttgaaa gaaaacggta ttaatgctga 



53461 ctggtcgggt gcaccgggcc gccgtgatga gcgttattag gttttagcga tagcgtaaca 
53521 tcatatttaa ggggcctgtt ggcccctgag gttattgaca aagtgcccgc gacggagaga 
53581 acaggcagat cgtaaagacg ccgtaaatac atccatgtag taggctcgag ccgcgccatc 
53641 cttggcgcgg acgctttact cctctgccta tcctcaccgt tcaagatcgc gtcatcgggg 
53701 tttgtcagca gtctgagggg cctattggcc ccttagtttt tatgcgtata ttattttttc 
53761 aaacgtaact gttgtagatt gccactcaat aataagtccg ttttcagcgt ggcgatttgt 
53821 ttgcaggtaa acgccatctg ttggtgttgt tgtaattttt tacgccattt ctccggtatt 
53881 gattccagat tctgatagag cacttccaac gtatcggctt gttgcagtag cagcgtagcg 
53941 gttttggcac caacgcctgc aacacctgga attttactgc tgctgatccc cgccagcccc 
54001 cagtaatcag gtaattgacg aggcaataca ccaaactcct gcttcacaaa cggcatatcc 
54061 agccaccgct tttggaaata gtcacgaatt tggatattgg gagccaataa ctggcaatag 
54121 cctttgtcgg tagaaacgat ggttacctga tggcccgctc cagcaacttt taccaccaga 
54181 gtggcggcta agtcatcggc ttcattaccg ggtgaatgcc agcatgctac ccctaattca 
54241 ttaaatgcct ggcgaatgag tggcatttcc tgttgcaggt tatcaggcat gggagagcgg 
54301 ccagctttgt aatctggcag acattgatgt cgccagctat cagagcggtc atcttcatca 
54361 aaaacggcaa cggcatggct gggttggctg tgttgaatca gttgttgcag tgcatgctgg 
54421 caggccttaa cacaaggtga cccctgcacc gcatgaatac ggcgaatcaa attaagagca 
54481 tcgacaataa gaaggtgtat ttgcataacc tgttttatgt aagggctgaa tcgtcattaa 
54541 gcccgagtaa tattgggcaa cgttagtcgc ccagtgaatt cccttcttac ttggcgttgc 
54601 agcagctgtc gtgacttgag ctatttacag gcctcgtctc gacgaggccg ctgcaagcag 
54661 catttaaatt tactcttggc agatttgtca ttttttgctg ctcaactaca acgccaatta 
54721 ctttgggttc aattgatcta ttcatcaact acagttatct attcatcagc tacaaatttc 
54781 atagcaaggt gtatagacgc tacctggtaa cttcatccga tgttgagcaa taaaatcttg 
54841 cagtaggcgg tccatttgct tcattaattg cggatcgcca tgcagtttaa atggcccatg 
54901 cgcttcaata gcctggatac ccacttcctt gacattacct gccacaatcc cagaaaacgc 
54961 ccggcgtaat gcggcggcca gtttctccgg tggttgatta tgagacagat ccagattagc 
55021 catattttca tggtttggtt cgaacggatg ctgtaaatct ggttcaatac gaatagacca 
55081 gttgaagcta taagcatcgc cggtattacg gcgatactct ttcaccagtg gcatcgcttt 
55141 cttcatctga cgggcgactt cgtctgggtc atcaataatt atctgatagt atttacgagc 
55201 gccatcaccc agcgtattca tgatgaactc atccaccact ttgaagtaat ccgcactctc 
55261 ttttggcccg gttaaaatca gcggcagcac ttgctcacta ttttccggat tcatcaaaat 
55321 ccctaataga tagagtaatt cttctgctgt gcccacccca cctgggaaaa taatgatgcc 
55381 atgagcgatt cggacgaagg cttccagcct cttctcaata tcgggcatga taaccagttc 
55441 attaaccaga gggtttggtg gttctgcagc gataatggac ggttctgtca tgccgataaa 
55501 acggcctttt ttataatgtt gctgagcatg tccgacagcg gcacctttca tcggggcttc 
55561 catggcaccc ggaccacagc cggtacagat atttaattca cgcaaaccca gttggctgcc 
55621 cactttacgg gcgtacaaat actcggtttc attaattgaa tgccctcccc agcacacgac 
55681 catattgggt tcttcatcaa tatgtagggc acgggcatta cgtaaaatag aaaagaccag 
55741 gttagtcaaa tgactggagc tctccatatt caaatgctgg aagttgctcg cactgacgat 
55801 ttgaccattg acaaataaga tatcgcgtaa taccgcgaac aggtttgctt gtaatgagcg 
55861 aataattttg ccgtcaacaa acgcatgttc tggtgggttg accaactcga gtttaactcc 
55921 tcgctcacgg cgaagcacgt tgatatcaaa ggtttcattg cgagataaca actctttgct 
55981 gttatcggtc aggctacctg aattgagaac agccagtgaa caattacgga ataaacggta 
56041 caaatcactg ctggcggtac gtttgagcat atccacttcg agctgtgata acagatccat 
56101 tgagccgagc gggctgacat gtgtaatcaa aatgactcct ttgcccctga ggggggtatt 
56161 ccgtgtatca cggtaggaca gcagaaaagg ccgtaaaata ctgtaattaa ccttaacgtt 
56221 gtctttcttt ttttaccaat actaacgtgc agtgagcggt aatattgcgc acaggcgcgc 
56281 atttattgcc gtgccaagcg gccggttagc ggtacaaaat ctgtgttgga acgccatgga 
56341 ttgatatcca gcccgcctct acgggtataa cgggcataaa cggttaatgt ttcaggctga 
56401 caaaaacgca taatgtcgtt aaaaatgcgc tctacacatt gttcatgaaa ctcattgtgg 
56461 tggcggaaag agaccagata gcgcagcagt gcctcgtggt caatctgtgg gccgcggtag 
56521 tgaatttgca ctgatcccca atctggctga tgggtaatca ggcagttaga ttttagcaaa 
56581 tggctgacca ggctctcctc cacatgatct ttacccgcag caccttgcag gtaatcagca 
56641 ttgaattcat agctatcgat acgaatatct tgtttatcca gacattcgcc attgaaattg 
56701 gagatgggtt ggcccgttat ttcatccaga tcatacagtg cgacactgac gttaccttga 
56761 gcacaggcag ccagatctcg ttgtaaggtc attcgcacgc tttctttatc ggcgaaaatg 
56821 gtttggttaa aactgttgag ataaagttta aagctttttg actcgatcag attgatgcta 
56881 tctgcgttta aacttatttc gcccacagca acttgcggca ggccattact gtttaaccaa 
56941 gataattcat atagcgtcca aatatcagca ccatggaatg gtaaattgtc tggatatagg 
57001 ccaagcggtt cacggttcat gctgcgcggc acagcctgca acagcgtggc atcatagtag 
57061 tcgcagtaag ccgttggttt gcccagcgtc agttccgcca acgccttgtg gttttgatat 
57121 gaagacatgc tgttcccttg atgagagatc ggtacaatag cgctagttta tcaatttatg 
57181 ccgcacttaa ctgtatttct gagctggggt cagttaccac tcaggtgcag ggcaagccgc 
57241 cgcgatacag agaataagaa tggacttgaa tatttcaacg gcacttcgca gttttactca 
57301 acgctatatt gatttatggc agcagcaaac cgggcattta cccgcaagta aagagctata 
57361 tggcgttcct tcaccttgta ttgtggagac gggggaggat caggtatttt ggcaacctca 
57421 ggcgtttctg ccagaagcga cgttgaccaa tattgaacgg gcgttggaga ttcagctaca 
57481 tcccgatatt catgattttt acactcagca atacgctggc gatatgatgg ccgatttggg 
57541 aaaccatcgt tttacattgc tgcaagtgtg gagtgaggat gatttcatcc gtttacagga 
57601 aaatctgatt ggccacttag ttacacagaa aaggctaaag ctctcgccaa cgttattttt 
57661 agcgactacc tcgtcggaaa tgacgatggc ttcattatgt aatgtcagcg gaaatgttgt 
57721 tttggagcaa tttggtagtg ataaacgcac actgctggct tcaacgctca gtcacttctt 
57781 agatgcatta cggccagtat tgcctgagta accccatggt tttcacttat tagcgtgtta 
57841 ctggtgtgcg agatctctca caccagctgt gagacatcac ttctattctt tctatgtaaa 
57901 attctcgtgt tagatttata tccaataaaa tcatttcgtt ataaatattc atttttccat 



58021 atcctttact cgctggatag gatctggcgg agagggaacg catctggaag atgtgtcttc 
58081 aggacgaaga acacagggac tgcttaggac gagtgaaggg acgtttcagg atgaaacaag 
58141 ggacacctcc aggatggaga ttgagagcca gttcaggatg attggtgggt taggatagcc 
58201 taaggattaa cgccgggatg gtgtataaca ttgcgatgga ttgctggtta gtataaccat 
58261 acggaaaagt tttcagggat tgagcaggga gcatcaattt tagctggatt gctataaaac 
58321 gaattgaggg gtactggtaa aacagtaccc cttttttgtg cccagaaaac ccccagctag 
58381 gctgggggtt cagtaaagct ttcagctttg ggtcagttat aaaaacccct tttgatttgt 
58441 taaaacagtt tgcggtctgg caactgcaaa tgttcaacaa gaaatcaaaa gggggtccca 
58501 atgagggatg aaaagagctt agcgcacacc cgatggaact gtaaatatca tatagttttt 
58561 gcgccgaagt accgaaggca ggtgttctac agggaaaaac gcagagcgat tggcagtatt 
58621 ttaagaaaac tgtgcgaatg gaaaaacgtg aatatcctgg aagcagaatg ctgtgtggat 
58681 cacatccata tgcttctgga gatcccgccc aagatgagtg tctcgggatt tatggggtac 
58741 ctgaagggaa agagcagtct gatgctttat gagcagtttg gcgatttgaa gttcaaatac 
58801 cgtaacaggg agttttggtg tcgagggtat tacgttgata cggtagggaa aaacacggcc 
58861 aggatacaag aatacataaa gcaccaattg gaagaggata aaatgggtga gcaactctcg 
58921 atcccgtatc ccggtagccc gtttacgggc cgtaagtaat ccatagatgc aaatgtcaga 
58981 tcgcgatgcg cctgttaggg cgcggctggt aacagagcct tataggcgca tatgaaaaac 
59041 ctccggctat gccggaggat atttatttgt ttttttgttt tgttgtatag attttgtctc 
59101 gcctgaatcc attgctgatg gcgttaattt gggcatattc tcccttcaat ggtatgcttg 
59161 cccaccgttg aaggtgctgg taagtggcaa tttttgggtt gagagcataa tcatgagtac 
59221 agaaaataag gtgcgtcaaa gcttacagga tattgagttt gccatgcgag agactgacct 
59281 ctggcaaacc gtcccaccgg aagcagaagc ttttgagagt aacgagcctt ttagtattga 
59341 tactatggcg gcagagcagt ggctacagtg ggtgtttctc ccccggatgt atgcattgct 
59401 ggagctaaat ggcccgttcc ccacccgttt tgctattacg ccgtattttg aagaggcttt
59461 gagcggtgat gggcggccag attactcggc gcttttggca caattacggt gccttgatga 
59521 tctgttgaat aaagaaagtg aataatgctg gaaattatct atcaagatga acacattgtt 
59581 gcggtgaata agcctgccgg ttggttagtg catcgtagtt ggttggaccg caatgaaacg 
59641 gtatttgtta tgcaaacggt acgtgatcag attggtcagc atgtgtatac cgttcatcgc 
59701 cttgatcgcc cgacatcagg cgtattgctc atggcattgt ccagtgatgt ggctcgtatg 
59761 ttgtcgttac aatttgagca acaccagatt cagaaaacgt atcatgcggt ggtgcgcgga 
59821 tatgtattgg aaggtggcac cgttgattac gcgatggccg aagagctgga taaaatagcg
59881 gataagtttg ctaagtctga taaggctcca cagccttctg tcagccatta tgaagcatta 
59941 gcgcaagtgg aagttccttt ggcgattggc cgctatgaaa ctgctcgcta tagcttggtg 
60001 gcattgaaac cggaaacggg tcgtaagcac cagttgcgtc ggcatatggc tcatatccgt 
60061 caccccatta ttggtgacag tacacatggt gatttacgtc agaacagagg tgtcgctcag 
60121 cattttggtt gttcacggtt gatgctgcat gccagccatt tacacctcaa tcaccctgtg 
60181 accggtgaag cgctgacgct taccgctcgt tgggatgaac cctggcaagg gttgatgtca 
60241 caatttggct ggtcgggtat tgcccctcat cttgaaaggg ttgagtttcc gctgactgcc 
60301 agtcaggata atgaataatt ttttggctac aaaggaatat cttttatggc tcaagttggg 
60361 atttttgttg gtactgtcta tggtaatgcg ttgttagtgg cagaagaagc tgagaacatc 
60421 ttgaaagatc aagggcatga ggtaaagtta tttgacgaag gcacgttaga ggactggcag 
60481 cattaccgtc aacattatgt gttggtcatt acctcaacca ccgggcaggg cgatttccca 
60541 gatagcatcg catcactatt tgttgctgtt cgtgatcagg tggggttcca acccgagtta 
60601 cgctatggtg tcattgcgct cggcgacagt agctatgata atttctgcgg tgctgggcgt 
60661 acgttcgatg agctattgca ggagcagggc gcgacacgga ttggcgaacg gttagagatt 
60721 gatgctattg agcactcaga accagaggcg atatctgcgc catgggtgga acaatggggg 
60781 cgtttgcttc agtaatcttt atatcttgat tttattgcgt attgtggcaa gtttcgttgt 
60841 cacaatacgt attcggttac cgaggttatc atgccacccc tgatttagcg gccattggtg 
60901 agcttttcta aatcagcttc aatttccgca atcttattgg tgacaacact ttccaggtga 
60961 cgtaggtcat ccagaatttt tcttttcaaa tcaacttcaa tctgatcacg cttacaaagc 
61021 tgatccaact catcaatgac atagcggaga ttagggttga tttcattgat ttctttatag 
61081 ccatggcttg cattatcaga cactacagtc ttgcgttggc gtggatattt gaattttaca 
61141 ctcttggcaa aaaactcccc tttatctttg cggaagtaaa ttttcagaat gtcattattg 
61201 gcctcttgac gcaggctata gcgatcaaca tcttctggtt gtgtgatgcc taaacttttt 
61261 agattgtcat acataacgcc acctttttga ttttatattc cacagttttc tattatggcg 
61321 gaaaatgtcg ccaatgactt tggtatgcaa ggcgtaaggc aaaaaaatag cgggtggtta 
61381 gcccgctact cttaatctta gtcgatggtg cgtaataact cattaatacc gactttactg 
61441 cgagttttcg cgtcaacttt tttcacgata actgcgcagt acaagctata gctgccatct 
61501 ttcgatggca agttgccaga gacgacaacc gaacctgctg gtactcggcc ataatggact 
61561 tcgccagttt cacgatcata aatacgggtg ctttgaccga taaacacgcc catggagata 
61621 acagaacctt cttcgacgat aacgccttca accacttctg agcgtgcgcc tacgaagcag 
61681 ttatcttcga taatggttgg gttcgcttgc agtggctcca agacgccgcc aataccaacg 
61741 ccaccagata gatggacgtt tttaccgatt tgcgcgcaag aacccacggt ggcccaagta 
61801 tcgaccatag tgccctcatc aacaaaagca ccaatattga cataggaagg catcaacacc
61861 gtattgcggg caataaatgc gcctttacgc acggttgctg gtggaacaac gcggaaacct 
61921 tcacgttgga aacgcgcttc atcatagcct gcgaatttca tgggtacttt gtcatagtaa 
61981 cgggtttcag cgccttccat tacttgattg tcattgatgc ggaaagaaag cagtaccgct 
62041 tttttcagcc actgatgcgt cacccactga ccatcaattt tttctgcgac gcgcagcgcg 
62101 ccggtgtcca gtaaatcgat cacatgggta atcgcttcgc gcgttacggt gtctacattc 
62161 gctggcgtaa tgtcagcacg gcgctcgaag gcggtttcaa taacgttttg tagttgctgc 
62221 atcttggtct ctttcctgat atcgatatac cctatcttac ttgacattac agtgttattc 
62281 gctgctgcct gtaacgctaa tcgctggggt ataggttgta ataaaatgag tcgtgataaa 
62341 aacgaatagt aaaaaattaa ttttaataaa tattaattac taaaaagata tttgtatcac 
62401 agtttatcgt ttggattgag agtatctgcc aatcgttgtg ctaattcacg acgggtttca
62461 aggctcagtg ctcgacgatc tttatctgct aaaacaaata aatcctctac tcgctcgccg 



62581 cgtgccagta accctggttg atctaacgca attaattcca gataagtacg ccgctcgtta
62641 tgggttggta aaaagttagc ttcggttggc acgctgaaat gccgtaattt aggtgataat
62701 cgcctaaccc gcggcgggtg ttggtaatct gaccggttta tcgcttgctg taatgcatgg
62761 ctgattattg gatgcctatc ctgtgccaac gggctgccat ctggctccaa tacaataaag
62821 gtatccatcg ccataccatc acgattggtg aaaatttgtg cgtcatggac gctcaagttg
62881 cggcgatcaa gctccccaac aacagcggca aagagtgatg ggcggtcagg gctccagata
62941 aatatttcag tcccaccacg ggtagcctga cggctgacta agactagcgg tttggttgag
63001 tcgtgctcca gcaaatggcg ggcgtgccag gcaagttgat tcggcgagtg gcgtaggaag
63061 taatccgctc ggcaacggct ccagatacga tgcagtgcct cttcatcaat attatccatc
63121 cgcagtagag ctaacgcttg caggcggtgg tggcgtacac gctcacgtaa atcagggcta
63181 ttttgcattc cgcggcgtag ctgtttttcg gtcgcaaagt acagttcgcg taacaggctc
63241 tgtttccagc tattccatag attttcattg gtggcgcaaa tatctgccac ggtcaggctg
63301 accagatagc gcaggcgcgt ttcactttgc acttcggcag aaaattgctg aataacggcc
63361 gggtcctgaa tatccctacg ttgggcggtc actgacatca ataaatggca gcgaactaac
63421 caagcgacta attgggattc gcgtgaatta agcccatgct gttcagcaaa ctccaccgcg
63481 tcgtgggcac ccagtattga atgatcaccg ccacgccctt tggcaatatc atgaaacagg
63541 gcggccagca gcagtaattc aggctgaggc aggcgagggt agagttcgac gcagagtgga
63601 tgacgtggcc gggtatcttc atccgcgaag ctttcaattt ttagcaaaac acggatggtg
63661 tgttcatcca ccgtataagc atgaaataga tcaaattgca tttggccaac aatactgccc
63721 cattgcggca tataggccca taaaacactg tgacgatgca tgggcagcaa cgcacgtgaa
63781 acggcaccgg ggtggcgcaa aatggccatg aacagtttac gagcttcagg aatgtgacat
63841 aaaggggctt tcagatgacg acgggcatgg cgcaaacgcc ggacggtagt ggagtaaata
63901 cctttaatat cctgattgcg caccatcaga tagaacatcc gcatgatcgc ttcaggctga
63961 cgaacaaata ggttctcatc acggagatca attaaatctc cccgtagttg aaactcttcg
64021 tcaagtggac gcggtttttc attcgcatct aatgccagta tggcttcatc aaacaattgc
64081 agcagcatat tgttcagttc actgacacgc cttgtcatac gatagaaatc tttcatcata
64141 tgttcaacgg gctcattgcc ttcaccttcg tagcgtaaaa gttgagcaac gctcagttgg
64201 cggtcaaaca gcaggcggtt atcgtagcga gtcaatacca ggtgcagcgc aaaacggata
64261 cgccataaaa aactctggct ttcattcagt tcattacgct cggcgttggt gagaaaccca
64321 aaatcaacca tttcacttaa cgatgtcgca ccgaaatggc gacgggcaac ccagagtaat
64381 gtgtggatat cccgtaatcc accagggcta cttttgatat cgggttccag attatagctg
64441 gtgccgtgat agcgtttatg acgctcttgc tgttcaacaa cttttgcatg gaaaaattga
64501 ggggagggcc agaaactgtc actgaaaata tgtttttgca tctgtaaaaa aagggcgaca
64561 tcaccgcaaa tcaaccgtga ctcaatcata ttggtggcta tggttaaatc tgcgaggcct
64621 tccagcaagc actcttccag tgtgcgcaca ctatgaccga cttccagctt caaatcccac
64681 aataaggtaa tgagttggcc aacccgttgc gcctgttcat catttagacg ctgcttactc
64741 agtactaaca catcaatatc tgataatgga tgcagttcgc cgcgcccata gccaccgacc
64801 gcaaccagag ctgtttctgg caccttgtca aaaccataga atgtccacaa ccgctgtaac
64861 agatgatcga tataatcact gcgtgccgca attaatacct cggcagaggt gccactatta
64921 aatgcctctg cctgccagcg ttggaaagcg tcaagccgtt gcttcagtat cgggcaatgg
64981 atgtcgctat caagataagt gctgggaagt gcgggctgct cagaaatagt gggagtgagc
65041 gtcagcgata gggagtgttc ggtgtgattg tcagacatat gtttacccga tgaggaacgc
65101 agcccataaa aaagccggcg gtgccggctt gtcattatta ctcgtgcgtt atgatgttgg
65161 ggatagtgtc atccttacgc aacgtcatta tttcgcagcc gttatcggtt accacaatag
65221 tatgctcata ctgcgccgac aagttgcgat ctttagtttt taccgtccag ccgtctttca
65281 tggtccgaat ccggtaatcc ccggcgttca ccatgggctc gatcgtgaag gccatacctg
65341 cttgcaggac gacgccgccg tcatccgcat cgtaatgtaa aacttgtggc tcttcatgga
65401 aaccttcacc gatcccgtga ccacaatatt cacgtactac ggagaagttc tctgcttcaa
65461 caaatttctg gatagcttta cccagcgtcc gcaggcggat ccccggttta accattttaa
65521 tcgccaggta caggctctct tgagtcacgc ggcacagacg ttcacccaga atggtcggtt
65581 taccaacaat aaacattttt gaggtatcgc catggaagcc atctttaatc acggtcacgt
65641 cgatattgac gatatcaccc tctttcagca ctttttcatc gcttgggatg ccgtggcaga
65701 tcacttcatt taccgagatg cagaccgatt tagggaaacc gtgatacccc aggcaggcag
65761 aaatagcctg ttggtggttg gtgatatgat cgtgacagat acgatccagt tcacccgttg
65821 ttacccctgg tttcacatag ggttcgatga tttccagcac ttcagcagcc aagcggccag
65881 caacgcgcat tttttgaata tcttcaggtg ttttgattga gattgccatg caaaagtgtc
65941 cgtaggggct gtaaatagcc agccgccgtt attaatgaga tgcagataat gaaaagataa
66001 ctgtcgatta ataaaaatct gtggctaatg gtatcagcca agcaatttgc tgccaaattt
66061 tcctttatag ggcctcaccg aatacaacaa aagttggtgt gttgtggctg tttatggtat
66121 aaagcgcgcc ggcgaaccgc actatattcc gatgaatatg aatcttccga tgacgtattc
66181 tgacgataag tagccagttt agccgggcaa taccaatcgc actcatatgt gtaaataaca
66241 cacacgtgtc gacacataca ccggggtgcc ctaaagtttt tgattgttat caatcgctta
66301 tggggtcggt gttatgggac acgtggaggc ataaccccaa ctgatactat agaggtttta
66361 atcatggcaa ctgtttccat gcgcgacatg ctccaggctg gtgttcactt tggtcaccaa
66421 acccgttact ggaacccgaa aatgaagcct ttcatcttcg gcgcacgtaa caaagttcac
66481 atcatcaacc tggaaaaaac tgtaccaatg ttcaacgaag ctctggctga gttgactaaa
66541 atttcttccc gtaaaggtaa gatcctgttt gttggtacta aacgcgcagc aagcgaagcg
66601 gtaaaagaag ctgctaacaa ctgcgaccag ttcttcgtga accatcgctg gttgggcggg
66661 atgctgacta actggaaaac tgttcgtcag tccatcaaac gtttgaaaga tttggaaatc
66721 cagtctcaag acggcacctt tgataagctg accaaaaaag aagcgctgat gcgtacccgt
66781 gaactgaaca agctggaaaa cagcctgggt ggtatcaaag acatgggcgg cttaccagac
66841 gcactgtttg ttgttgatgc tgatcacgaa cacatcgcta tcaaagaagc taataacctg
66901 ggtatcccgg tattctctat cgttgatact aactccgatc cagatggcgt tgacttcatc
66961 atccctggta acgatgacgc aatccgtgca gttaaactgt acctaggcgc tgttgctacc
67021 gccgtccgtg aaggtcgttc tcaagatctg gccgttcagg cggaagaaag cttcgtagaa



67141 ttaggggggc cttataggcc ccctttgcat atctaatgtg agattttgtc ttcacaagag 
67201 aaccgaggaa atttaaatgg ttgctattac cgctgctttg gtaaaagaac tgcgtgagcg 
67261 tacagccgca ggcatgatgg aatgtaaaaa ggcgttggtt gaagctaatg gtgacattga 
67321 gctggccatc gacaacatgc gtaagtctgg tcaggctaaa gctgccaaga aagcaggccg 
67381 tattgctgct gaaggtatca tcctggctaa agtttccgct gatggtaagt acggtgtgat 
67441 tctggaactg aactgcgaaa ctgacttcgt tgctaaagat gctggcttca aagcattcgg 
67501 cgaagaagtt atcaacgcag ctctggctga aaaaatcgct gatatcgatg ttctgaaagc 
67561 taagttcgaa gaacaacgtg caaacctggt cgctaaaatc ggtgagaaca tcaacattcg 
67621 ccgtgtcgct gtactggaag gcgacattct gggtacttac ctgcatggcg cacgtatcgg 
67681 tgttatggtt gcggccacgg gtgctgatga agaactggtt aagcacattg ctatgcacat 
67741 cgctgcaagc aagcctgaat atgttaagcc agacgatgtt cctgctgaag ttgttgctcg 
67801 cgagcaccaa atccagttgg acattgctat cgaatccggc aagccacgtg agattgctga 
67861 gaagatggtt gaaggccgta tgcgtaagtt cactggcgaa gtctctctga ctggtcagaa 
67921 cttcgttatg gacccaagca aaactgttgg cgatctgttg aaagaaaaca acgctgatgt 
67981 cgtgaacttc atccgctttg aagtgggcga aggcattgag aaagttgaaa ctgactttgc 
68041 tgctgaagtt gcagcaatga gcaaacagtc ttaatgctcc aaatggggcc gccaataggc 
68101 ggttccattt tattcactcg atacccgtga tacttcaagt tgcaagcgcg ttgactgcct 
68161 tcctataact cgaattattt agggtataaa cttagtccgc tcagttttta gcgcacaccg 
68221 ctgctctgag tatctgacgg tcaagtctcc aatattatta ctgctgctta ggacacaaca 
68281 ccatggcaac caatgcaaaa cccgtatatc agcgtatcct gcttaagctg agtggcgaag 
68341 ccctgcaagg cgcagaaggt ttcggtatcg acgctagcgt tttggaccgc atggctcagg 
68401 aagttaaaga gctggttgaa ttgggcattc aggtcggtgt agtcattggc ggtggtaact 
68461 tattccgcgg tgccggtttg gcacaggctg gaatgaaccg tgtcgtgggc gaccacatgg 
68521 gaatgctggc gactgttatg aatggtctgg caatgcgaga tgctttgcac cgtgcctatg 
68581 tgaacgcccg cctaatgtct gctattccac ttaatggcgt ttgtgataat tacagttggg 
68641 ccgaagcgat tagcttgctg cgtcataacc gtgtggttat ttttgctgcg ggtaccggta
68701 acccattctt tacgacggat tctgctgcgt gtctacgtgg tatagaaatt gaagccgatg
68761 ttgtgttaaa agcaacaaag gttgatgggg tttactccgc agatccggtg aaaaatcctg 
68821 acgcaacatt gtacgaacag ttaacctatc aggatgtttt agaacaagag ttgaaagtta 
68881 tggatctggc ggccttcact ttggcccgtg accataacct gccaattcgg gtcttcaaca 
68941 tgaataaacc cggcgctttg cgtcgtgtgg tgatgggtga gaatgaaggg acattgatcg 
69001 ctaaataagc gacagacctt attatcaggg gtaagcacat tgttctgtct gcccctttag 
69061 ccagcaactg ttggtttgta tattcaccgg ctatactcaa aagctatggc cagccaagta 
69121 accagtttcc aagggttcgc aacgtgatta acgaaattag aaaagatgcc gaagtgcgca 
69181 tggaaaaatg tctggaagca ttccagaacc acattagtaa aatccgtacg ggtcgtgcct 
69241 ctccaagtat ccttgatggc attcaggttg agtactacgg tacggcgaca ccattgcgcc 
69301 aattggctaa cattgtagta gaagactccc gtactttagc gctaacagtc ttcgaccgta 
69361 gcctgagtgc tgcggtagag aaggctatca tgacttcaga tttggggctg aatccatcgt 
69421 ccgccggtac tgttatccgt gttccattgc cagcattgac ggaagagcgt cgtaaagact 
69481 tgattaaagt cgtgcgggca gaagcagagc aggggcgtgt gtctattcgt aacgtgcgcc 
69541 gtgatgctaa cgataaagtg aaggcattgc tgaaagacaa agaaatcagc gaagacgaag 
69601 atcgccgttc acaagacgat gtgcaaaagc tgacggatgc ctatatcaag aaagttgatg 
69661 ctgcattggc ggtaaaagaa gctgagttga tggattttta atctctactc gcattgtata 
69721 agcgtcgcct taggcggcgc tttgcttttt ttccggtcat gttttgtctg atcaatgttt 
69781 tatctgatca atgtcttatc ttcgcgatgt tttttcggca gtcactcgcc ataaaacctg 
69841 gaacgatggt atcgtttgcc atgcaggcaa attgatatct cgcctatcac caaacagtta 
69901 ttcagagcgc cctcatgaag caactgacta ttcttggctc tactggctcc atcggcaaca
69961 gcacgttgag tgttgtgcgt gcaaaccctg aattatttaa agtcacggcg ttagttgccg 
70021 ggcgtaatgt cagggaaatg gcgcagcaat gccttgagtt ctccccccgc tatgcagcta 
70081 tgtctgatga acactcagct aaatcactgc gtttattgtt ggcagaacag ggttcagaca 
70141 ccgaggtcta ttccggtgaa acggctgcct gcgaattggc agctttagat gatgtcgatc 
70201 aagtgatggc cgcgattgtt ggcattgctg gtttaccctc aacattggcc gctattcgtg 
70261 caggtaaaca agtcttgctt gccaataaag agtcgctgat tacctgcggt aaactcttta 
70321 tggatgaggt taaacgcagc cgtgctcaat tactgccaat tgatagcgag cataacgcga 
70381 tttttcagag tttgcctgaa agaattcagc gccaattagg ctattcttca ttaaatgaaa 
70441 atggcgtttc acggattatt ttgactggct ctggtggccc atttcgtgaa actccacttt 
70501 cgcagttttc tgatgtcacc ccagatcaag cttgtgctca tccaaattgg tctatggggc 
70561 ggaaaatttc tgtcgactca gcgacgatga tgaataaagg cctggagtat attgaggctc 
70621 gatggttatt taatgccagc gcagaacaga tagaggttgt gttacatccg cagtctgtta 
70681 ttcactcgat ggttcgctat catgacggaa gtatcctggc acaaatgggg actccagaca 
70741 tgcggacacc gattgcacat gcgatggctt acccaatgcg agttagctca ggtgttgcgc 
70801 cattggattt ttgcaaagta ggcgcgctga cctttacaac cccggattat caacgttatc 
70861 cgtgcttaaa gttggccata gacgcctgta atgcggggca ggcagcgaca acagcgctga 
70921 atgcagcaaa tgaaatttca gtcatggcat ttttagattc aaaaattcgt ttcactgata 
70981 ttgaagtgat taaccgtact gttgttgaag gtttattatt atcagaacct accagtgtgg 
71041 aagaggtgtt ggtgattgat cgcaaagctc gtgatgtcgc ggcccaagtc attgcaaaat 
71101 taaataatta aaccaggttc gctgatccgg tcatttgtta gggaaaggct gagatggtat 
71161 agtcaaccct atcaatggat agatagactg ttgattagtg acccaatagg tatttaaacc 
71221 gtctatttgg gctgtcactc cagaagccgt gacttggtac acggcttttt ttgtgcactt 
71281 gaggcctaat gttcgggacg aacaactgat ttattgagga aataagttcg cgttatgtcg 
71341 cctgtaaaag aagatagggc taacttgtcc ccccggagtc cacgtcatgt tgctatcatt 
71401 atggatggta atggacgttg ggctaaaaat aagggtaaat taagagtttt cggccataaa
71461 gccggggtga agtccgttcg acgcgcagtg agttttgctg cgaagcataa tttagatgcg 
71521 ctcactcttt atgcttttag cagtgaaaac tggaatcgcc ctgaccaaga agtgaccgct 
71581 ctgatggagc tttttgttcg agcactagac agtgaagtaa aaagcttgca taaacataat 



71701 cgtcgttcag aaaaattaac ggcaaataat gacggattga aactcaacat tgctgccaat
71761 tatggtggcc gttgggatat catccagggg gtgcgccatt tagctgaaca agttcagaaa
71821 ggggaattgc aacctacaga tatcagtgaa gagtcgctca actcgtatat ctgcctgcat
71881 gagcaatctc aggtggactt agtgatcaga accggtggtg aacatcgcat cagtaatttc
71941 ttgttgtggc aaatagccta tgctgaactt tactttacag atgtcctctg gcctgatttt
72001 gatgaaaatg tctttgaagg tgcgctgaat gcatttgcac aacgcgagcg tcgcttcggg
72061 ggaactacac ctatcgatgc cactgcatcc tagggggaac ttttgctgaa gtatcgtctc
72121 ataactgctt tgattttgat tccggttgtc attggtgcac tgttcctgct tccgccggtt
72181 ggtttcgcta ttgttactct cgtcgtctgc atgcttgcag cctgggaatg gggtcaatta
72241 gcgggttttg ccagtcgtac tcagcgtatc tggttagcga tactgtgcgg attcctgctt
72301 gttgccatgc tgcttagtct ccctgaatat cagcattcac ctcatcacct cttggtgagt
72361 acaccactat ggctctctat gggctggtgg gttgctgcat tgatgctggt gcttacttac
72421 ccgcgctctg ctgtatcttg gcgtaactcg cgcctcctgc gcatcatctt cggtattctg
72481 actattatcc catttttttg gggaatgttt gctttacgcc agtatggcta cgagcagaac
72541 cataataccg gcgcttggtg gctgttatat gtcatgttat tagtgtgggg ggcagattct
72601 ggcgcctaca tgtttggtaa attgttcggt aaacacaaat tagcgcctaa agtctcacca
72661 ggtaaaacat gggaaggatt aattggcggt ttactgacgt cagctctgat atcattgctg
72721 tttggtcgat acgcaccgtt ggatatcgtc cccgaaaaac tgttaatctg ttccgtggtt
72781 gctgcattgg cctcggtact cggtgatctg actgaaagta tgtttaaacg tgaagcggga
72841 atcaaagaca gtggtcattt gattcctggt catggtggga tactggatcg tatcgatagc
72901 ctgaccgcag ctgtgccggt atttgcctgc ttaatgctgt tagtgtttta atccgtcact
72961 gacggggagt ttagggatta tgatgagcat actctggagc ctggctgcgt ttattgtcgc
73021 actggggatc ttaattacag tgcatgagtt tggtcacttt tgggtggcac ggcgctgtgg
73081 tgtccgtgta gagcgttttt caattggttt tggtaaagcg ttatggcgcc gtactgatcg
73141 tcagggaacc gagtatgtta tcgcccttat cccactgggc ggttacgtta aaatgttaga
73201 tgagcgtgta gaggctgttg cgccagaatt acgccatcaa tcattcaaca ataaaactgt
73261 tttacaacgt gcggcaatcg tcagcgcggg tcctattgct aacttccttt ttgccatcgt
73321 cgcttactgg ctggtgttta tcatcggcgt gccaagtgtc cgtccggtca ttggtgatat
73381 ttcgccacaa tctattgctg cgcaagccaa tatttcttca ggaatggaac ttaagtctgt
73441 tgatggcatc gaaacgcctg attgggattc tgttcgtttg gcgttgatca gccgaattgg
73501 cgataagcaa atgcaagtgg gggtcgctcc atttggttcc gacaatgtgg tagagaaaac
73561 tctggattta cgccagtggc aatttgaacc cgataaacaa gatccagttg tagcactcgg
73621 tattatccca cgtggtccgc agattgaatc tgtcttggct gaagttcagc cagggtcagc
73681 ggcacaaaag gcgggtttac aagcagggga taggatcgtt aaagttaatg gtcaattatt
73741 ggatcgttgg cagacatttg tgttacaggt ccgtgataat cccggccagc cgctggtatt
73801 agatattgaa agggagagta cccccttgtc tttaaccttg ataccagata caaaatcggt
73861 tggagaaaac cgtagtgaag gttttgcggg tgtagtgccg aaagtgatac cacttccgga
73921 tgaatataaa acaattcgcc aatatgggcc gtttacggca gtctatcaag ctggggataa
73981 aacctggcag ttgatgcggt tgacggtaag catgttgggt aaactgatta ctggtgatgt
74041 taagctgaat aacctaagtg gcccgatatc tattgcgcaa ggcgctgggc tttcagctga
74101 gtacggattg gtgtactacc tgatgttttt ggcgctaatc agcgtcaact tgggcattat
74161 caatttgttc ccattaccag tattagatgg tggacatctg ctcttcctgg cgatagaaaa
74221 gctgaagggt gggccggttt ctgagcgagt gcaggacttc agttatcgca tcggctcgat
74281 tttgctggtg ctattaatgg ggcttgcact tttcaatgat ttctcccgtc tttaacggca
74341 ggggataggt taggaagaac gcataacaac gatggcgatg aaaaagttgc tcatagcgtc
74401 gctgctgttt ggcagcgcca ccgtatacgg tgcagacggg ttcgtagtga acgatattca
74461 tttcgaagga ctgcaacggg tcgccgtcgg tgcggcgtta cttaatatgc cggttcgcgt
74521 aggcgatacc gtcagtgatg atgacatcgg taaaactatc cgtgcgttgt ttgctacagg
74581 caacttcgag gacgttcgcg tcctgcgcga tggtaatacg ctgattgttc aagtcaaaga
74641 acgccccacg attgccagca tcactttctc cggtaataaa gcggtgaaag aagatatgct
74701 caagcagaat ctggaagcct ctggcgtccg ggttggcgag gccttggacc gaaccactat
74761 atccaacatt gaaaaaggac tcgaagattt ctattacagt gtgggtaaat acagcgcttc
74821 agtgaaggct gttgttacgc cattaccacg taatcgtgtt gacctgaaat tggtctttac
74881 cgaaggtgtt tctgcaaaaa ttcagcagat taatatcgta ggtaatcaca gtttcaccac
74941 tgatgagctt atctcacgct tccaactgcg tgacgaggtg ccttggtgga acgtagtcgg
75001 tgatcgtaaa tatcagaaac agaaactggc aggtgacctt gaaaccttgc gcagcttcta
75061 tttagaccgc ggttatgcgc gctttaatat cgattcaact caggtgagtt tgacgccaga
75121 taaaaaaggc atctatgtca ccatcaatat tactgaaggc ccacaattca agctaaatag
75181 tgtgattgtg agcggcaatc ttgcagggca tcaatctgaa gctgagaagc tgacaaaaat
75241 tgagccgggt gagttgttta acggcagtaa agtgacacgt atggaagatg acatcaaaaa
75301 gatgttgggc cgttacggct atgcttaccc acgtgttgtc actcagccag aaattaatga
75361 tgacgataaa acagttaaac tgcatatcaa cgtggatgca ggtaaccgtt tctatgtccg
75421 ccatattcgt ttcgaaggta acgataccag taaagactcg gtgttgcgcc gtgaaatgcg
75481 ccagatggaa ggtgcatggt tgggcaatga tcaggtagaa gcgggtaaag agcgacttaa
75541 ccgtttgggt tacttcgaga ccgttgatgt tgagactcaa cgtgtaccgg gagctgccga
75601 tctggttgat gtgacctata aagtgaaaga acgaaacacg ggcagcctga actttggtat
75661 tggttatggt accgagagcg gggtgagttt ccaggttggc gttcagcaag ataactggct
75721 aggcaccggt aataccgtgg gtattaacgg cacaaagaac gattatcaga cgtatgcaga
75781 atttacgcta atggaccctt acttcactgt tgatggtgta agtctgggtg ggcgtatttt
75841 ctacaatgac tttaaagcgg ataatgcaga tttatctggc tataccaaca gtagctatgg
75901 tgccgacggt actcttggct tcccaattaa tgagaataac tcattacgtg ttggtgtcgg
75961 ttatgtccat aacgatctgt ctgacatgtt accgcaagtt gcgatgtggc gttatcttga
76021 atcagtgggt gagcgcccgg gctatgatgg acgtgaaggt ttcactactg atgacttcac
76081 tctgaatcta gggtggacat ataacaacct tgaccgtggt ttcttcccaa catcgggggt
76141 taaatcatcg gtgaacacga aaattaccgt gccaggctca gataatgagt tctacaaagt



76261 ccgtggtcgc ttaggttatg gtgacggtat tggttcgaaa gagatgccat tctacgaaaa 

76321 cttctatgca ggtggttcca gcacggttcg tggtttccgg tcgaataaca ttggtccaaa 

76381 agcggcgtat tacgctaatg gtggggctac agtgactaac tctaccgatg cggtaggggg 

76441 taatgcgatg gcggtggcca gtatcgagtt gatcacaccg acaccgttta tcagcgagaa 

76501 atactcgaat tcagtgcgta cctccatctt tatcgactca ggtactgtgt gggataccaa 

76561 ttgggaaaac acagctaaaa cgcgtgctgc aggtattcct gattacggta aagcgagtaa 

76621 tattcgtgtg tctgcgggtg tagcattgca atggatgtct ccattgggtc cattggtgtt 

76681 ctcatatgct aaaccagtta aagattacga aggtgataag tcagagcaat tccagtttaa 

76741 cattggtaaa acttggtaat tgatgggcaa agcagttctg aattttaaag acagcactgg 

76801 tagtgaaagg atcttaagtc gcgttggtga aatctgacgc ttcatcaggt gggatacact 

76861 accaagtgca attcaaggtg tctctgacac aaacgggtaa tggtaaggag tttatagtga 

76921 aaaagtggtt gtgtgccgca agtctaggtt tagcattggc tgcttcagcc agcgttcaag 

76981 cagctgacaa aattgctatt gttaatgttt ccagcatttt ccaacaatta cccgcgcgtg 

77041 aagccgtggc taagcagctg gagaatgaat tcaaaggccg tgcaaccgaa ctgcaaggaa 

77101 tggagcgcga tctgcagact aaaatgcaga agctacaacg tgatggttct accatgaaag 

77161 ccagtgatcg taccaagctg gaaaatgaag taatgaaaca gcgtgaaact ttctctacta 

77221 aagcgcaggc ttttgagcaa gataaccgcc gtcgtcaggc agaagagcgt aacaaaatcc 

77281 tgagccgtat tcaggatgct gttaaatctg ttgctaccaa aggtggctat gacgtagtca 

77341 ttgatgcgaa tgcagtggca tacgctgatt cttctaaaga cattactgct gacgtattga 

77401 aacaggttaa ataaaacatg ccttcaattc gactggctga tttagcccag cagttggatg 

77461 cacaggtgca cggtgatggc gatcttgtca tcaccggcat cgcttctatg cattctgccc 

77521 aacctgagca aatcacgttt ttgtcaaaca gccgctatcg ggaacaactt gcatcttgca 

77581 atgcaggtgc tgtggtattg actgaggctg atttaccttt ttgtaaagtt gctgctttag 

77641 ttgttgaaaa tccttatttc acctatgcgc gcatggctca aataatggat accacccctc 

77701 aacctgctca ggatatcgca ccgagtgccg tgatttctcc gcaggcaacg ttgggcgagg 

77761 gggtttctgt tggtgcgaat gcagtcatag agtcgggtgt tgtactcggc gataatgttg 

77821 tcatcggtgc gggttgcttc attggtaaga acacccatat tggtgcgggt agccgtttgt 

77881 gggctaatgt ttctatctat catgaagttg ttattgggca gaattgcctg atccaatctg 

77941 gtactgtcat tggtgccgat ggctttggtt atgcaaatga tcgtggtaac tgggtaaaaa 

78001 taccgcagct tggttctgta catattggcg accgtgttga gattggtgcc tgtacaacga 

78061 ttgaccgtgg tgcgctggat aacacgatta ttggtaacgg tgtcatcatt gataaccaat 

78121 gtcagattgc acataacgtt gtgattggcg acaatacggc agttgccggt ggtgttatca 

78181 tggctggcag tttgaaagtg ggccgctatt gtatgattgg cggtgccagt gtgattaatg 

78241 gccatatgga aatttgtgac aaagtaacaa ttaccgggat gggaatggtg atgagaccaa 

78301 tcactgaacc tgggttatac tcctccggta ttccgttaca accaaataaa atgtggcgca 

78361 aaactgccgc tctagtgatg aatattgatg gtattaataa acgcttaaaa gctgttgagc 

78421 gtaaaatcga taaagaataa tcactctagc ctgacactct gagttaataa tcgaacttac 

78481 tggcagtgat gccaaaagtc tacagagttc ctaatttgcg gcctgcctaa tgcctccctt

78541 tggggctgtg caggccgtgt tgttgatagc atcagttttt tagacaggaa gagtattttg 

78601 actactgaca ctcatactct gcatattgaa gagattttgg atctactccc gcaccgtttt 

78661 ccgttcttgc tggtagaccg tgtccttgat tttgaggaag gaaaatttct gcgtgcagtg 

78721 aagaacgtct cttttaacga gccatttttc caaggccatt tcccaggcaa gcccatcttc 

78781 cctggtgtgc ttatattaga agcgatggcg caggctacag ggattttggc gtttaagagc 

78841 cgcggtaaac tggaaccggg cgagctttat tactttgctg gtattgatga agcacgtttt 

78901 aagcgcccag tagtgcctgg cgatcagatg atcatggaag ttgagtttgt taaagaacgc 

78961 cgtggcctaa cccgttttac cggtgtagct aaagtcgatg gcgaaatcgt ttgcactgcc 

79021 accatgatgt gcgcgcgtag caaaccagca gcacctgcag aatctgtcgt tgtaaaacca 

79081 gatgttgtaa aaccagatgt tgtaaaacca gatgttgtaa atcctgtcgt taaggagtcc 

79141 tgattcgtga ttgacaaaac cgcctttatc catccaagtt ctatcgttga agaaggtgcc 

79201 atcattggcg ctggtgttta tattgggccc ttctgcattg tcggatccca ggtggaaatt 

79261 ggtgctggca cggaactgaa atcccacgtt gtcgttaatg gcataaccaa gatcggctgt 

79321 gataatcaaa tttaccagtt tgcttccatc ggtgaagcga accaagatct gaaatatgca 

79381 ggtgagccta ctcgggttga ggtcggtgat cgtaaccgta tacgtgaaag tgtcactatc 

79441 caccgaggta ccacacaagg tgggggggtg accaaggtcg gttgcgacaa tctactgatg 

79501 gttaataccc atgttgccca tgattgtgtc attggtaacc gttgcattct tgctaataat 

79561 gctgcgttgg ggggtcatgt agagattgac gattacgcta ttatcggtgg catgacggct 

79621 atccatcaat tttgtgtgat tggtgcgaat gtgatggtcg gcggttgttc cggtattact 

79681 caggatgtgc ctccttttgt catcgctcaa gggaatcatg cgacgccatt tgggattaat 

79741 attgaaggtt tgaaacgtcg tggtttcgac aaagaatctc ttcatgctat ccgtagtgca 

79801 tacaagttgt tgtaccgcag tggcaggacg ttggatgaag tgaaaccaga aattgctgaa

79861 ttagcggagc aatatccagt cgttaaagcg ttcaatgatt tctttgctcg ttcgacccgc 

79921 ggtattattc gctgacttat gcaaaatagc ccattgactg ctgattgttc attaaacgct 

79981 ggacgtcccc taacgattgg tttagttgcc ggagaaactt ccggcgatat cttaggcgct 

80041 ggattaattc gggcgctgaa agttcaggta cccaatgccc gatttgtcgg tgtcgctggg 

80101 cctctgatgc aagccgaagg gtgtgaagcc tggtatgaga tggaagagtt agccgtcatg 

80161 ggcgttgttg aagtattaga gcgcctgcca cggttattga aaatcaggaa ggatctgact 

80221 cagcgtttta gtgagctttc tcccgatgtc tttgttggca ttgacgcccc cgatttcaac 

80281 atcacccttg aagggcgctt aaaacaacgg ggtattcgga ctattcatta tgtcagccca 

80341 tcagtctggg cctggcgaca aaaacgtgtt ttcaaaattg gaaaagcgac cgacatggtg 

80401 ttggcttttc tcccttttga aaaagcgttt tatgatcgtt tcaatgtccc ttgccgtttt 

80461 ataggccata ccatggccga tgccatgcca ctagtacccg atcagcaggc agcgagagca

80521 gagcttggca ttgcgcctaa tgccacttgt ctggcgctgt taccgggtag ccgccactct 

80581 gaagttgaaa tgctcagcgc tgatttttta cgtacagcgg taattctgcg tgataaattg 

80641 cccaatttgg aagtagtggt tcccttggtc aacagcaagc gacgtgaaca attcgagcga 

80701 attaaagcgg aaattgcgcc tgatttatcc gtacatttac tggatggtaa agcgcgtgtc 



80821 ctggccaaat gcccgatggt tgtggggtat cgtatgaagc catttacttt ctggttggca

80881 gagcgattag tcaaaacacc ttatgtttca ttaccgaatc tgttagcagg ggaagagttg 

80941 gtgacagaat tgttgcaaca agagtgccag ccgcaaaaac tcgctggtgc acttttgccg 

81001 ttgctacaag gtggtagcga aatagcagcc ttaaaagaac gttttttggt tctgcatcag 

81061 agcatccgct gtggtgcgga tgaacaggca gcacaagctg tattggagct ggcagatcga 

81121 tgagcgagac ttttatttac ccacaggcta atttaatcgc aggcgtggat gaagtgggtc 

81181 gtggcccttt ggtgggtgcc gtagtcactg cagcggtgat ccttgatcct aaccggccaa 

81241 tagttggctt ggcagattca aaaaaactca gcgagaagcg ccgcctgtca ctgtatgacg 

81301 aaatcactga aaaggcacta tcatggagtc ttggtcgtgc tgaaccggaa gaaattgatc

81361 agcttaatat tttgcacgca accatgctgg cgatgcaaag ggctgtatcg gggctgcata 

81421 ttgtacccga ttatgtattg atagatggta accgttgccc gaaattacag atgccgtcac 

81481 tggccgttgt caaaggagat agccgggtgg cggagatcag cgcggcttct attttagcca 

81541 aagtgacgcg tgatcgtgaa atgaccgagt tggacctcct cttccctgaa tatggtttcg 

81601 cacaacataa aggctaccca acggcttttc atttggaaaa gttagccgcg cttggtgcca 

81661 cggtacatca tcggcgcagt tttggcccgg tgaaacgggt gctgggtttg gtgtaaacga 

81721 taagcttgca tcgatgatat atatacccta aataatttga gttgcacgaa gatagccaat 

81781 acataggcaa cttgaaatat gatggggata gctatgatgt tccgggtact tagctaccgg 

81841 ggcgtatgat tcaatccact catttctggt atctggatat ggccgaacct cgttttgtcc 

81901 acctgcgtgt tcatagcgat tattccatga tcgatggatt agctaagatt ggacctttgg 

81961 tgaaaagagc tgctgcatta ggtatgcctg ctctggctat taccgatttt actaacctct 

82021 gtggcctggt gaaattctac ggcagcgcgc atggtgcggg gattaaaccc atcatcggtg 

82081 ctgacttcta cgtacaaagt gaaatactgg gtgatgaatt agctcacctc acggttttgg 

82141 ctcgcaataa tgaaggctat caaaatctga ctttattgat ctcagaagcc taccaacggg 

82201 gttatggcgc tgcgggtcct atcattgatc gcgattggct gatcaaacat aaagaaggac 

82261 tcattctgct gtctggtggc cgaatggggg atgtgggcaa atttctgcta cggggtaacc 

82321 aggttcaggt tgatcaatgt ctcgcctttt atcaggaaca ctttccagac tgctattacc 

82381 tcgaacttat tcgaaccggg cgtccagacg aagaaaacta tcttcatgct gccgtggcat

82441 tggccactga gcgtggttta cctgtcgttg cgaccaatga tgtgcgtttt attgatgaat 

82501 cggattttga tgcccatgaa atccgtgtcg ctatccatga tggttttaca ttggtcgatc 

82561 ctaaacggcc gaagaattac agccctcagc agtttatgcg cgatgaagaa cagatgtgtg 

82621 agctgttcgc tgatattcca gaagcgttga tcaacagcgt tgagatagcc aaacgctgta 

82681 atgtcactat tcgccttggg gaatatttct taccgcagtt ccccacaggg gagatgagta 

82741 cagaagattt tctggtagaa aaagccaaac aaggtttgga agaacgtttg gaatttctgt 

82801 ttccggaccc tgaggttagg ctgcaaaaac ggccagaata tgatgaacgt ctggatatcg 

82861 agctcaaagt tatcaaccaa atgggcttcc ctggttactt cctgattgta atggagttta 

82921 tccagtggtc gaaagataac ggcgtaccgg tggggccggg gcgtggctct ggtgcaggct 

82981 cattggtggc ttatgccctg aaaattactg atattgaccc actggaattt gatctactgt 

83041 tcgagcgctt ccttaatccg gaacgtgtat cgatgcctga cttcgacgtt gacttctgta
83101 tggagaaacg tgatctggtg atcgaacacg tggcggaaat gtatggccgc gatgcggttt 
83161 cccagataat taccttcggt accatggcgg ctaaagcggt catccgcgat gtcggccgtg 
83221 tattaggcca tccctatggc tttgttgatc ggatctcaaa attaatccca ctcgatccgg 
83281 ggatgacact ggaaaaagcg tttgcagcgg agccacagct agctgaaatt tatgaagctg 
83341 atgaagaggt cagggcgcta attgatatgg cgcgtaagct ggaaggagta acgcgaaacg 
83401 ccgggaaaca tgctggaggc gtggttattg cccccactaa gattaccgat tttgccccac 
83461 tctattgtga tgcggaaggc aataacccag ttacgcagtt tgataaaaat gatgtggaat
83521 atgctgggct ggtgaagttt gacttcctcg gtctacgtac actaaccatc attaattggg 
83581 cgttggagat gattaacgcc cggcgcgcta aaactggatt ggaacccatt gatatcgcct 
83641 ctattccgct ggaggataaa aaaagcttcg atatgctgca acgttcggaa acgacagcag 
83701 tattccagct ggaatccaga gggatgaaag acttgatcaa gcgcctgaag cccgactgct 
83761 tcgaggacat gatcgcctta gtggcgttat ttcgtcccgg tccgttacaa tcagggatgg 
83821 ttgataactt tattgaccgt aaacacggcc gggaagcgat atcttaccca gatattgagt 
83881 ggcaacatga atccttgaaa cctgtgcttg agccgaccta cggcattatc ttgtatcagg 
83941 aacaggtcat gcaaattgcg caggttctgt ccggctactc actgggtggt gcggacatgt 
84001 tgcgtcgtgc catgggtaag aaaaaccctg ctgagatggc taagcagcgt tctgtatttg 
84061 aagatggtgc gaaaaatcag ggtattgatg gtgagctggc gattaaaatc tttgacctgg 
84121 ttgagaaatt tgcaggttac ggttttaaca aatctcactc ggcagcttat gcattagtct 
84181 cctatcaaac gttgtggtta aaagcccatt atccggctga atttatggct gcagtaatga 
84241 ccgctgatat ggataatacg gacaaagtgg tcggcttggt cgatgaatgc tggcgtatgg 
84301 ggctgaaaat tctgccccct gacatcaaca gcggtttgta ccatttccat gtaaacgatg
84361 atggcgaaat tgtttatggt attggtgcca ttaaaggtgt aggtgagggg ccgatagaag
84421 ccattctgga ggcgcgtaaa gaggggggct attttaaaga gttatttgac ctatgcgccc 
84481 gtgtcgatac caaaaagctt aacaagcgta ttttagaaaa actgatcatg tccggtgcat 
84541 ttgaccgttt ggggccacac cgtgcagcat tgatgagttc tctcggggat gcgcttaaag 
84601 ccgcagatca acatgctaaa gccgaagcaa ttggtcaggt agatatgttt ggtgtattag 
84661 ccgacgcgcc tgagcaggtt gagcaatcct atgccaatgt gccgccatgg caggagcagg 
84721 ttgttttaga tggtgaacgg gaaacgctgg ggctgtattt aactggccat ccgatcactc 
84781 agtacctcaa ggaaatcgaa cgctatgccg gtgggatgcg tttgaaggat atgcatccga 

84841 cggatcgggg caaaatgacc accgcagtgg ggttggtgat cgcggcaagg gttatggtca
84901 caaaacgtgg taatcgcatt ggtatttgta cgctagatga ccgctctggt cgtctggagg 
84961 tgatgttatt caccgatgca ttggaaaaat atcagcattt attggaaaaa gaccgtatcc 
85021 tgatagccac cggacaggtc agctttgatg actttagtgg tgggcttaaa atgaccgccc 
85081 gtgagttaat ggacatcagt gaagctcgtg aaaaatatgc aagtgggctt gctatctcgc 
85141 tgactgacag gcaaattgat gaccagcttt tgaaccgtct ccgtcagtcg ttggaacccc 
85201 atcgagcggg gacgatccca gtacatcttt attaccaacg ggaagatgct cgcgcccggc
85261 tgcggtttgg agccacatgg cgcgtgacgc ccaccgatcg cttattaatc gatttgcgaa 



85381 tgaattttct tgattttgaa cagccgattg cagagctgga agcgaaaatt gactcgctga
85441 ctgcagtcag ccgtcaagac gaaaaattag atattaatct ggatgaagag gtccagcgtc
85501 tgcgtgagaa aagtgttgag ctgacgcgga aaattttctc ggacctcggc gcatggcaga
85561 ttgcccaatt ggcacgccat cctcgtcgtc cttataccct ggattatatc gctaatatct
85621 ttaccgattt cgaagagctg gctggtgatc gtgcttatgc cgatgataaa gctattgttg
85681 gcggtattgc acgtctggat gggcgcccgg tgatgattat tggtcatcaa aaagggcgtg
85741 aaaccaaaga gaagatccgt cgtaactttg gtatgccagc accagaaggg tatcgtaaag
85801 ctttacgtct gatggaaatg gctgaacgct ttaagttacc cattattact ttcatcgata
85861 caccgggggc ttatccgggt gttggggcgg aagaacgtgg tcaatcagaa gcgattgcgc
85921 gcaacttgcg tgaaatgtca cgactgaatg tccctatcgt ctgtaccgtg attggtgaag
85981 gtggatctgg tggtgcgttg gctatcggtg tgggtgataa agtgaatatg ctgcaataca
86041 gcacttattc tgttatctct ccagaaggtt gcgcctctat tctgtggaaa agtgcggata
86101 aagcgccatt agccgcagaa gcgatgggca ttactgcaca tcgcctaaaa gaactgaaga
86161 tgattgactc agtgatcccc gagccgttgg ggggcgcaca tcgcgattac gccgcgattg
86221 ctatctcact gaaagcgcaa ctgttggcag atctcaatga tttggatgtg ctgaatgacg
86281 aagagttatt gaatcgtcgt tatcagcgct tgatgaacta cggctattgc tgagttgatt
86341 gccaactgat aaaagccggt gcggaattta agcgccggtt ttttttatct attttttact
86401 gaagacccga atttagttaa gcaatctata cccaagttac ttcgagatgc agggggccaa
86461 cacacaggca acttaaagtc tggcagatgt aacatgttta atttaacgga ggatacgtgc
86521 tggcaattaa taagattcac catattgcga ttatttgctc agattatcag gccagtaaac
86581 ggttctactg tgaggtgcta ggttttaatc tgatcagcga ggtttatcgc gaagagcgtg
86641 attcatggaa agcggatctg gcgttacatg accaatacac cattgaatta ttctcctttc
86701 cttcaccggt accgcggcca agtcgtccag aggcctgtgg tttacgtcat ctggctttcc
86761 aggttgatga tattgatctg gcgctaaaag aacttgtcgt cgcgggcgtt gtttgcgaag
86821 cagttcgcat tgatccctac actcaatcac gttttacttt ctttaatgac ccagatggtt
86881 taccactgga gctatatgaa ttgaaagcgg aatgacatga acttagtcac ctcgaaaccc
86941 aatgtgttac ttaaccctct tttcgctcaa ctgggtgaaa accgtcatgt gttggtgggg
87001 tttagtggtg ggttagattc cacggtgtta ctgcatttat tggtctgttt acgtcaacaa
87061 cttattcctg aactgaacat tcgcgctatt catattcacc atggtttaaa cccgcaggct
87121 gatagttggg taaaacattg tatgcagcaa tgtgatcaat ggaaaattga gcttaaggtg
87181 gtgcgagtta atattgatcc tcggcagaac gggatagaag cggcagccag aacagcacgt
87241 taccaagcct tttccgccaa tttggctgcc aaagaagtct tactgaccgc ccagcatttg
87301 gacgatcaat gtgagacctt cctgttggca cttaagcgtg gtagcgggcc ggcgggtctc
87361 tctgcgatgg cggctaaaat gccctttgcc cacagccaac tgttgcggcc actactcgct
87421 ttttctcgag agatattaga gaactatgca caggcacagc aattacaatg gattgaagat
87481 gatagcaatc aagatgatcg ttttgatcgc aatttcttac ggctaaatgt tctgccgata
87541 cttaatcaac gttggccaca ttttgctcag gccacggcac gtagtgcggg gctatgtgct
87601 gagcaggagc agttattgga tgaactgttg gctgaaaact tgcagcagtt acagggacct
87661 gacagatcac tctctattga tggactatta caggcatcaa tggcgaaacg ggcggctata
87721 ttgcgccgct ggttggcgag tcttggtgca ccaatgccat cacaaagcca attgcagcgc
87781 ctgtggctgg aggtcgccat ggcacggcaa gatgcggagc ctcaattaat gattggcacc
87841 cgtcaggtac gccgttttcg tcaacatctg tatttattaa tgcccttggc tgaaataaca
87901 acaaattact taccttgggc aaccgttaag gccgccccaa attcgtcgat aataccttta
87961 ttacccgagc cattatggct acctgccgat ttaggcgtac tgaggtttgt ctcagcgggt
88021 gggcaagcgg tccgtcctgc cgccgttggc gaggagataa gtgtacgttt tggtttacag
88081 ggagatatca aaattgttgg gcggcatcac tcgcgtcaaa gtaaaaaagt atggcaagag
88141 ctgggtattc ccccatggca gcgggaacgt attccgctgt tatatttcgg tgaacaatta
88201 attgcagccg cgggggtatt cgttacccaa gcggggcagg cgaatgaaaa tgagccatgc
88261 tggcacttag attgggataa accacttaga ttgggataaa acaacacccg cattgaaaaa
88321 gaagaagaga gagtcgaagt tgatgactca atgcccacac cggagagaat aaatagatcg
88381 taaaaacgcc agattccccg catgaaaata acttccatgt gggttagaac agtgccatct
88441 gtggtgcggg cagtctatcc tcacgcgttt aaaaacgagt cattgagggt ttgtccgttg
88501 tctggaagag tcccgttaat ttttgaggag caataatgat gaaattagtt gtcagtatgg
88561 cggtgttact gagcttgtgt agctttcagg tgtgggcaaa aaatgatatt gaggcggggc
88621 gtgctaagtc atccagttgt gtggcctgcc atggcatgaa tggtaaagtt tcagtaccta
88681 tgtatcctaa cctggccggg cagaatgcgc tgtatctgca acaatcatta gaagcctata
88741 aaaaggggag ccgctccggc gggcaagccg aggtgatgcg cgcttatgta tccggtttat
88801 ccgatgaaga tttttctgat ttggctgcgt attatgccag cctaaagcca taaattatat
88861 gtggataata aaacaagcca gtcagtgaat cgggtgataa atctgccggg agctgatttt
88921 aacgcggctt acagcgacct tgtagaggcg aagtgtaagg ataggcagag tagtcacgaa
88981 agccgtcagt acagtagtca cgtcaagtac gtagggttaa aggtaagtcg ggtaaaatgt
89041 aagccaggta aatgtaaata ggataaatgc aaaacgggcc gctcttcagc gacccgtttt
89101 tcacaaggaa gtggaaacta tgatgcgctc actacgaccg ttccaatttc ggggtgactg
89161 aagctggcga tatggtccag acgcaattcg cgcattgtcc cttcttgctc gacaatcagg
89221 tactcgacct tttttctcag cagcagatcg ctggctttgc cttctacgat ctcaccgcct
89281 ctcaacttca acgtcagtat caggtgatgt tggcaggcga gttccagatt atcgtagtca
89341 tcacaattga tgggttgata ctcttcactc atcgacataa tcgctcacca ataagttagc
89401 ggcggcatag gccgcctgtt ccctaacgga ggacgggagt gcgtcgttag ccgcaacttc
89461 gtccaatgct ttcaatacac agcgtagtgc gtctggcacg tagccaagat caccacttcc
89521 aatctcggca taaaaacggc gtactaattc acagtattgc tgcacaataa tcctcccgga
89581 aaagtttaac ccggtaagct ctcagcagaa aagacaaact gtaaacctac gaagataaac
89641 tctgaactta taaggactat agcacaggag ttgtagagat attagagtgc aagctaaggt
89701 ttatttttgg tattgatacc acttatatcc ttctttctgt atggttcgct gtctgctggc
89761 aggtctaatc aagtacactg ttagccagat aattcagagg tcacccatgg cactaaaagc
89821 aactatccat aaagccgcta ttaacattgc agatatggac cgtaattttt tccaggatat



89941 ggcatggatt tgccatgctg atgagcgctt gctgtttacc aaaggactca gtgctgatga
90001 tgaacccgaa gtttggcgcc ataatgacca caacggcatt gagctgtgga ttgaattggg
90061 gctacctgaa gagaaacggt tgcgcaaagc ctgtaatcaa tcaaaacaag tcgttctgta
90121 tgcatacagt gagcgtgcgg ccaaagtctg gtggccacaa gttcaggaga aactcgctgg
90181 gcatcgtaac ttaagagtgc gttttcttga tgacgaacaa atggcgaagt tggctgcgct
90241 gtctaaccgc aatatgtcat tacaggcgac actgcaagaa gggactatct ggctatcaga
90301 tgtacagaat aatctggaaa ttagttttgc cgaatggcaa aactacgggc aataagtgtg
90361 ttgattatct ctaatagtgt cacgctggcc agcggcgaga ttgagctgac cgcgatccgg
90421 gcgcagggcg ctggaggcca gcatgttaat aaaacctcaa ctgcgattca tttgcgcttt
90481 gacattaagg cctcaagcct gccagagtat tataaagaaa ggttattgtt gctaaatagc
90541 catttggtga cagctgaagg tgttgtcatc ataaaagcac aggaataccg cagccaagat
90601 atgaaccgtg aggctgcatt ggcgcgcttg gtagcactca ttcgccaggc aatggtggta
90661 gaaaaggccc gtcgagcgac aaaaccaacc aaaggttcaa aaatcaggcg agttgagggt
90721 aaagtccgca aaggggcgac aaaagcgctg cgtggtaaaa tacatcaaga gtgaagtcga
90781 tatttatatg aacgttgata cattgcggtt gatatgactt cattccatat atatcggaca
90841 cagtatggga gaatcactgt gagtaaatta acgataacgc tacttttggc agtaggtgcg
90901 ctgtcattat taggctgtaa taatcgctct caacctgctg aacaggtttt gcaaccaatg
90961 cagcagagct atcagggcgt attaccttgt gcggattgca gtggcttaga cacttctttg
91021 ttcctcgata gtgatgggac ttttatcttg aaagaagttt atctcggcac taaagatggc
91081 gaccaaactt ttgcagaata tggtaaatgg gcacgtactg ctgataagtt agtgttaacc
91141 aacggtcagg gagaaaaacg ttacttccac cctgtcgata aaagcttggt gatgctcgat
91201 cagcaaggtt tacccattaa atcgacactg aattatcagt tggaacccag cgaccagccg
91261 ttatccaaaa cgccaatgcc actaagtggt atgtataaat actttgctga tgccgccata
91321 tttactgatt gtgtaacggg taaggcgttc ccggtagaga ataatatcga actggaaacc
91381 gggtatttaa acgctcgcag aaaccctgga gaacctgtgt ttttgaccct caatgggtat
91441 tttgattcaa ggccatcaat ggaagaaggg cgaacgaata aaacgctgat tccggagggg
91501 gatattcagt ttaatgccaa taaaagttgt gagaaaaaat agcccttgtc catagctaaa
91561 ataagaagcc gcacggtagt tcaagttctc cggcggcttc atcagtaaaa taccccgctt
91621 atggggactc tttatttatg gcgggtatct ttatttatga cgggtatata gccattcagc
91681 gttttatttg gctcagcaag aactcaacaa tttcgctggt cttaatcatt tgtttctcac
91741 cgacgcgtcg gtttttgtat tccacttctt cgctatcgag attacggtca ccaatgacga
91801 tattgtgtgg cacgccaatc aattccatat cggcaaacat cacacctggg cgctctttac
91861 ggtcgtcgag aataacgtca ataccgtgag aacgcagggt ggtgtagagc tcttctgcca
91921 gttctttcac tcggaaagat ttgtgcatat tcataggtag aatagcaacc tggaatggcg
91981 caatagcatc tggccagatg ataccgcggt catcatggtt ctgctcaata gccgccgcca
92041 ccacacggga tacccctatg ccgtagcaac ccatggtcat cacctgatta cggccatctt
92101 cgccctgaac ggtggctttc atgacttcag aatatttggt ccctaactgg aaaatatggc
92161 ccacttcaat accgcgtttg atctgtagag tgcctttgcc atcagggctg atatcgcctt
92221 ctactacgtt acgcaaatca gcaacgagcg gcagcgccaa atcacgctcc cagttaatac
92281 caaagtaatg cttcccatca atattcgcac ccgcaccgaa atcgctcatc accgccacac
92341 tgtggtcagc aataacaggc aatgataaat taaccgggcc taatgaacct gggccagcgc
92401 cgatagctgc acgaatttcc tcttcgctag caaaggtcag tggcttggca acttgtggca
92461 gcttctctgc tttgatctca tttaaatcat gatcgccgcg caccaggagc gcaaccaact
92521 tatgaccact ctcttcgtga gcatgaacca ataaggtttt cacggttttc tcaatgggta
92581 gtttgaattg ctctaccaac tcggcgatgg tttttgcgtt cggggtgtca acgattcgca
92641 gctcttctgt agccggtgcg cgtggttcag ttggcgctaa tgcttcagcg aattcaatat
92701 tggctgcgta atcagagcca gtggagaaaa cgatatcatc ttcgccactt tctgccagca
92761 cttggaattc gtgggatgca ctgccgccga tagagccggt atccgccagt accgcacgga
92821 agttcaaatc catccggctg aaaatcttac tgtacgcggt atacatcgcg tcgtaagttt
92881 cctgtaaaga ctcttgcgta gtatggaagg aatacgcatc tttcatcaaa aactcacggg
92941 cgcgcatgac accaaagcgc gggcgcactt catcacgaaa cttggtctga atttggaaga
93001 aattcagcgg cagttgtttg taagagttaa tctcgccgcg gatcagatca gtaatcactt
93061 cttcatgggt tgggccaaga acaaaagggc gctcgccgcg gtcaacaaaa cgcaacaact
93121 cagggccgta ctgctcccag cggccactct cttgccataa gtccgcgggt tgtaccactg
93181 gcattgatac ttcgatagca cctgcgttat tcatttcttc gcgcacgatg ttttcaactt
93241 ttttcaacac ccgaaccccg gtcggtagcc aggtgtaaag acctgaggcc agtttgcgga
93301 tcatcccggc acggagcatg agttgatggc tgataacttc agcatcggca ggtgtctctt
93361 tctgagtgga gagcagatat tggctagtac gcatgatgtt tttgttccat tagcattgca
93421 attaggggca gccaaaaagt aaaaagtggc ctagtttacc agtgagcacg ggttgtcaaa
93481 agagagcaga gtgaaattta gcgtgggtcg agtgataaca cttctgtttg ctggccaata
93541 acccgccagc gcacattaaa ttccagtaat aaaacggcat attcacggct ttcattatcg
93601 tctttacggt aagctgggcg tggatcttgt gctaaaactt ggctgataaa acggcgaaga
93661 tgtggataac gctgctggtg gtgagctaac tgttgctcgg cggcgggaga aaagcaaact
93721 tgcatctctg tatctggtgc agattgcgca aaaccagcgc gcgcctgagg ttggctctca
93781 gcgaaaggca gataaggttt aatatcaata acaggcgtgc catcaactaa atccaggctg
93841 cccaatgcca gaatcacctc accgttttgg caacgaacgc ctttcaactc aatcagcgac
93901 atgcctatcg ggtttggacg aaaagttgaa cgggtcgcaa atacgcccat tcgtgtattt
93961 ccccccagac ggggagggcg aacagttgga cgccagccac cctccatcgt ctgatggaaa
94021 acgaatatca cccataaatg gctgaagtca gataagccgc gtacgctttc tggttggttg
94081 taaggtgcca gcaactgtaa ttcgccacca ccatcctcaa ccaaaccggg ttgccgtggt
94141 accgcaaact tttctttata cggtgaacgg atcacgccga tctgattaaa agagaatgta
94201 ctcatttaga tgaaacgttc agtgctgaac cctgacacac cgcttgttga tagcagccag
94261 ggacaccact ttgtatttca cattcatgca gtaagacggc atttgctttc atataagatg
94321 cacgaatttg cattctctta cgagcggttg aaatactggg tggagaatcc tgaacggttg
94381 attggcaaga ttcaccagac acttcaccta gatcgcgaaa aggcttacca acaagttctt



94501 taaccggagg tggttgctta atcggattat ctgtggtgga tggctttgat ggaagcaggg 
94561 tacatccagt tagcgataga gctaacagac acagaggtaa cgcacgcata aaaattcctc 
94621 agccttattc atttaagttg tgctattgaa gcaatctatt attcaaataa caagacgggc 
94681 tatagcccgt cttgtcgata tattcaataa aagaggtaat taaagattac cagcctttta 
94741 ctgcgccacc gttgaatgct ttattcgctg catcataaac ttcgtcagac tggtaagcct 
94801 gaacaaattt tttcacattt tccgcatctt tattgtcttc gcgggcaaca atcagattca 
94861 catacggtga ttctttatct tccacaaaca gaccatcttt agctggtgtc aggccaattt 
94921 ggctagcgta agtggtattg ataatcgcca gagcgatttg ttggtcatcc aaagaacgtg
94981 gtaattgagg tgcttccaac tcaaccagtt tcagattttt agggttctca accacatcta
95041 atacggttgg cagcaggcca acaccgtctt tcagcttaat cagaccaacg ctttgtagca 
95101 ataacagaga acggcccaaa ttagtagggt catttggtag ggcgacctga gatccaggtt 
95161 gtagttcatc cagtgacttg attttcttgg aatagccggc aatggggtaa acgaacgagt 
95221 tgccaacaga aaccagtttg tagccacggt ctttgatttg ctgatccaga taaggtttat 
95281 gctggaaagc gttaaggtcg atatcccctt tgctcagtgc ttcgtttggt aacacatagt 
95341 cattgaatgt caccagttca acgtccagac cgtacttttc tttagccact ttctgagcca 
95401 cttcggcaac ttgttgctct gcaccgacaa taacgccaac tttaatatgg tttggcgctt
95461 tttccgtcgg gccgcaaccc accaaagtca aggtaccaat cagcgcacta attgctgcga 
95521 tagatttgaa ttttaaagac atatcccttc ctctatagac attcgttaag atgcgccgct 
95581 gtgtgaggca gtcgctgatt aatcactatt tgtgcgtaac ggctttgact atccgatcac 
95641 cacttaactg aattaggtag accagaatta ctagtaatac taatacagta ttcatcaccg 
95701 tagcgttata cccaatataa ccatactgat aaccgatttg acctaagcca ccggcaccta 
95761 ctgctcctcc catagcggaa tagcctacca gagtaattaa tgtaatcgtt gccgcattca 
95821 ctaagccggg cagggcttct ggtaataata cctttttgat gatctgcatc ggtgttgcgc 
95881 ccatcgcacg tgccgcttca accaaacctg acggaatctc tagcagagcg ttttccacca 
95941 tacgggcaat gaaaggcgcg gctcctaccg ttaacggcac gatagcggcc tgcaatccaa 
96001 tcgatgtgcc gacgatcatg cgtgtaaatg gaatcatcca taccaataag ataataaaag 
96061 gaatggagcg gaaaatattc accacaccag acagcgttct ataaatttta ttgttagcga 
96121 taatttgccc aggacgggtg acatataaca gcacccctac aggtaacccc agaacaaagc 
96181 caaagaaacc ggaaacaaaa gtcatcataa gagtttccca gacgcctcga gccattaacc 
96241 acatcattgc ctcagacata acccagaacc tctactttca catgatgatc ttgtaaaaac 
96301 ttgatggcag ccaggccatc ctgaacgtcg ccatgtagct ctgccaacat gacaccgaat 
96361 ttcaccccac cggcataatc catctgcgag cttaggatac cgatatcaat attgaagcgg 
96421 cgcacggctt gtgagattaa cggagcatca actgacttac cggtgaactc cagcttcaga 
96481 agcgggacgt gatctaccgt tggctccggg ctcatacgtt tggcgtaatc ttccggaata 
96541 tccaaatgca gggtggattg aataaattgt tgtgctaatg gtgtcttggg atgagagaac 
96601 acctcactga cactgtcttt ttcaatcagt ttgccttcac ttattactgc aacttgatca 
96661 cagatgcgtt tcacgacatc catctcatga gtgataagca aaatcgttaa ccccaaacgg 
96721 cggttaatat cttttagcag ctccaatatt gagcgggtgg tggctgggtc taatgcgctg 
96781 gtggcttcat cacacagcaa aactttagga ttactggcca gcgcgcgagc gatcgcgacc 
96841 cgctgttttt gcccaccgga cagattagcg gggtaagcat cttgcttatc ggtcaaccct 
96901 actaaatcca gtaattcatt cactctcttc tttatatctg cacgcgatgt attgtccaac 
96961 tcaagaggca atgcaatatt gccgtataca gtgcgtgatg agagtaaatt aaagtgctga 
97021 aaaatcatgc caatctgacg acgcgcgcgt gttaattggc cttcagacaa ggtggtcaga 
97081 tcttggtcat caaccagaac ttgaccgctg gttggccgtt ctaacatatt agcgcaccga 
97141 atcagggtac ttttacctgc gccagaagcg ccgataacgc cataaatttg cccggcaggg 
97201 acgtgaagac tcacgtctga tagcgcggta atagtgcgcg aaccctgctg gaacactttg 
97261 ctgatgtggg tgagtttaat catattattc ttattttatc gttatttccg tggctagata 
97321 aaaagtggac tttagtagaa ccaaagtgtg gatcggatgt taaggcgtct agacggctaa 
97381 gtcaactgac aacgtggatt ctcttccgtt ctcaataggt aataaaacat gcgatactat 
97441 cggggtttac agccctcagg agtatataag tgactcagtc cgttcccgca atatttttag 
97501 atcgtgatgg tacagttaat gtcgatcatg gttatgtcca cgaaattgac aactttcaat 
97561 ttatcgatgg tgttattgat gcatgccgtg aattgaaaga aatgggtttt gcgttggtgt 
97621 tggtaaccaa tcaatcgggt atcgcccgcg ggatgtttac cgaagaacag tttttaagtc 
97681 taaccgaatg gatggactgg tcattggctg atcgcggtgt tgatttagat ggcatctatt 
97741 tctgcccaca tcatcctgat ggcagtgtcg ctgaattcag tgaaacatgt gaatgccgca 
97801 aaccattacc cggaatgtta ttgcaagcgc agaatgaact taatattgat atggctgctt 
97861 cttatatggt tggcgacaaa attgaagata tgcaggcggc attagcggcg aatattggca 
97921 ctaaagtatt ggttcgaacg ggtaagccag tcactgccga gggtgaagca gcggccgatt 
97981 gggtgctaaa tagtctggca gacctaccaa aagcaataaa agcgcgttat aaatagcctg 
98041 tacgttcaaa taatcaacaa acgaaaaaaa tgtgaagata aacccttgcc attctgaagc 
98101 agctccctat aatgcgcatc catcgacacg gcgctgtgag caacttcaca gagtggccgg 
98161 ggagagcaga gaaaattaac tgattaagtg gttgactctg tagcgggaaa gcgtattatc 
98221 tgcctcccgc gttaccgtaa gattcgccgc aaggcaaacg ggtaacgaac gctctttaac 
98281 aatttatcag acaatctgtg tgggcactcg caagacgata tcgacgcctg tttcggcagg 
98341 cagaaataat atcaagtctt gaagagtgac caaagcagta aacattttga ctccggtcag 
98401 aatgcctatt tgcagaaagt aatctttgag caccgctgct tttacgcaag taaaagtcag 
98461 caaatcaaac aaatcttaaa ttgaagagtt tgatcatggc tcagattgaa cgctggcggc 
98521 aggcctaaca catgcaagtc gagcggcagc gggaagtagt ttactacttt gccggcgagc 
98581 ggcggacggg tgagtaatgt ctggggatct gcctgatgga gggggataac tactggaaac 
98641 ggtagctaat accgcatgac ctcgcaagag caaagtgggg gaccttaggg cctcacgcca 
98701 tcggatgaac ccagatggga ttagctagta ggtggggtaa tggctcacct aggcgacgat
98761 ccctagctgg tctgagagga tgaccagcca cactggaact gagacacggt ccagactcct 
98821 acgggaggca gcagtgggga atattgcaca atgggcgcaa gcctgatgca gccatgccgc 
98881 gtgtgtgaag aaggccttcg ggttgtaaag cactttcagc gaggaggaag gggttgagtt 
98941 taatacgctc aatcattgac gttactcgca gaagaagcac cggctaactc cgtgccagca 



99061 ggcggtttgt taagtcagat gtgaaatccc cgcgcttaac gtgggaactg catttgaaac 
99121 tggcaagcta gagtcttgta gaggggggta gaattccagg tgtagcggtg aaatgcgtag 
99181 agatctggag gaataccggt ggcgaaggcg gccccctgga caaagactga cgctcaggtg 
99241 cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgctgt aaacgatgtc 
99301 gacttggagg ttgtgccctt gaggcgtggc ttccggagct aacgcgttaa gtcgaccgcc
99361 tggggagtac ggccgcaagg ttaaaactca aatgaattga cgggggcccg cacaagcggt
99421 ggagcatgtg gtttaattcg atgcaacgcg aagaacctta catactcttg acatccacag 
99481 aatttggcag agatgctaaa gtgccttcgg gaactgtgag acaggtgctg catggctgtc 
99541 gtcagctcgt gttgtgaaat gttgggttaa gtcccgcaac gagcgcaacc cttatccttt 
99601 gttgccagca cgtaatggtg ggaactcaag ggagactgcc ggtgacaaac cggaggaagg 
99661 tggggatgac gtcaagtcat catggccctt acgagtaggg ctacacacgt gctacaatgg 
99721 cagatacaaa gtgaagcgaa ctcgcgagag ccagcggacc acataaagtc tgtcgtagtc 
99781 cggattggag tctgcaactc gactccatga agtcggaatc gctagtaatc gtagatcaga 
99841 atgctacggt gaatacgttc ccgggccttg tacacaccgc ccgtcacacc atgggagtgg 
99901 gttgcaaaag aagtaggtag cttaaccttc gggagggcgc ttaccacttt gtgattcatg 
99961 actggggtga agtcgtaaca aggtaaccgt aggggaacct gcggttggat cacctcctta 
100021 cctaacgata cgcattgcgc agtgcccaca cagattgtct gatgaatgta aacgagcaag 
100081 agcacctgtt gatgttatga gtttcgactc atgctgatac gaaacggttg aatttcggtt 
100141 tgatcggtat tttcgtgtcc ccatcgtcta gaggcctagg acactgccct ttcacggctg 
100201 taacaggggt tcgaatcccc ttggggacgc caatccgata atgagtgaaa gacattatca 
100261 ccggttcttt atgaactgaa aaataacgta aagatgactt taccaagtcg tgtttacgat 
100321 attgctcttt aacaatctgg aacaagctga aaattgaaac attacagctg aaatttaccc 
100381 cgccgtagat gtattggggt aaagagtaac ctgtaataga gtctctcaaa taatcgcaac 
100441 acaagatgta ctgtcagcaa tgataagaca ccttcgggtt gtgaggttaa gcgactaagc 
100501 gtacacggtg gatgcctagg cagtcagagg cgatgaaggg cgtgctaatc tgcgaaaagc 
100561 gtcggtaagc tgatatgaag cgttataacc gacgataccc gaatggggaa acccagtgca 
100621 atacgttgca ctatcgttag atgaatacat agtctaacga ggcgaaccgg gggaactgaa 
100681 acatctaagt accccgagga aaagaaatca accgagattc ccccagtagc ggcgagcgaa 
100741 cggggaggag cccagagtct gaatcagttt gtgtgttagt ggaagcgtct ggaaagtcgc 
100801 acggtacagg gtgatagtcc cgtacaccaa aatgcacagg ctgtgaactc gatgagtagg 
100861 gcgggacacg tgacatcctg tctgaatatg gggggaccat cctccaaggc taaatactcc 
100921 tgactgaccg atagtgaacc agtaccgtga gggaaaggcg aaaagaaccc cggcgagggg
100981 agtgaaatag aacctgaaac cgtgtacgta caagcagtgg gagcaccttc gtggtgtgac 
101041 tgcgtacctt ttgtataatg ggtcagcgac ttatattttg tagcaaggtt aaccgaatag 
101101 gggagccgta gggaaaccga ctcttaactg ggcgtctagt tgcaaggtat agacccgaaa 
101161 cccggtgatc tagccatggg caggttgaag gttgggtaac actaactgga ggaccgaacc 
101221 gactaatgtt gaaaaattag cggatgactt gtggctgggg gtgaaaggcc aatcaaaccg 
101281 ggagatagct ggttctcccc gaaagctatt taggtagcgc ctcgtgaact catcttcggg 
101341 ggtagagcac tgtttcggct agggggccat cccggcttac caaaccgatg caaactccga 
101401 ataccgaaga atgttatcac gggagacaca cggcgggtgc taacgtccgt cgtgaagagg 
101461 gaaacaaccc agaccgccag ctaaggtccc aaagtcatgg ttaagtggga aacgatgtgg 
101521 gaaggcacag acagccagga tgttggctta gaagcagcca tcatttaaag aaagcgtaat 
101581 agctcactgg tcgagtcggc ctgcgcggaa gatgtaacgg ggctaaacca tgcaccgaag 
101641 ctgcggcagc gacgtttagg cgttgttggg taggggagcg ttctgtaagc cgttgaaggt 
101701 gacctgtgag ggttgctgga ggtatcagaa gtgcgaatgc tgacataagt aacgataatg 
101761 cgggtgaaaa acccgcacgc cggaagacca agggttcctg tccaacgtta atcggggcag 
101821 ggtgagtcga cccctaaggc gaggctgaaa agcgtagtcg atgggaaaca ggttaatatt 
101881 cctgtacttg gtgttactgc gaagggggga cggagaaggc taggctggcc gggcgacggt 
101941 tgtcccggtt taagcgtgta gggggagtga ccaggtaaat ccggttgctt atcaaccctg 
102001 aggcgtgatg acgatgcacc acggtgcaga agcagttgat gccatgcttc caggaaaagc 
102061 ctctaagcat caggtaacat taaatcgtac cccaaaccga cacaggtggt caggtagaga 
102121 atactcaggc gcttgagaga actcgggtga aggaactagg caaaatggtg ccgtaacttc 
102181 gggagaaggc acgctggcgt taggtgaaag gactagctct tggagctgaa gccagtcgca 
102241 gataccagct ggctgcaact gtttaataaa aacacagcac tgtgcaaaca cgaaagtgga 
102301 cgtatacggt gtgacgcctg cccggtgctg gaaggttaat tgatggggtc agccgcaagg 
102361 cgaagctctt gatcgaagcc ccagtaaacg gcggccgtaa ctataacggt cctaaggtag 
102421 cgaaattcct tgtcgggtaa gttccgacct gcacgaatgg cgtaatgatg gccaggctgt 
102481 ctccacccga gactcagtga aattgaactc gctgtgaaga tgcagtgtac ccgcggcaag 
102541 acggaaagac cccgtgaacc tttactatag cttgacactg aacattgagc cttgatgtgt 
102601 aagataggtg ggaggccttg aagtgtggac gccagtctgc atggagccaa ccttgaaata 
102661 ccacccttta atgtttgatg ttctaactcg gccccatgat ctggggtgag gacagtgtct 
102721 ggtgggtagt ttgactgggg cggtctcctc ccaaagagta acggaggagc acgaaggtta 
102781 gctaatcacg gtcggacatc gtgaggttag tgcaaaggca taagctagct tgactgcgag 
102841 agtgacggct cgagcaggta cgaaagtagg tcttagtgat ccggtggttc tgaatggaag 
102901 ggccatcgct caacggataa aaggtactcc ggggataaca ggctgatacc gcccaagagt 

102961 tcatatcgac ggcggtgttt ggcacctcga tgtcggctca tcacatcctg gggctgaagt
103021 aggtcccaag ggtatggctg ttcgccattt aaagtggtac gcgagctggg tttagaacgt
103081 cgtgagacag ttcggtccct atctgccgtg ggcgttggaa gattgagagg ggctgctcct 
103141 agtacgagag gaccggagtg gacgaatcac tggtgttcgg gttgtcatgc caatggcatt 
103201 gcccggtagc taaattcgga agagataacc gctgaaagca tctaagcggg aaacttgcct 
103261 cgagatgagt cttccctggg gcttaaagcc ccctgaagga acgttaaaga ctatgacgtt 
103321 gataggctgg gtgtgtaagt gcagcgatgt attgagctaa ccagtactaa tgatccgtga 
103381 ggcttaacct tacaacaccg aaggtgtttt gaattgagag agattttcag cgtcgttccg 
103441 agattgtgtt gactggctga caagtgttgc ttgtttgcgg gttaattaaa acagaatttg 
103501 cctggcggcc atagcgcggt ggtcccacct gatcccatgc cgaactcaga agtgaaacgc 



103621 aacacgagtt atcagagtga catctggtaa ctgtcaaaat cgccaaaagc ggttttgcac
103681 agcgcggagc gctggccctt atagggtaaa gtatcgggaa gatacacgta aaagaatcgg
103741 tggagcggta gttcagttgg ttagaatacc tgcctgtcac gcagggggtc gcgggttcga
103801 gccccgtccg ttccgccact tattgcaaaa ccctgaatct tagattcagg gttttttgca
103861 tttataaaat agtaacaaag taagctctat cttctatctt ctttatccac ctatttatcc 
103921 attttctctt agtccacttt agctattagt gctatgaagc aggcttattc tcgctcattt 
103981 tgagccaaaa ctttaccgaa attattgatt ttttatcaaa agtattatgt ttaataggcc
104041 attttttgtc gataaaactc ttgccataat gtattcagtt tttcatctgg tagttattta 
104101 gaacattcca ctgccgataa aaattcgtta tgactcgcaa gataaaatga ttgttagcaa 
104161 aaaatgagga actattatga gcatacctgt attcggatta ggaacttttc gccttcaaga 
104221 tcaaattgtt attgattcag tgagccaagc tctgacgttg ggctatcggg cgattgatac 
104281 agcacaaata tatgaaaatg aagcgcctgt tggtcaggca atccaagaaa gtggtattaa 
104341 tcgtgatgaa ctatttatta ctactaagat ctggattgct aatttatcaa aagacaaatt 
104401 aattcccagc ctgcgagaaa gcatacagaa actgaaaact gactatgtag atttaacact 
104461 gattcattgg ccatcaccta atgatgaagt ttctgttgct gaatttatgt ccgagttatt 
104521 aaaagcaaaa gggctaggct taactcgtca aattggtatt tctaacttta ctattgattt 
104581 aatgaagcag gcgattgctg ccgttggtgc agaagagatt gcaactaatc aaatcgagct
104641 ttctccatta ttacaaaacc gtaatgttgt tgattttgcc aaacaaaatg gcattgcagt 
104701 gacttcctat atgacactgg cttatggcaa ggcacttgca gagccagtga ttaaaacgat 
104761 tgctgagcaa catggtgcga cccccgcaca agttattttg agttgggcga tgcaattggg 
104821 ttatggcgtt attccgtcat cgactaaagc tgctaacttg gccagcaact tactagcaca 
104881 gaaattatgc ctgaatgcag cagatatggc gttgatcgcc acattggatc gcaatgagag 
104941 gttggtcagc cctgatggtt tagcacctaa atgggattaa tagcagccta agacactgga
105001 cgagccacct tacattgagg tggtttttta ttttggtgca tcctgacggg gctgaaacag
105061 gctcggtgaa aagtgtgcgg atggacatgg agttgccgcc gtcaatgtgc gcggtgatag
105121 taggtactaa tctgatgcgt gcttatcagc gagtgggaga acaaagggta gacaactcga 
105181 tcgttaaaaa acacggtaac gaggccatgg atctgcaatt atcagggtta cggcctgaca 
105241 acgggtcaca agtcgaaagc gctggacaag gtaactgcga ggctagcatc ggtactgaga 
105301 ggatggatac gttacttccg tttaaccgag gtggaagagg ggactggaat agttggatcg 
105361 gtggagataa agaaaattgc gttgcctgct atggcggcag tggaaaaggt cgtatatacg
105421 aacaaagatg cttattagaa gaggtcttct acagaaaagg actgatatat cagcaataga 
105481 ccaacgaggg ctgtggtgga acttgggcgc gagtcatatg aataaagtgg tacgaaagaa 
105541 tgaccttgaa cagggcgggc tgttatcact gagtgcacta aattatttcc tgttaactaa 
105601 cgctgtaata caggagagat attgattctc agtctatgac ttgaattgtg ggcttttttt 
105661 ttaatacatc agcttatgcg tgcgatagat tgaatatgct gttgagtatc cctatatcta 
105721 ggtaatccct tgtggaaaag atattactgg ccgggatatc tgggagtata atcaatagaa
105781 tcccggctgc tattaactat tgagccagtg tgattttaga taaggaatag gcgataagcg 
105841 tgccgaaaag aacatatgca atgcggtatg ttgcaggaca accggttgaa cagatcttcc 
105901 ctggcgccgc taagcaactc gataaaggct tgccgttagg cgaaccatta cctacagctg
105961 aacttttacg agtggtagtg tggaatatct ttaagcagca gcgggctgga tggctacctg 
106021 tacttaagga actcgggcgt gatactcaac tgatgttact gcaagaagca caaactacac 
106081 ctgagttggt gaggtttgcg actgccagct atcaggccgc agatcaggta cctgcttttt 
106141 cattgcccca gcacccttct ggtgtgatga cgctggctgc ggctcatcca gtatattgtt 
106201 gccctttgcg tgagcgtgag ccgctactgc gcttgtctaa atcggcttta gttactgtct
106261 atcctattca tgatggtcgc ttgttaatgg tggtgaatat tcatgcggta aacttcagtc 
106321 ttggtgtcga tgtttacagc aaacagctcg atccgattgg tgatcaaatt gcttcacatc
106381 gtggcccagt tattcttgcg ggtgatttta atgcctggag ccgccaacgt attaatgcct
106441 tgcagcactt tgcacaggac gctggtttac aagaggttga atttcgcgtc gatcaccgca
106501 gtcgtgcgtt cggccggcct ctggacttta ttttttatcg cggccttacc gttattgatg 
106561 cttcagtttt agtgaccaga gcctccgatc acaacccact cattgtcgaa ttccaaccat 
106621 aaaaaagcac ctcgaaaggt gctttttgac agtgattctg gctctacgcc tccggggttg 
106681 atttgtcgtt aacgaataac gtcaatttca agcctggttg taaattgtta gctttgctga 
106741 ctaccgaatt ccatcgcatc acatcgtcgg tatttacacc gtgacgcttg gcaatactgg 
106801 caaaggaatc acctttacga acttgatagg tgatactttt gctggttgta ttgcttgcca 
106861 gttgcagggt ttgcccaact tttaaggtgc tcttggcacg taagttgttc caactctgca 
106921 aatcgctggt cttgatattc aaccgcttgg caatcgtaga taaggtatcg cctgggcgaa 
106981 ctttatactg cgaacttgat gctgattttg tactgttcgt tgccaatttt gtaggttgaa 
107041 cagcagtaat atcagtatct gccaatgagt ctttcaactg ctcagcatgg gctttcggta 
107101 acataatata acggggacca tgtccatccg gtgccgttac attgcgtttg tagccagaat 
107161 taaaggattt cagctttgtc agtgacatcc ccgccatctc agctgcctga gttagctcta 
107221 tttgttgatc gacatcaacg cgagccagcg cacggtcttt gtcagtctca ggcaaggtta 
107281 taccgtactt tttgctgttc ttgagtaaat cacccaaggc cagcattttt ggaacataaa 
107341 ttgacgtttc acgcggaagc gacaatgccc aaaaatcggt cggtttgccc tttgccttat 
107401 tggcttttat cgcttgcata actcggcctt caccgctgtt ataagctgcg actgtcaata 
107461 accagtcacc gttaaacatt ttgttcaagc gttccaatat atcaagcgct gctttggtag 
107521 aggcaacaac atctctgcga ccgtcatacc attgattttg cttcaagcca taatttcgcc 
107581 ccgtacttgg cacaatctgc catagccctg cggcattggc ggatgatgtc gcgtgtgggt
107641 caaaagcgct ctccactatg ggtagcagta ccagttccat cggcatctta cgtttcttaa 
107701 tctgctcgac tatcaggtac atgtacggct ctgcccgtaa tgttacatcg tggagatagc 
107761 tcttattttt taagtatttc tgtttttgtt cacggatccg ggaattctcc ggaaccttca 
107821 tcttcagctc gtctttgatg aagttccaca aatcttgctg cgcgtagctg ctttcacttt 
107881 ccagccagcg cgctgaaccc tctcgggtgc tatttgtgta ctctcctgct tcactcgctt
107941 gacctgccga agacaaactc tgtgcgtgct gttcgggagc ctgcgcgtcc tgcttggacg 
108001 tctggcaccc gacaagcaag actgaggcga gaagtatcgc tttggtcttc atgtgtgtgt 
108061 caatgtggtt gcttaaaaga cgagcaatca tactttgctt cgataaaaaa cacaactaaa 



108181 aggcgaagtt gttcctattt ttctttgtaa atcaatgtca ttgcaacgta aaaaaacatt 
108241 aattttgcgc tcaaattgca attttacggg caggctaggt aggttttttg accgtaattg 
108301 ctcaatttgt tgttgatatg tagcaatatc ttgatcagca ggcaaaattg cgtgggcaaa 
108361 cttaagattt gagagagtat attcatgtgc gcagcatatt aaggtttcgt cgggtaactg 
108421 cgcgagtcgt tgaattgaag cgtacatttg ctctggtgtg ccttctaaga gtcgcccaca 
108481 gccagcagaa aacagggtgt ctccacaaaa aagataagga gaactgtagt atgcgatatg 
108541 acctagagta tggccgggga cggctataat ggtgtaattt tgtccggcaa tggtgaggct 
108601 atcgccatca ttaacaatga cggtggcccc ttttttcgca gtttcttgtg gcccatagac 
108661 gggaatatcg gggaaatggt ggcgtaaatc agcaaccccc cccacgtgat cgttatgatg 
108721 atgggtcaat aatatggctt gtggcacata ttggccttgg gccagcgtgg ctagaactgg 
108781 cgctgactca ccagggtcaa cgataacaca gtgtttttgt cgattagcca gtagccaaat 
108841 gtaattgtct tgaaatgccg gaatgctgat aagattcatt agatacctct cgttggcaac 
108901 agcttgaaag aagataataa aatatgaaac cagcgcaaat acgccaaaat attgagacac 
108961 ctgcatcttg ggctgaatta ccttgggggg aatattaccg cgcggcgctt gaacagcaac 
109021 tgcaaccctg gtggccaaaa ttttttggtt ttcacttgct gaaggttggt catttaagtg 
109081 ctgaaattgc cagcgataaa tgtgcgattg ctcatcaggt taatgtcggg gagtggggca 
109141 aaaatatgca ggtgctcgcc agtccttatc agttaccctt tgctgaaaaa tcggtagatg 
109201 cttgcttact ttcccatact ttggcttatg ccgctaaccc gcaccgtatt ctacgtgaag 
109261 tcgatagggt attaattgat gatggttggc tggttatcag cggtttcaat ccattcagtt 
109321 tgttaggctt gggaaaattg gtgccggtat tacgtcagcg acagccatac atcagccgga 
109381 tgtttactca aatgcgtttg ctggattggc tgagtttgct gaattatgaa gtagtacatt
109441 tatcgtgctt tcacgttttg cccttcaata agaacggtgg gcactttatc agtacccacc 
109501 tgcctgcgtt gggatgtgtc aatctaatta ttgcccggaa acgcacggtg ccattgacgt 
109561 tcaatccgat gaagttcggc gtacgtaaac cttggtttag ccgggccgtt ggtgcaacaa 
109621 aaagttaccg ggatcaaccc tgagtcaagt caatcattcg aatgagtaag agccgctaac 
109681 acgccaagct agtaggatta atcagggtta tagcccgtat catcgagtgt cggtgaatta 
109741 gctccctgcc gggccagttc atcgcaacgt tcgttttctg gatgacccgc atggccctta 
109801 acccactccc actgtatggt atggctctga atagctaaat ccagacgctg ccataagtcc 
109861 acattgcgca ctggcttacg atcggctgtt ttccagccgc gttttttcca gttatggatc 
109921 cattgagtga taccctgacg cacatattgg ctatcagtac ttagggtcac ttcgcaaggc 
109981 gaggtcaatg cctcaagcgc cacaatggct gccatcagtt ccatacggtt gttggtggtc 
110041 aaataatagc cggcactaaa cgttttttcg tgttgtttgt aacgtaaaat tgcgccatag 
110101 cccccagggc ctggattacc gaggcaggag ccatcggtga aaatttctac ctgtttagtc 
110161 atctctggta gactcttcct atctgtttaa acgtcaagtc tgacataaac gagcgctgtg 
110221 agcactgcaa tatgatcatt acaccgacca gacagattgt tctggatact gaaactaccg 
110281 gtatgaataa gcttggcgtt cattatgaag gccatcgtat tattgaaatc ggtgccgttg 
110341 aagttatcaa tcgccgcctg accggcagaa atttccatgt atatgtgaag cccgaccgtt 
110401 tggttgatcc agaagcttac ggggtgcacg ggatcagcga tgagtttttg gctgacaagc
110461 caacatttgc tgatattacg ccggagtttt tagattttat ccgtggcgct gaattagtca 
110521 tccataatgc tgcgtttgat atcggcttta tggattacga gttccggatg ttgcagcaag 
110581 atattccgaa aacggagacc ttctgtacta tcactgatag cctgttgatg gctcgccgct 
110641 tatttcccgg taagcggaat aaccttgatg ccctgtgtga tcgctatcag atagataata 
110701 ccaaacgaac cttgcacggt gcgttacttg atgccgagat tctggctgaa gtttatttag 
110761 caatgactgg cgggcagaca tcgctaactt tctccatgga aggcgaagtg tcgcagaata 
110821 acgcgtctga agatattcaa cgcattactc gcccagcatc ggcattgaaa attatctatg 
110881 ctaccgagga tgaattggct aatcatgaat cccgtttgga ttttgtgatg aaaaagggcg 
110941 gcagttgcct gtggcggatg ccgcttgagg cagaagaaga gacaaaagca gaataaaagc 
111001 acactcagat gaaaaactgt gcaaatggct tatttcactg gaaaaagtgt tgacgccatt 
111061 ccgttctaac cgtactatcc gtctcgttct taaccggaac agcagcgtgg tgcggtagtt 
111121 cagtcggtta gaataccggc ctgtcacgcc gggggtcgcg ggttcgagtc ccgtccgcac 
111181 cgccaaaatt tagaagccct gagttaagta ataactcagg gctttgtcgt ttctgcttaa 
111241 ctcaatcttt gcctatttca gggcgtataa acagagccgc attgttcggc aatgggtcat 
111301 tgcgtgcaat aaagcgataa atactcccgc caagaaacgc cccgataaac caactgaaac
111361 tggagatctg ctgtaggttg gggatgaaaa tgatcatcaa gccaatggcc actgcgggca 
111421 gtaacgccat aatcgctttc ggattaaaac ctccgcgata ccaataacgt cctgatgatg 
111481 aagcattaaa cagatcatct acatataatt ggccgcgttt aatcagataa aaatcagcga 
111541 ttaaaatacc aaatagcggc cccataaagg cacccagcac gtccagggtg tagtggataa 
111601 tgattggtga ttggaacagg ttccatgggg taagcaaaac tgaacctacc gcagctatca 
111661 tcccaccagt acgaaagttg attctttgtg gtgaacagtt agaaaaatca aaggcgggtg 
111721 aaacaaagtt tgcgacgata ttaatcccga tagtggcgat aatcatggtc agtagaccta 
111781 gagcgatagc cacgccattg ccgacgcggg taacggtctc gatagggtcg gtaatcattt 
111841 caccgaacag cgactgtgag ccggagacaa tcaccacggt gacgagagag aacaggagga 
111901 agttaaaggg tagcccccat cggttgcctc ggcggatttc ctgcatactt ttgccatagc 
111961 gggagaaatc accaaagttc agtaatggcc cggcaaaata agagacgacc agcgcggttg 
112021 ccgtcagcat ctgccagacc tgttctgatg ggcttaagat tttactggcg agggtgaatg 
112081 aaatgttact ccagccggtt tgatagacaa tccataacgc caatgacagc atcacgacat 
112141 agactgccgg cccggcaaaa tcaataaagc gcttaatcgc attcatccca tgccagaaca 
112201 ccatcgcctg cagtacccac ataatggaga aacagatcca gccgagggtg gataacccta 
112261 gccattggct ctgagccagg gggatcaggc tggggaagaa tttcagcatc acgagcataa 
112321 aagcattacc cgccagatag gtttgaatgc cataccaggc aacggcgatc agcccacgga 
112381 taacggctgg gatatttgct ccaaataccc caaaagattg tcgacagata acggcatagg
112441 gcacaccggc tatctggctt ggtttggcca ccagattggc gcaaatttgt acaatgcaaa 
112501 taccgactaa tagcgataac agcacttgcc aactggatag cccgagagca aagaagctgg 
112561 cggcgaccac ataacccccc atgctgtgca catccgtcat ccaaaatgaa aagatattgt 
112621 accagcccca gttctgctta cgagttgggg ccagatcttc attagtgagt ttgggactgt 



112741 ccaaaggcaa cattcagatg atgtgagaca aggtttgtcg aacctcacat gagaaatatt 
112801 tacagagtta gggctagaga tatttgcaga actcagacta aaaattgttg tatcagcgga 
112861 acgtaaatga attttatgta caattaacat acaatttata tacataacat atacaatctg 
112921 tgtgtttttt atatacaatc agggaaggga tggaaggcaa caaagggatg gaaggcaaca 
112981 aagggataga aggcagcaaa ggtgatgatg tgggatgtct tgccgtttca acgagtcagt 
113041 ggatagacat aatggagtat gccaatgcat gagactgaac ttaatcatga ttatccacgg
113101 gatttggccg gttacggtgg ccaaccccca gtcgctaact ggccggggca ggcacgtatt 
113161 gcggtgcaat ttgtcctgaa tattgaagaa ggtgcggaaa ataacgttct acatggcgat 
113221 gcgggatctg aacaatttct ttccgatatt atcggtgcgg acagttatcc cgataggcat 
113281 attgtcaacg acggatgaaa agtgatccac ttatatctcc accaacggcc caatattgat 
113341 ccaccgtttt actcaggatt agcttctgct ataaccccgg cctttcgttt ctgtctgagt 
113401 cgatagcttt ctcctttgat ttgaacgaca tgtgagtggt gtaagatacg gtccagcatc 
113461 gctgaggtca gtgctgcatc accggcgaac gtttgatccc actgcccgaa cggcagattg 
113521 gatgtcagga tcattgcgct cttttcgtaa cgtttagcga tgacctggaa gaacagcttt 
113581 gcttcttcct gactgaacgg cagatagcct atttcatcaa tgatgagcag gcggggggcc 
113641 attactccac gctgaagcgt cgttttataa cggccctgac gttgtgccgt agataactga
113701 agtaacagat ctgctgctgt tgtgaagcga actttgatac ctgcacggac tgcttcatag 
113761 cccatcgcta ttgccagatg ggttttcccc acacctgatg gccccagtaa tacgatattt 
113821 tcattacgtt ctatgaagct gagtgagcgt aacgactgga gttgcttctg cggtgctccg 
113881 gtggcgaatg tgaagtcata ctcttcgaac gttttcaccg ccgggaaggc tgccattcgg
113941 gtatacatcg cctgtttacg ttgatgacgt gccagttttt cttcatgaag cagatgctcc
114001 aggaagtcca tataactcca ttcctggtct actgcctgtt gtgacagcgc aggcgctgcg
114061 cttataaggc tttccagttg caactgcccg gcgagcgcca tcagtcgttg atgttgcagt 
114121 tccatcatca cgccactcct ctgcagaatg agtcgtagat ggagagtgga tgatgcaggg 
114181 ggtgtttgtc gaagttcacc agattttcat caagatgcac gtcatactct tttttctccg 
114241 gaggcagtgc cagcatggac tgctgctctt cgagccagcg atcgcaggga cgggcctgga 
114301 ttgtttcatg ctttcgttgg ttagcgacat cgtgcagcca gcgcagaccg tggcggttgg 
114361 ctgtttcaac atcgacagtg atccccatcg ggcgcaggcg agtcattagt gggatgtaaa
114421 aactgttacg ggtgtactgc accatccgtt ccaccttacc tttagtctgt gccctgaagg 
114481 ggcgacacag tcggggagag aagcccatct ccttgccgaa ctgccacagc gaaggatgga 
114541 accggtgctg accggtctga tatgcgtcac gttgcagaac cacagttttc atattgtcat 
114601 acaacacttc gcgcggcaca ccaccaaaga agcggaacgc attacgatgg caggtctcca 
114661 gcgtgtcata acgcatattg tcagtgaatt cgatgtacag cattcggctg tatccgagaa 
114721 cagcaacgaa cacgtgaagc ggtgagcgac cattacgcat agtgccccag tcaacctgca 
114781 tctgtcgtcc gggttcagtt tcgaaccgaa cggcaggctc ctgctcctga ggaaccgaga 
114841 gagaacgaat gaatgccctg agaatggtca ttccgccacg atatccctgg tctctgatct
114901 cgcgagcgat taccgttgcc gggattttgt aaggatgagc atcggcgatg cgttgacgaa
114961 tataatcccg gtattcatcc aggagtgaag caacagcagg tcgcggcgta tattttggcg 
115021 gctcagattt tgcctgcaaa taacgtttaa cggtattgcg ggagatcccc agttctctgg 
115081 caatcgcccg gctactcatt ccctgcttgt gcaggatttt aatttccata actgtctcaa 
115141 aagtgaccat aaactctcct gaatcaggag agcagattac cccctggatc tgatttcagg 
115201 cgttgggtgt ggatcactat tgcaccgttc gttacaacta acagaaataa accatcacca 
115261 tcatggaggg ttaggtcttt gtcggtggtt tttgcttttt gtacttcggt gtgggtgagg 
115321 gggcgtgtag tccgcgccat gcaaggttct tccctgtaat tggtatacgt ccattggtac 
115381 ctatcttagc gtataccaat tgtctctggc ttcaggcgga tatccttgga ctactacaga 
115441 cacaaaaaag cccgcaaacc tagaagggat gcgggctttc cgtacttcac cggacttatc 
115501 tggtaataac cggtttaaca tttgggctgg gggacgctga atagattaat caatgcattg 
115561 tttctattga taaaaaacta attcaaattt gtttcgtata cctaaacgta taccaatgca 
115621 ggaaagtgac acaccaataa ggatattttg gggtggcctt tgcatattgg gctaccaagt 
115681 aggtatcgca aatagtggca tgatttatgg tgaaatcaac ttgcgaaatg tagcctaaag 
115741 gctacaatat attgatgaaa acgatcaaac actatctgac tcccgaggga cgggatttat 
115801 acatggagta tttgaagagt ctccgtgatt cgatcgctaa ggcaaaaata tcatcaagag 
115861 ttaaccggat cgcttcgggg aactttgggg atcataagcc ttgccgtgaa ggtgtctggg 
115921 aacttcgtat agaccagggg ccaggttaca gggtttatta cagccttgtt gcggtgaagt 
115981 cgtgttgttg cttttgggcg gtgataagcg ctcgcagaat gccgacatcg atcaggctat 
116041 tgtgtgtctt aaagattatt taacgaggta atgattatgg ccaaagcccg tttacatgat 
116101 gacgctatgg ttcagcttct tatggaagac cctgaatttg cgcaagtata tctgcatcag 
116161 gctcttctag atattgatga ggaaggaggg caagaagcct ttctgatggc cctacgtcat 
116221 gtagttgagg cccgaggagg gatggcaagt gtggctaaaa aagcgggtgt atcgagagaa 
116281 acactgtacc ggacgttatc accttcaggt aatccaacac taaaaacact gcttagtgtg 
116341 gtatctgcaa cgggatttca gttttcacat cttgcatcga ttaccgcatg attaaatctg 
116401 aaataaaggt aaaaaatcag tttaacagtc cggtattcga agtcattcat cgtatggtct 
116461 tagtgggatt tagtgctaag gccatttaac aagaaattat gtaccgtttt gataaaatca 
116521 gtaccaagcc taaatccttc ccccatctta tcccccattt agcctgtacc acttcttccc 
116581 ccacctggat tttggctctg tgctctcaac ccagtcatga cgcgggtttg ctctctttcc 
116641 cccatttagt gtgtataggc aagtgggaag tgggggattt gaggctaaat tagttaaaaa 
116701 ttcgatgttc ccccacttaa ttcccacttt atcccccagt tagtttcttg ctgtattgct 
116761 taatgaatta gggcaaatat tctccccatc aatatgaatc agcccatcag tttccagctt 
116821 gtcgagccag cgggtaaatt tcttcgcaac atcaaaaccc atgccgcgca tatcatccct 
116881 caccagcgat tttgtacagg cctcaccact ggcggtacgg gagcggatag attgccacag 
116941 cgcgatatga ttagccgtca ggcgctgaat actcgccagt tccgcctcat agggtgagcc
117001 ttcatcgctg acttcacggc ctttatcaca tagcaccaat gagttaactt cttcgccatc 
117061 gttatcgata tacaaattga gcggcgagag gtcgtaggcc cgcgtggtgg gttcttctga 
117121 atccttcatt ttggtgcaac tcagcaccag tgcgccgccg tcgccctcac gacgaacatt 
117181 gaattcgaca tccaacgccg cgcgaaaggc actggagccg cgcgcgcctt tatccagatc 



117301 cccctgaata aatgcgccca tatctttggc agcgttctcg tcggagccac caaagcagcg
117361 tgccagcgta tccagtacaa tcaggcgcac cggcgagccg gtttgctcgg tgacatcctg
117421 agccgcttta atcacctgaa tggcgctctc cgggctggcg ggaaataccg ggcagtcgat
117481 gcggtagagc gattcaatcg gtgtgccgcc gttaaattcc attccccacg cgcggatacg
117541 ccgtggcacg ccgataccgc cttctccgac gatatagacg acggaacctt gggtcacacg
117601 ccgcgaggcc cacggcttgc cggtggcaat atgacaggcc catgagacgg ccaaaaagga
117661 tttatacgag ccgcttggcc cgtaaatact ggccagtgtg ttacttggca agtggccctt 
117721 aatattgtaa tcctgcgggg tgttatagcc gtcggagcct ttgcgcaacg gcaggtttgt 
117781 agtcaatgac acaggttcgc gttcgagcag aatgagtggt gttttcatat ccattccctg 
117841 ggattgtaga gttggcggaa aaactcgata cgggcggcgt ccgttccgac ttgttggcga
117901 taatcatccc agtcagcttt gccgcaggtg ggtggcaggg tcactaagcc attgactgca 
117961 agagcggcgt attccgcctg ttgtttgccg ggattgttgt cagagtttgt tacgtcgtta
118021 tccgccgcga taataatctg cgcgtcggga taacgttcac gcagtgccac agcaacctgg 
118081 gtcagattag tggctgagat agcggcaaca actgtggcgg tgacaaatcg tgatatgacg 
118141 aggccggtgg caaagccttc agtaatgatt attgtctgcg acagagtgtg gtgagtgatg
118201 gggataaatg aagatttaat ccgcgagccg ggcagtaaac gcttttctcc ccggcgattg 
118261 attaacagcg caccggtagt gttgccactc agatccacca gtggcagcac catgctgccc 
118321 acattaaaat gcaccccgcc aatattttta gcactgttgg cgggcatgag aaagactgga 
118381 tgatttaagc ccttatgcaa caggtaatca ctggttccct gcgaacaggc agccagcagg
118441 gcagtgactt tttgtgtcat gtctgagtga aggggctgtt cagttctttc tttggcgggc 
118501 gctgttgggg ctaagtctgt taagcgggag atttctctgg cggcctgaat cacatcacat
118561 tgtcgcacca gagtgaccaa atccagacca tcgccgtgcc cgcaatagtt gcagaaccac
118621 gttcctcgcc cttttttatc atcaaaacga aaacggtctt tacctccgca gacggggcag
118681 gggccatgtt tgttaaggtg gggaatagga atagctaagg aggaaaggat attgctccag
118741 tggtgttgtg atgcgcgaat gatgtcggaa acaaacctac tcatggttag cctcctctgc
118801 atccggcact tcatccagca tcagttgcag caaatccagc cgttcagcca gaataaatag
118861 caacagttct ttcgctgtag gatcggtaat ttcgagcatt gccaatacca gtgcgtggca 
118921 gtgatcaaca atttcttcaa tcgtgagcgg ggaagagtta tgcatagcac acccccggat
118981 gggtacgaac gtattgcggg tacagacagt gaagggtagt agccattggg ccgcctccaa
119041 gaagtagttg taatgactac cacctgagac gctaatccct tgggtggtag cccagacgga
119101 gttagcgtta ccggacttct tggaaaccgg cgctcctttc ggagcccccg cctgggccac
119161 cattgacggg tgtgcgcaag ccatacctat aaaagaccat gtctgaatga cgaggctagg 
119221 tatcaagaag attttcggaa cgctaattcc ggctacggat tttgccgcag cgaaaggatt 
119281 atactgcaaa gcgcccgcca gaaaaaaata aaaaatcatt agttgcgacc ctcctcgcgt
119341 tgattgattc tgataatgac ccattcatcg acttcacttt caacccaggc cactgaacgc 
119401 ggccctaatt ttattgggcg agggaattga ttgaggctaa tcaaatgata aattgatgat 
119461 tttttcagtc gtattttctc aatcacttca ggtaagcgga tcaaattaat atcagacata 
119521 gttcgcctcc ttgcgggtat aaacgcgttc aacatagcgg ccacggccat caccatgacc
119581 cagatccatt gacgttaacg ctctggcttc aatgcggcta aatcctgcgg cgaggtaata 
119641 attaatagcg tcttgcgccc aggcatagcg caggctgtgg ggggagtaat gtccggtcag 
119701 ccctaagcgg gtggtatggg agcgccagaa attcatcgcg gttttcaggt caggtttatc 
119761 aataagtttg ccgccgcgtg tttcggcgat ctgttgcgct tctttgactg cccgtttgac 
119821 tgcttcagga tccagaatgc gggtttcgcg tgagcggcct cccttggtgc caaacaccac 
119881 ggttagttta gaccggttct gctcaagctc cttttgccat gttttcaacg agttggcgca
119941 ttgcaccgcc tcttgtgaac gcagccccag caggcgggca agctgaaggg cgcaggcgag 
120001 tcctttgtct tgctgttggg cggtgcgcag agcggcttga tacacctcct cagggatagc 
120061 gaatttgcta ccggcgcggc tggctccacc taaccccaat gctttgttgg ttaatcgttc 
120121 tgataagacg attttatccc gaccagcgaa tctgaaaacg gtgcgcaatg ccgccatttc 
120181 gttatgaaga gaacggatcg cgatcccctg tgaaaaacgg tcagtaatat aactttcaat
120241 atgctttgct tttaggtgtt taacatcacg cacctgaata ttaagggcca gcagatggcg 
120301 gctaaacctg tccataatcc gaatacgatc atgaacggtt ttatggctac cgccagcttg 
120361 ttttgccagc attttcatct cacgacttaa ccgactcatt acgaacatct cccaacaatt 
120421 tcaactagct gcatttctct ggcgcatggg gaatggcagg ttatctgcct tgaacgatac 
120481 ctgtgcgcca gagcgggcag cagttgaggt attgcggggt atcggtgatg cactcggtgc 
120541 agcctcctgc ttttgcaggt catcccccag gctgttggcc tgcctcggta tcggttcaaa 
120601 tatcttgtaa aaagcggtca gcctgacgta ttgaagtggt gtcagatgca tgacgtgttg 
120661 agtgaagtcg cttatctcaa aggctgaggc aggtttgtgc tgtggcgtca ctttcactcg 
120721 taaaagtgac gccacgagca acacgcaaca gggcaggcgc tagcagatgc ctttcggcat 
120781 agcagagcgc agccagcggc gtttagcacg cgccagagat tgaggtataa ggagtctttt
120841 gatgcagtga tgcatcttga tttaagtgtg caaaaacagc gccgtcatgg tagaccaccg
120901 gtttaagagg ccgttgctct actgaggcga gtcgttcgct tagttgataa ccaatgacag 
120961 tatccccgat gggcaatacc ggttggtcag gaaatgctgg taattagcca tttcgttagt 
121021 gtgataagag tacctttatc accttagtgg ggtgacaccc gttttcaaag gttcaacggg 
121081 tttaccgctc ttacagcgct acttttcagg tttcggtagc tattttgtgg tcgctcgaaa 
121141 gcgtcagtct cagaccacac tccattccgt atgactttgg gtttatgacc accacgaagg
121201 tgtttcttgc aggtcattcg tatcatcctg atactggaaa cattgggtgc tgttatcgac 
121261 agcgagttta catgggggaa attacggcaa aaactgaata gcggcaagtt attggaagct
121321 gtaatlagga agagtaatga aaggggggct acatcccttt ctaacctgtg tcatttgagg 
121381 attttccctc tggctagcct gccaaaaaga gcactgcatg catatagtgc atgaatttgc 
121441 atgcaacgcc ccggctaaaa caactgaagc accgcgccac aagggcttgt aggctctcta
121501 tgtcatgcat gaaaagtgca tgataaagtc agagcgcgta ggtgggggac cattgtgcgc 
121561 agacgacctt acacctattg atacgaagaa ggatttcaat cgaaaataac ttcatttcgc 
121621 agttttgaaa ttttctcgta ttgatcttgt aatagtattt ttcttgcgtc taatttaatt
121681 acttgagcat ttgtttttat ttgctctaag gctttgtgtt ccatggcaca tataacatat 
121741 tgcctatcat tatttatgcc gtcagaaatt gattttataa tgttggtata attcccgctt 



121861 tccgctactt gttcataaat tgctaaataa taaccaagga cagcccttgc atcttctgct 
121921 gcaccgccta cttcatatat ttttttataa tcaagagggc tttttatttt agatatgtca 
121981 acgttaactt ttaattttgt tatgtatttc gataatctat ctgcgaagct ttgctttatt 
122041 tcactttttt gttccttgga tagtatgtca ctttgagatt tttttgctgt tttttcttct
122101 gattttttaa tttcaatggt tatatttttt ttactattta tttcgtgtgc gcgtttttca 
122161 aacttatcat tggtgaatga tattaacgaa ttggagtcac tttcaatgac attattataa 
122221 cttcgttcgc tcttattgct ttgctctgta atctcctcat cttttataac tagcaaattt 
122281 attattccgt tgacttctga ctctaaggat tcgagttggt tttttgcaag ttccatatca 
122341 atcaaaatgc tagtgcgatg tgctatagaa ttttcatgta ttgttccaca tgttgggcac 
122401 tctatgtcac catcttccat gttttcgacg gaaaaaatat aatctttatc tagctcacta 
122461 ataatccctt tggctaattt gatttgcgac tcaagtcgta ttttttcaac tcttaagtta 
122521 gttctttctt caagaaggtt ttttttgttt ttttctgatt cttgtataac actaaaaata 
122581 aactcggtcg aaggaagtac attgtctgta tcgataaggt tttgcctgag tattataact 
122641 gcattattca gttcagtaac gcctttctca acttcttcta actctctttt tattacataa 
122701 atatcttttt caacttcaaa gtgtgctttt ataaataatc ctgagtggta ggagatgact 
122761 ggttttgccc agctactata ttgctgaaga ttctcgaaac tatcccaagg tttagcccat 
122821 gttctttttt ggtcaatata aaagggtatg aagtaatatg caggtggagg ggtttctagt 
122881 gttaattgac ctttcttagg gagtaataca ttaaaattaa caagttctga aaaatacttt
122941 gaataatcac cggttatttt tgggaaactg tgtagtttac cattgctatg cctaatttta
123001 atttcatttt tatatctatg aataatgtaa ggtgcattat ttatctcaaa ctctataatt
123061 gatgtacagt ctaatgttct ccaaacctcg gaaaaaatgg gctcacagcc aaaagtccaa
123121 aaaatcattt ttgcaagcgt agatttccct acgctattat catcatctgc tgttatcagt 
123181 gttagatgct ctccgaactc gaattgcctc gcggattttc ctgaatctga gagtatcatg 
123241 atcttcttta gtattaattt tttcataaac cttcatcctt aagcttcgaa ccaatagctt 
123301 gtagtttttt tccattttag tttgttgcct ctaccatttc gtaaatgatc gcagatgata 
123361 tagaaatatt atcctcaaac aaaccaatag tatcttttga taagtctatg attagccttt 
123421 cgagcaattc ataagacatt tcaatttcaa tttttttaga gttcttttta agtaactctt 
123481 gtatttcttt ttgggctttg ataattgcaa gggtcggtga gagtagggtt agtgaatatg 
123541 aatttatttt ttgtattatt tttgccttaa tcgtgccctt taaatttaga tcatctaata 
123601 tgtattttat tgcttcaaaa tttaaatcgg aattaccttt tgagactatt ttggttacta 
123661 ccgtatggat atctttatag gtcagtgctt ttttagtaag agcatcattc caattttcat 
123721 agtcaaaaac gttcttccct ctgatgcgta atgagtccat taatattcta taaatatctt
123781 tcgtacgaag tagatgttgt ggatacactt cttcgacgct atcagtgatc cttgatattg
123841 tatgctcata atagctttct agaggaatgt cagaggtagt gaagtgtatt atttcttgcg
123901 gaaactttcc gaagttagcg agctcatcac ttaggcattt tataatttca ctagcctcat
123961 cactacttaa gctagacagt ttatatgaag ccaagtttaa cttgttgttt ttagtgaggt
124021 tgaatccact tgtagttata aaatctaatt tttcaacctt ttcaaagtat tttttgtcgc 
124081 taactgaact acaaagcttc cctaaaagag atggttttgt accattctct cgcttggtta 
124141 atacttttgc agtataagtt ttctttgcat tggctttaat ttgattgaaa gtaaatttaa 
124201 ctttattttt atctatagaa ttcgcataga caacatcttc atgatattcc atgaagattg 
124261 caaagtcatt gctttctttg tatgaactga atgctttaca aagagcccaa tgatattggt 
124321 attcaaatcg atcataggac tgggcccctg cggactctct ttgcggttcg gacaagggat 
124381 tgtcagatgt gtcttcacta tccattttag tggcctgtcc attttatgag tgatactaaa 
124441 agataatacg tgctaaggga aacgtctatg actttcactt acgttaaata cttcttttag 
124501 tgtatatctc taatccccac caattcatta actcttggcg ttgtatgaga taggtagaac 
124561 gattataagc tttcctgact tcattcttat cactatgcgc tagcgcagcc tcaatcacat 
124621 cagcattaaa cccagcctca ttcatggctg tactggcgat agaacgtaaa ccgtgagcaa 
124681 cgagcttccc tccataacca atacgtttca aggctgcatt tgctgtttgg ctattcatgt 
124741 gattacgtgg gtcattgcga cttgggaata cgtgctgacg attaccacta attggacgca 
124801 ttatttccag caactctaag gcttgttccg acaacgggac tatatggtcg cgctttgctt 
124861 tcattcgttc tgccggtata caccactgct tattctctaa atcgatctct atccaagctg
124921 ttgctgatgc ttcagcgggg cgtatcagag ttagtaattg ccattcaagt aagcagcggg
124981 ttggcagtga gagattgctc atggcaatgg tacgcatgag tttaggtagt tcttcaggtc
125041 tgatggtggg catatgctgc ttcttggggc gttcaaaagc attaccgata cctgaagctg 
125101 ggttagcatc aatcaaacct acattgaccg catagatcat tatctcgttt atacgttgta 
125161 ccaatcgcct gaccgtctct aatgctccac gtgctttgat tggctccaat acttgaatca 
125221 gagttcgtgc tttaagttct tggatgggaa tattttcgat gctgggtaag atgtcttttt 
125281 caatagaacg ccaaatatct ttcgcatggg ctgcgctaac atggctttct ttcaactcga 
125341 accatttgcg ggcgacattc acaaaaatgc tttcctccgc tatctggagc ttttcagcct 
125401 cttcaccagc cctcgtttga gggtctgtcc ctctgactaa catcgctagt ttttctgcgc 
125461 gtatctctct ggcatctgcg agggatagtg ctgggtaagc accgaggctg atcatagtgc
125521 gcttgctact gttgggaagc tgatagcgga aacgccaaat tttcttgccg gtagttttga 
125581 ctagcaaaaa taaaccatca ccatcatgga gggttaggtc tttgtcggtg gcttttgctt 
125641 tttgtacttc ggtgtgggtg agggggcgtg ttgtccttgc catgtatggt tcttccctta 
125701 attggtatac gtccattgat acctatctta gcgtatacca atttgtatac caatagacac 
125761 tggattcagc cggatcctct tggactacta cagacacaaa taaatatcct ccggcatagc 
125821 cggaggtttt tcatatgcgc ctataaggct ctgttaccag ccgcgcccta acaggcgcat 
125881 cgcgatctga catttgcatc tatggattac ttacggcccg taaacgggct accgggatac 
125941 gggatcgaga gttgctcacc cattttatcc tcttccaatt ggtgctttat gtattcttgt 
126001 atcctggccg tgtttttccc taccgtatca acgtaatacc ctcgacacca aaactccctg 
126061 ttacggtatt tgaacttcaa atcgccaaac tgctcataaa gcatcagact gctctttccc 
126121 ttcaggtacc ccataaatcc cgagacactc atcttgggcg ggatctccag aagcatatgg 
126181 atgtgatcca cacagtattc tgcttccagg atattcacgt ttttccattc gcacagtttt 
126241 cttaaaatac tgccaatcgc tctgcgtttt tccctgtaga acacctgcct tcggtacttc 
126301 ggcgcaaaaa ctatatgata tttacagttc catcgggtgt gcgctaagct cttttcatcc 



126421 ttttaacaaa tcaaaagggg tttttataac tgacccaaag ctgaaagctt tactgaaccc 
126481 ccagcctagc tgggggtttt ctgggcacaa aaaagcccgc aaacctagaa gggatgcggg 
126541 ctttccgtac ttcaccggac ttatctggta ataaccggtt taacatttgg tggagctggg 
126601 gggatttgaa cccccgtccg aaattactac accgtcggca ctacatgctt agtccaatca 
126661 ttacattcgc cggccagctg cggatggaca cgctactgac aaactatcct gattagtttt 
126721 aatgcttcca ccccaggcaa ggtttccaca cgagctcttt taggtttgac ctctcttgat 
126781 ccccgtccta agagcggagg ctagggagag agggctctaa gcaggttatt aagctgctag 
126841 tgcgtagttt tcgtcgtttg caactatttt ttttgcggtt ttttacgagg ccaccgcacc 
126901 tcggcatgca ccttgggttt cgcgaatccc gtcgaatcca gaatcagccc caaagaactc 
126961 agctagtata acagaactat gtcctgcgat gccagttact taacgatttg cgtgcttcat 
127021 gatgcgcgct ttgtctaact tccactcacg atctctaata tcatcgcgct tgtcgttatc 
127081 ttttttacct tttgctacgc cgattttaac tttcacccag gcatttttcc aatacatgga 
127141 gagagcgaca accgtgtagc cttctcggtt cactcggcca aacaaggaat ccagttcgcg
127201 tttgttcaac agcagtttac gggtacgcat aggctcgcaa acaacatgag tcgacgcaac 
127261 gttcaacggc gtaattgtgg ccccaaacag gaatgcctca ccgtttttaa acattacgta 
127321 gctgtcactg atatttgctt tgcctgcacg cagtgatttt acttcccaac cttgcagtgc 
127381 aagcccagct tcgaactctt cttcaatgaa gtattcgtga cgggctcgtt tattttgcgc 
127441 aatggttgcg gaaccgggtt tgtatgcttt tttctttgtc atagtatcgg tcattatact 
127501 gtatgcggag gcgaatgaaa tcctttcccg taagatgctt atgtattttg tctcatttgt 
127561 tgcgaggaat gcctggcaga ttttttgtct ggcggtagat aaatggtatt atcagtgcgt
127621 tttatgtctc acaggaaatg ttatgccaca gattagccga tctgcactgg ttccgttcag 
127681 tgtaaaacag atgtatcaac tcgttaatga tgttcgttct tacccagagt ttttacccgg 
127741 ttgtaccggt agccgggtac ttgatgccac tgaaaatgaa atgatagccg ccgtggatgt 
127801 cgccaaggcc gggataagta aaacatttac tacacgcaat acgctaacag ataaccagag 
127861 tatcaatatg cagttggtgg atgggccgtt tcgtaaatta atgggcggat ggcattttac 
127921 accactaagt gcagatgctt gcaaagttga attgcatctt gattttgaat ttacaaataa 
127981 gctaattgaa ttggcattcg gtaaaatatt caaggagttg gctgggaata tggtgcaggc
128041 tttcactcaa cgggctaaag aggtctacag tgcctgatat tcgcgttgag gttgtatatg 
128101 ccttatctga acgccaatac ttgcgcactg tatctttggt ggttggcagt actgttgagg 
128161 atgcgatcaa ggcatcaggc ttactggagt tacggccaga cattgacctg gaaaaaaata 
128221 aagtgggtat ctatagtcgc ccggtaaaac tcggtgacaa actcaatgat ggtgatcggg 
128281 ttgaaatata tcgcccatta atagccgacc ccaaagagct tcgccggcag cgagctgagc 
128341 aggcaaaaaa gtaaggcaca atcagcgcct tattttttgt tgtcactctg ggcagccatt 
128401 gaaagagaaa cgcggactat cttgtggctg agttaacatc aataggttta gaacctaaca 
128461 ccataacccg cattcccaat ctgttctgca attctcaaat aacttagctt ccggttaacg 
128521 cgggctcatt tttgatatca gtcagtacac cactgctatc aaaggtcaac gttaacgttt 
128581 gttgcgttat tttttcgtga ccaggttgtt ggcggaaaac gtagaaccag gtttgagtgc 
128641 caaacggatc ttgtagcatc ggcgtgccta aggtatacgc cacttgttgc tgtgtcatac 
128701 ctttatgaat tttggaggca tcaataggtg ataaatagtt accctgatta atatcaggcc 
128761 ggtagaccac cttctccaac gttgaacaac ccgcagtaag cattgcaagc atcacagcgg 
128821 cggcagtcag cattttacag cgcatagtaa tcacattcct ttagggcata ggatgacgat 
128881 gataatagac cttgcggcag ttggaaacct ttacagggca cctgtatgac caccaaaaag 
128941 gaaaaaagtt aatcttttct atgcagcaag caattctttt gcgtttgcca aggtgttttt 
129001 tgtcacctca ctaccgccca gaagacgcgc caactcttgt aagcgggctt ttttatctaa 
129061 ccgatgcata tgagtttcag tctcagtacc atcggtttgt ttactcacaa agaaatgttg 
129121 atgaccacag ccggcaactt gcggaaggtg ggttacgcac atgacttgag ttgattcgcc
129181 aagttgacgt agcaggcggc caactatggc cgccgttggg ccgctgatac ctacatctac 
129241 ctcatcaaag attaatgctg gggtgtccat tttacgtgct gtaattacct gaatagccaa 
129301 ggcgatgcga gacagctcac caccagatgc aactttgacc agtgcttgta ggggttggcc 
129361 tggattggtg gtgacacaga actcaattcg ggttgcccct tccgcactaa gatgttctgg 
129421 ctcaaatatt gtttcaatgg cgaacttgcc atgtggcatc gatagctcat gcatgctttc 
129481 agtaatcagt gcagccagtt cattggcata atattgacgc tcttggtgta gccgttttgc
129541 tattgccagt gcttgctgat agtgagtatt tactgcatta ctaagttgtt cgtgatcatt
129601 ttcttgttgt gagagcagtt cttgttcatc cagtaattgc tgatgaaatt gtggcaattc
129661 ttccggtgct acatggtgtt tccgcgcgag atttaattgc cgagataacc gtttttctag 
129721 ctcatataga cgatttggat ccatatctaa ttgttccgca tagtgacgta actcatcact 
129781 ggcttcactg atttggattg aggcttcttc taacatattc agtagattat taaattgttc 
129841 gtccatactg gccagctcgg tcaactgatg tttggcagcg taaagctggc tgaggatatt 
129901 gttttgttca tcatctgaca gtaattgtaa agtctgctga ctcagggaca gtaattgccc 
129961 gctatttgcc aaacgtttgt attcaatatc aatttgttca tattcccctg cctgtggaga
130021 aaatgaattc aattctttta gttgatattg taataactct tggcgggcag ttcgttccag
130081 cgactgttgt tggtgaagtg ccagatcacg gcaactttga tgccaaattt gataggctgc
130141 cttcatttct gccaacagag atgattgatt tgcgtaggca tccaataatt gtttttgatg
130201 atcgggtttg agcagcagtt gatgagcgtg ttggccgtgg atctgaatta gatgttggcc 
130261 caactctctt aattgggata cgggtacagg cgtaccatta ataaagcctc ttgaacggcc 
130321 atcagcgcca atagcgcgcc gcaacaggca ttcattattg tcatcgagat ggttgttttc 
130381 cagccattgg cgggcagatg gtgtatccgc cagtgagaag cgggcgcaaa tgtcagcgcg 
130441 tgttgcccct aagcgcacca tgctgccatc tgagcgattc ccgaggcata accctagtgc
130501 atctacggca atggatttgc ccgccccggt ttcgccagtg attgccgtca ttccgggctg 
130561 aaaatcaatt tctaactcac gcacaatagc aaagttgctg atggttaatt ggaccagcat
130621 ggtaaacccc ctgtacataa acacatgtct gtatttgaac acagtataga ctggttttat
130681 atacagtaaa gtagctgagg tgatttttag aataattttt ttgaccaacc caatttggtg 
130741 cttaacgtat tgaaatagct atagtccttt ggatgaataa gattgaggtg aaaatcactg 
130801 cgccggatta aaacctcttc cccttcctga atgggtaggg ctatctggct atcgcaactg 
130861 atttccaggt cgctggtaat gtgcgaaaac ttcagtcgga tggtgctact gctgctgata 



130981 gttggtgtca agattgggcc gcctgcggaa agtgaatagg cggtagaacc ggtcggggtt 
131041 gcgatgatca agccatcaga gcgttgagaa aaggcgaagc ggtcatcaat atacacttcg 
131101 aattcgatca tatgggcgac tttgcctgga tgcaaaacca cttcattgat tgcggtactg 
131161 atacggcttt gttgatttgt acgtcgtaca tgcgtttcta gcaggaagcg ttgttcactc 
131221 aggtactcgc cttccaaaac gtctgaaagc tgctgcagag cattatccgg gtctaaatcg 
131281 gtcaagaaac ccagattacc tcggttgaca ccaatcactt ttatatcata acgggcaagt 
131341 acccgtgcag caccgagcat gttgccatca ccgccgacaa caacagccaa atctgctttt 
131401 tggccaatat ctgccaaact ccctgttatg gcatcggtta aattgaggtc atgagcaatc
131461 tgttgttcaa ccataacggc gtagcctctg gcttttagcc agtggtagag tatttcatgg
131521 gtggcaagtg cagccgggtg tcgcgggtga ccgacaatac caatacaatc gaatcttcta 
131581 ttattcattg ccgtgatttt ccttaatgga gtgtctttta ccccacatta tgaccggttc 
131641 ccttgaatcc ccggttttga tccccataat aagccaagta gcgagattaa tgctaaaacg
131701 cggagatatc catgagtagt aaagaacaga aaacgccaaa cgagcaagtc tcggaagaaa
131761 tggaaaatac agcagagcag caagtggaag caacacaaga gacgggggaa tgtgtagatc
131821 cgcgtgttgc agagcttgag gtgcagttgt ctgatgctct acagcgtgag cgcgaaagcc
131881 tgttacgcgc taaggctgaa gttgagaata tccgtcgtcg tactgaattg gatgtcgaga
131941 aagcgcataa atttgcactg gagagattct catctgaatt attgccagtg attgacaatc 
132001 tggagcgcgc tctagatacc gctgataaga ccaatactga actgatctca atgattgaag 
132061 gtgtcgaact gacactgaaa tcactgctag atgctgtcgg taaatttggt attgaagttg 
132121 tcggcgagac ccatgtgccg tttaaccctg aggtgcatca ggccatgaca atgcttgagt 
132181 ctgctgatca cgagccaaac cacgtaatga tggtgatgca aaaaggctat acgctgaatg 
132241 gccggctgtt acgcccagca atggttgcgg tatctaaagc gaaatcgtaa agtgctctaa
132301 cgttaaaagg cacctcaacg gtgcctcaaa ttgctaacaa agtcctggcc tcctagccgg 
132361 gactttgctc taattttacg cggtaacgcg tacctgctta ttttttcaat tggctctgga 
132421 aatcacgttc tgtatagcca gtataaagct ggcgcggacg ggcaatcttg atgccgtcat 
132481 tatgcatttc gttccagtga gcaatccaac cgatggttcg ggcaatggcg aatatcaccg 
132541 taaacattga cgatggaatc cccatggctt tcaggataat acctgaatag aagtccacgt 
132601 ttggatacag ttttttctca atgaagtaag gatcgttcag tgcgatgttt tccaactcca 
132661 tcgccacttc taacaagctg ttattcagct tcagctcttc taacacttca tgacaggttt 
132721 cacgcatgac ggtggcacga gggtcatagt ttttatagac acggtgacca aaacccatca 
132781 gacggaatga gtcattctta tctttcgcgc ggcggataaa ttcagggata tgctcaacag 
132841 ttttaatctc ttccaacatc ttcaggcagg cttcattggc accaccgtgc gcagggcccc
132901 acagggatgc gatccccgcg gcgatacagg caaacgggtt agcgccagaa gagccggcag 
132961 tacggacggt cgatgttgac gcgttctgtt cgtgatctgc gtgcaggatg agaatacgat 
133021 ccatagcgcg ttccagtact gggttaactt catatttctc acatggcgta gagaacatca 
133081 tgtgtaggaa gttacccgcg taggacagat cgttacgtgg ataaacaaac ggttgaccca
133141 gtgaatattt ataacacatc gctgcgacag tcggcatttt agacaacaga cggaaggcgg 
133201 taatttcacg atggcgctcg ttattaacat ccagagcatc atggtagaag gcggctagcg 
133261 caccagtaac cccacatagc acggccatcg gatgtgaatc gcggcggaaa ccatggaaca
133321 gacgggtgat ctgctcgtgg atcatggtat gtcgggtcac tgtggttttg aacgttttgt
133381 actcttctgg cgtaggtgtt tcgccgtaca gcagaatgta acaaacttcc agataagtag
133441 aatcttttgc caactgggca atcgggaagc cacggtgcag caaaataccc tgatcaccat 
133501 caatataggt gatttttgat tcgcaggatg cggtagaggt aaaaccgggg tcaaaggtaa 
133561 aataaccttt agaacctaag gtgcggacgt cgatcacgtc agtgccgagg gttggggata 
133 621 ggacacccag ttcgatcgca gcttcgccta aattcagcgt cgcttttttg tcagccattt 
133681 acagtctcct taagtgctta ttttaaaacc cctaacaacg gaaacatctt agatgcccgg
133741 ggatatcacc ttattagcac tgctctgttt tattcatcat taatactctt ggtacttgaa 
133801 gttgtcgggt tgtcagcgat aactctaatt acgtgagtag ataacttgat ttagggtaca 
133861 gagtaaattg ccaatacggc ccgcgatccg aaggtttgga tcaacagcta gtgttaaaaa
133921 gttgttagtg atttgtcgga tacttatagt ttatggatca ttgcctacga tatctatggg
133981 gatattgact caatacactg ttgcataact taagcttgtg gggaagtgta tccgttgacc
134041 tttagtgcat cataggcctt atttctaccc tagttgtaat ggaaatgttt attctttgtc
134101 aaatcagata attgatttta tatgaattgt gtaaatcgtg atcctaatca ctgttcaggg 
134161 gaaatgttat caatcagttt gtgtgggaat tgtaataaga atgtgaaccc cctatactgc 
134221 cgccaggtct ccggattacc ctgaagtagg agcacccagc gttatacaga acatgctgtt 
134281 taagcattca ggatgcaata cttaaacgtg cgatgtgaag ggtttcgact cttaacgctg 
134341 tctgattccc cacccaggtc cggaggaagg aaaataataa gagctgtgtg ggcaaaaccg 
134401 tgaaaaaaca aagacctgtc aatttggatc tgcaaacgat tcgatttcct gttactgcga
134461 tagcgtccat cttacaccga gtctctggcg taattacctt cgtggctgtc ggtatcctcc 
134521 tctggctgtt gggtttgtct gtttcctcac aagaaggatt catgcaggca gcaaccatca
134581 tgaatagttt ttttgttaaa ttcatctttt ggggaatact cactgcgctg gcttatcaca 
134641 tttgcggtgg catccgtcac ttgttaatgg attttggcta tctcgaagag aatttagccg 
134701 tgggtacccg ttctgctcaa gtggcaatgg gattaaccct cgtgttgtca gttttggctg 
134761 gagtcctcgt atggtaagca atgcttctgc gttagggcgt aatggagtac acgattggtt 
134821 gttactgcgt gcttctgcaa tcgtcatcac cctgtatgtc ttttatattc tgggcttcgt 
134881 tgtgattgtc ccagacatca cctatgaaat ctggcgcggt ttcttcgcgt cacacatcac 
134941 aaaagtattt acgctgctga cattgctgtc gattctagct cacgcttgga tcggtctatg
135001 gcaggtattg acggactata taaagccact ggcgatacga ctggtgttac aactggtcgt
135061 cgtgattacg ctgttggttt atttattgta tggaacaatt gtggtgtggg gagcttaaat 
135121 gaatctgcca atcagagagt ttgatgcagt tgttgtcggt gcaggtggtg ccggtatgcg 
135181 tgccgccctg caaatttcac aaatgggcct gtcttgcgcc ctgatatcta aagtcttccc 
135241 tacccgttcc cataccgtat ctgcgcaggg ggggattacc gttgcgctgg gtaataccca
135301 tgaagataac tgggaatggc acatgtacga tacggtaaag ggttccgatt atatcggcga 
135361 tcaggatgcc atcgaatata tgtgtaaaac cggtccggaa gcggttctgg aattggagca
135421 catgggttta cctttctccc gtttagagga tgggagcatt tatcagcgtc catttggggg 



135541 cggtcatgcg ttgttacaca ccctttatca gcaaaacctg aaaaaccaca ccactatttt 
135601 ctctgaatgg tatgcactgg atttggtgaa gaatcaggac ggcgcttttg tcggttgtac 
135661 tgctattaat attgaaaccg gcgaagtcgt ctatttcaaa gcacgggcca caatattggc 
135721 aactggcggt gctgggcgta tttaccaatc aacaaccaat gcgcacatta ataccggtga 
135781 cggtgttggt atggccttgc gtgctggagt cccggtgcag gacatggaaa tgtggcagtt 
135841 ccacccgacc ggtattgccg gtgcgggtgt attagtgact gaaggctgcc gcggtgaagg 
135901 ggggtattta ctgaataaac atggtgaacg cttcatggaa cgttatgccc cgaacgccaa 
135961 agatttggcg ggtcgtgatg tagtggcacg ttccattatg attgaaatcc gtgaaggacg 
136021 tggttgtgac ggtccttggg gcccacatgc caaactgaaa ttggatcact taggtaaaga 
136081 tgtgcttgaa tcccgtctgc cgggtattct tgagctgtcg cgcacctttg cacacgttga 
136141 cccgatcaaa gaaccgatcc cggttatccc tacttgccat tatatgatgg ggggaattcc
136201 gactaaagtg acgggtcagg cgattaccgt aaatgaaaag ggtgaggatg ttgtgatccc 
136261 cggcttgttt gctgtaggcg aaattgcctg tgtttccgtc cacggtgcga accgcttggg
136321 agggaactca ttactggact tagtggtatt tggtcgtgct gccggtatgc acctgcaaga
136381 atcattgatg gagcagggcg caagccgcga tgccagtgag tcagatatcg aagcctctct
136441 ggctcgcatg aatcgctgga ataatacccg ttcaggtgaa gatccagttg aaatccgtaa 
136501 agcattgcaa gcctgtatgc agaacaattt ctccgtgttc cgcgaaggcg atgcgatggc
136561 aaaagggttg gaagagctga aaactatccg tgaacgtttg caaaatgccc gtttggatga
136621 cacttccagt gaattcaata ctcagcgtat tgagtgcctg gagctggata acctgatgga 
136681 aactgcgttt tcaaccgctg tgtcggcaaa cttccgtacc gagagccgag gtgcccatag 
136741 ccgcttcgat ttcccggatc gtgatgatgt gaactggctg tgccattcac tgtatttgcc
136801 aggtactgag agcatgaccc gccgtgaggt aaacatgcaa cctaagctac gcgaggcatt
136861 cccgccgaaa gtgcgttctt attgatgcct gtgttgtgtt tctaagggtt aaacccgttg 
136921 cggaggaata ggtcatgaag ctcgagtttt caatttatcg ctataaccca gatgtcgata 
136981 acgcgccgca tatgcaagat tacacgctgg atgcggaaga aggtcgggat atgatgttgc
137041 tggatgcact tatccagttg aaagaaaaag atccgacgtt gtcattccgt cgctcatgtc 
137101 gcgaaggtgt gtgtggttct gatggtctga atatgaatgg caaaaatggt ttggcctgta 
137161 ttacgccaat ctcagctttg caaaaaggca ataaaaagat agtgatccgc cctctgccgg 
137221 ggttaccggt ggtgcgagat ttagtcgtcg atatggggca gttttatact cagtatgaga
137281 aaataaaacc ttacctgttg aatgacggca agaatccacc agcacgtgag cacctacagt 
137341 cgccggaaca acgggcaaaa ctggatggtt tgtatgaatg catattgtgc gcttgctgct 
137401 caacgtcttg tccgtcattc tggtggaatc cagacaagtt tgttggcccg gctgggttac 
137461 tggctgccta tcgtttcctg atagatagcc gtgataccga aacggcctca cgtttggatg
137521 acctggacga tgctttcagt gttttccgct gccatagcat tatgaattgt gtcagtgttt 
137581 gtcctaaagg tcttaatcct accaaagcaa tcgggcatat taagtctatg ctgttacaac 
137641 gtagcgcata gtcgttatcg cgtagcgcat aaaatagcga tagcgctgtg agtgatgatt 
137701 acagtgtagt gaatgacaat tatggcataa cgattgttat aacatagcgg ataatcattg
137761 caacgcagtg cctgatgtga gtttattaac ccggagaggg gattctccgg gtagcttcac 
137821 ggtagggaat ctctaaaagc cggcataatg tctgtttcta gaggttcctt gaacgggacc 
137881 gataaggtcc ataacagaac gtgcattgag cacgtcgagt gaaccgtttt tacggcaaac 
137941 cgttgatatt acggtgtata tgttaaccac ggcgaaaact gaagcttcaa agcttaaggg 
138001 atcataatgc agaacggcgc aatgaaggcc tggctggatt cctcctatct ggcgggcgcg
138061 aaccagtcct acatagagca gctctatgaa gactttttaa ctgatcctgg ttccgttgat 
138121 gatagttggc gttcaatttt tcaacagcta ccaacaacgg gtgtaaaacc tgatcagctc
138181 cactcccaaa cgcgtgagta tttccgtcgc ctggcgaagg attccactcg ttataactcc
138241 tctatcaatg atcctgatat tgatgccaag caagttaagg tattgcagtt aatcaatgcc 
138301 ttccgcttcc gtggccatca gcatgccaat cttgacccac tcggtttgtg gaagcaggag 
138361 tctgttccag acctcgatcc ttcctaccat aatctgaccg aagctgactt ccagaacacc
138421 ttcaatgtcg gttcttttgc catcggcaaa gaaacgatga agctggctga tttgtatgcc 
138481 gcgctgaagc aaacttattg cggttctgtg ggtgctgaat atatgcacat taccaacacc
138541 gaagaaaaac gctggattca gcagcgtatt gaatcggttg ttgggaaacc gacattcagt
138601 gatgcagaga aacgccgttt cttgagtgaa ctgaccgccg ctgaaggtct tgagcgttat
138661 ttaggcgcaa aattcccagg cgccaagcgt ttctctctgg agggggggga ttcactggtc 
138721 accatgctca aagagatgat ccgtcatgcc ggtaaaaatg gcacacgtga agtcgtcctg 
138781 gggatggccc accgtggccg tttgaacgta ttgatcaatg tgctgggtaa gaaaccagaa 
138841 gacttgtttg atgagtttgc tggtaagcat aaagaacatc tcggtactgg tgatgttaag 
138901 taccaccaag gcttctcttc cgatgtcgaa accgaaggcg gtctggtcca cttggcactg 
138961 gcctttaacc catcacattt ggaaattgtc agcccggttg ttattggttc agtgcgtgca 
139021 cgtcgcgatc gcctggatga agcacgcagt aatatggtgc tgccaatcac tattcatggt 
139081 gatgcggcga tcaccggcca gggcgtggta caagaaacat tgaacatgtc acaggctcgc
139141 ggctatgaag tgggcggtac tgtacgcatt gttatcaaca accagattgg tttcacaacc
139201 tcgaacccac tggatgcgcg ctcgactcaa tactgtactg atattgccaa aatggtacag 
139261 gcaccgattt ttcacgtgaa tgctgatgat cctgaagctg tggcatttgt tacccgtctg
139321 gcgttggatt tccgtaatac ctttaaacgt gacgtgatga tcgatctggt ttgttaccgc 
139381 cgtcatggcc ataacgaagc cgatgagcca agtgcaacac agcctgtgat gtatcagaaa 
139441 attaaaaaac atccgacacc gcgtaaaatc tatgctgaca aactgattga gcagaatatc 
139501 gccagcctgg aagatgcgac agaaatggtc aacctatatc gtgatgcatt agatcacggt 
139561 gactgtgtgg tggaagagtg gcgtccgatg aatttgcagt ccttcacttg gtcaccgtat 
139621 ttgaaccatg agtgggatga agcgtatcca agcaaagttg aaatgaaacg cctgcaagag 
139681 ctggcgcgtc ggatcagctc tgcgccggaa gccattgaaa tgcaatctcg ggttgcgaag
139741 atatacagtg atcgcgcgtt gatggccagc ggtgagaagc catttgactg ggggggagcg
139801 gaaacactgg cttacgctac attggtagac gagggcattc ctattcgttt atctggcgaa 
139861 gatgcggggc gtggtacttt cttccaccgc catgcggtca tccataatca gaaaaatggt 
139921 tctgtttatg taccactgtc aaatattcac agtggccagg gtgatttcca ggtttgggac 
139981 tcggtgctgt cagaagaggc agtattggcg ttcgaatacg gctatgccac tgcggagccg 



140101 atcgaccaat ttattagttc tggtgaacag aaatggggtc ggatgtgtgg cttagtgatg
140161 ttgctaccac atggctacga agggcagggg ccggaacact cctccgcacg tctggaacgt
140221 tatttgcagc tttgtgctga acaaaatatg caggtgtgta ttccatcgac ccctgcacag
140281 gtttatcaca tgatccgccg tcaggcatta cgtggtatgc gccggcctct gattgttatg
140341 tcacctaaat cattactacg ccatccatta gcgacatcat ctctggatga actggctaat
140401 ggcagcttct tacccgctat tggcgaaatt gatgagttgg atcctaaagg tatcaaacgt
140461 gtcgtgatgt gctcaggcaa agtctactat gacttgctag agcagcgtcg taaaaatggg
140521 caaacagatg tcgccatcgt gcgtattgaa caactgtatc cgttcccaca tcaggccgtt
140581 caatcggtgt tggaacaata tgcccatgta cacgatttcg tctggtgcca ggaagagcca
140641 cttaaccaag gtgcctggta ttgtagccag cataacttcc gcgaagtgat tccatttggt
140701 gcttccttgc gctacgcagg tcgtccagca tcagcttcac cggcagttgg gtatctttcc
140761 gtacaccaaa aacaacaaca agctctggtt aatgacgcgc tgaatgttga atagctagtc
140821 ggttaacata taaaggatag agaatgagta gcgtagatat taatgttcct gacctccctg
140881 agtctgtagc cgatggctct gtagcaacct ggcacaaaaa accaggcgac agcgtcaaac
140941 gtgatgaagt gctggttgag attgaaacgg ataaagtgat tttggaagtg ccagccagtc
141001 aggacggtat tttggatgcc attctggaag atgaaggtgc cacagtgacg tcacgtcagg
141061 tgctggggcg cattcgtcca agtgatagtt ctggtaaacc gactgaagaa aaaagtcaga
141121 gtacggagtc aactccggct caacgccaaa cagccagctt ggaagaagaa agtaatgaaa
141181 ccctcagtcc ggctattcgc cgcctgattg ctgagcatga tctcgatgca acggcaatca
141241 aaggcagcgg tgttggtggc cgtatcaccc gtgaagacgt tgatagccat ttagcgagcc
141301 gtaaatcggc ctctgcagtg gtagctgacg ctaaagcggt agccgctgct gcacctgtat
141361 tggcggggcg tagtgagaaa cgtgtcccga tgagccgtct gcgtaaacgt gttgctgagc
141421 gtctgttaga agcgaaaaac agcaccgcaa tgttgaccac gtttaacgaa attaatatgc
141481 agcccattat ggacctgcgt aagcagtacg gtgaagcttt cgagaaacgc cacggtgtac
141541 gtctgggctt tatgtccttc tatatcaagg cagtggttga agcactgaaa cgttatcctg
141601 aagtgaatgc ttctattgat ggcgaagatg ttgtctacca taactatttc gatgtcagta
141661 ttgcggtttc tactccacgt ggtttggtaa cgccagtatt acgtgatgta gacacgttga
141721 gcatggctga tattgagaag aaaataaaag aattggctgt aaaaggtcgt gatggcaagc
141781 tgaaagttga agagttgact ggcggtaact ttaccattac caatggtggc gtgtttggct
141841 cactgatgtc gacaccgatc attaacccac cacaaagcgc gatccttggc atgcatgcca
141901 tcaaagatcg cccaatggcg gtaaatggtc aggttgtgat cctgcctatg atgtatttgg
141961 cactttccta cgatcaccgt ttgatcgatg gccgcgagtc tgttggctat ctggtgacgg
142021 taaaagagat gctggaagat cctgctcgct tgctacttga tgtataaact ggctactcga
142081 cgtataaaaa gataactgac aaagctggtc ttgaggatcg gctttatcca gaacactacg
142141 ggcttcggcc cgtctcagac cttatatgga tagaacatca tgaatttaca cgaatatcag
142201 gcaaaacaac tgtttgctcg gtatggcatg ccagctccaa ctggctacgc ctgtaccact
142261 ccgcgtgaag cagaagaagc cgcatctaaa attggtgcag gcccgtgggt ggttaaatgt
142321 caggttcatg cgggcggccg cggtaaagcg gggggcgtaa aactggttaa cagcaaagaa
142381 gatattcgtg ctttcgctga acagtggcta ggtaaaaaac tggtcactta ccaaacagat
142441 gccaacggac agccggtgca tcagatcttg gttgaagcag ccacggatat tgataaagaa
142501 ctgtatctgg gggcggttat cgatcgcagc tctcgccgtg tggtctttat ggcatccact
142561 gaaggtggtg ttgaaattga aaaagtcgcc gaagagacac ctgaactgat ccataaaatc
142621 gcattagatc cattgaccgg accacagcct taccaagggc gtgagctggc atttaaattg
142681 ggcttgaccg gtaaacaagt cggtcagttc accaagatct ttatggggct ggcaacactg
142741 ttcctggaac gtgacctggc gatggtagag attaacccgt tggtggtcac taagcagggc
142801 gatctgattt gtctggatgg taagctcggt gccgatggca atgccttgtt ccgtcagccg
142861 gaactgcgtg aaatgcgcga tccaagccaa gaagatgctc gcgaagcaca tgctgcccag
142921 tgggaactga actatgtggc gttagatggc aacatcggtt gcatggtaaa cggtgccggt
142981 ttggcgatgg gaaccatgga catcgtgaaa cttcatggtg gcgaacccgc taacttcctt
143041 gatgtgggcg gtggtgcaac aaaagagcgc gtgaccgaag cgttcaaaat cattctgtct
143101 gacgataaag tgaaagcggt ttttgttaat atcttcggcg gtatcgtgcg ttgtgacctc
143161 attgctgacg gcatcattgg tgcagttgaa gaagtgggcg ttaacgtacc cgttgttgtt
143221 cgtctcgaag gtaataatgc agagctgggt gccaagaaac tggcagacag cggtttgaat
143281 atcattgctg cgaccagcct gacagatgcg gctcagcaag ttgtagcggc agtgggggct
143341 aaataatgtc tattttaatt gataaaaaca ccaaagttat ttgtcagggt tttacgggca
143401 gccaagggac tttccactcc gaacaagcta tcgcatacgg tacaaaaatg gttggcggtg
143461 ttacgccagg taaaggcggc acacagcatt tggggctgcc agtctttaat accgttcgtg
143521 aagcggtaga aactactggt gctacggcct cggttatcta cgtgcctgcg ccattctgta
143581 aagattcgat tttggaagcg atcgatgccg ggattaaact gattatctgt attactgaag
143641 gtattccgac gctggatatg ctggtggtaa aagtcaggtt ggaacagagc gatgcgcgca
143701 tgatcggtcc gaactgtccg ggtgttatta ctcctggtga atgtaagatc ggtatcatgc
143761 cgggtcacat ccacttgcca ggtaaagtgg gtatcgtttc acgttcaggc acgctgacct
143821 atgaagccgt aaaacagacc accgacattg gtttcggcca atcaagctgt gtgggtatcg
143881 gcggcgaccc gatcccaggc tctaacttta ttgatatcct gaaactattc caggaagatc
143941 cgcagactga agttattgtg atgatcggtg aaatcggtgg taacgctgaa gaagaagcgg
144001 ctgcttatat taaagagcac gtgactaaac ctgttgtggg ttatatcgcg ggtgtgactg
144061 cacctaaagg taaacgtatg ggccatgctg gcgctatcat tgcgggcggt aaagggactg
144121 cagatgataa atttgctgct ctggaagctg cgggtgtgaa aacagttcgt agcctggcag
144181 aaataggtaa tgctgttaaa gctgtattac cgcagtaatc aagaaagtat caggtgttag
144241 tctaaaaatc taacgcgaca tactggccac cttttggtgg ccttttttat acaaatggct
144301 actccggtag ctgttgatac accaatgaat tggcgtgatt tcagtaactg tgtaccatca
144361 tcaaaatgta aaacggtaag tcagaacggc cttttcgttg cagggggcca ggttgttatg
144421 aggtgatagc actttatcac gcaacaaagt agcaacatat acgaacgctt ctctcaggat
144481 gaactaatat aaatatgtaa aaatcaaaac tgtgatttga ctcactctga atttaataaa
144541 tttaaaagtt aatctggatc gcataataat aatttaatta cgcattatat aaaccattaa



144661 aaagatttaa ttgttctaaa tcaacttttg aacgtggtaa taatgttcat ataggcttaa 
144721 attgtcgtag atcaatttac gacattgtta ccaaaatgcc tatttgttga cctatataca 
144781 gaactcaacg ctgggtcacg taaaatccta tttattgtat gggattagaa gcgtagtatt 
144841 accgcggtgt aatattgggt agttgatatt gttatttttg ttgttttttt tgagcattca 
144901 ctgctggtag ttatgaggct tccgatttag cggaggtccg gttgccgcaa gtcggaatct
144961 tcctgcgagg agcaaggagt caagatgttt gatattgtcg aactgtcacg gttacagttt
145021 gccttaacgg caatgtacca tttccttttt gtcccattaa cgctcggtct ggcgtttctg 
145081 ttggcaatta tggaatcggt ttatgtgctg tctggtaaac aaatctataa agatatgacc 
145141 aaattctggg gtaagttatt tgctattaac tttgctctgg gggttgccac tggtttaact 
145201 atggagttcc agtttgggac taactggtca tatttctctc attacgtcgg tgatatcttc 
145261 ggggcccctc tggctatcga aggtttaatg gcgttcttcc tggaatctac gtttgtcggt 
145321 ctattcttct tcggatggga tcgtctgact aagcatcagc atttagctgt aacttggcta 
145381 gttgcgttag gttctaactt ctctgcattg tggatcttgg ttgcgaatgg ttggatgcaa 
145441 aacccgattg catctgattt caactttgaa accatgcgta tggaaatgct gagcttctca 
145501 gaattagttc tgaacccggt tgctcaggtg aaattcgttc ataccgtcgc tgcaggttat 
145561 gttaccggcg cgatgtttat cttgggtatc agttcttact atctgttgaa aggacgtgat 
145621 attccctttg ccaagcgttc ttttgcgatt gctgcaagtt ttggtctggc ctccgtattg 
145681 tctgttatcg tcctgggtga tgagtccggt tacgaaatgg gtgacgtaca gaaaaccaag 
145741 ttagccgcaa ttgaagctga atgggaaact cagccgcctc cagcagcctt tacgctgttt 
145801 gcgatcccta accaggaaac catggagaac cggttcgcca ttcagattcc atatgcatta 
145861 ggattgatcg cgacccgttc actggatacc ccggttatcg gtttgcgtga cttgatgtcg 
145921 caacatgaag tgcgtattcg taacggtatc caggcgtata gtttgttaga acagttacgt 
145981 ggtggtaaca ctgaccctgc tgtacgtgat gcattcaata aagcgaagca ggatcttggt 
146041 tacggcatgt tgcttaagcg ttatacggaa aacgtagctg atgcgactga agaacaaatt 
146101 cagttagctg ctcaagactc tattccgcgc gtactgccgc tttatttcgc cttccgcatc 
146161 atggtggcct gtggcttcct gatgttgctg attattgggc tggcgttcta taacgtggtt 
146221 cgtggccgga ttggtgagaa aaaatggctt ctgcgtgccg cattgtatgg tattccgttg 
146281 ccgtggattg ccgtagaagc aggctggttt gtggccgagt atggccgtca accttgggcc 
146341 atcggtgaag tattgccgac agcagtcgcc aactcatcag tcactgccgg ggaaatactg 
146401 ttctcgatgg cgttgatttg cggtttgtat acgctgttcc tgattgcaga aatgtacctg 
146461 atgttcaaat ttgcgcgcct tgggcctagc agcctgaaaa ctggccgcta tcattttgaa 
146521 cagccgactg ccccagtgca ggaagcacgg taaacaggag tccacttatg tttgattatg
146581 aagtattacg atttatctgg tggctgctga tcggtgtgct gctgattggt ttcgcagtca 
146641 ccgatggttt cgatatgggt gtgggtatcc tgctacgtat catcggtaaa aatgatacag 
146701 agcgccgtat catgattaac tcgattgctc ctcactggga tggtaatcag gtttggttaa 
146761 tcacggcagg tggggcactg tttgcggcct ggccaatggt ctatgccgct gcattttccg 
146821 gcttctatat cgcgatgatt ctggtgctgg cagccttgtt cttccgcccg gtcggttttg 
146881 actaccgttc taagctggag aatagccgct ggcgtaacat gtgggactgg ggcatcttta 
146941 tcggtagctt tgtgccggca gtcgtctttg gtgtggcgtt tggtaacttg ctgcaaggcg 
147001 tacctttcca catgaatgag tatatgcgcc tgttctacac cggtaacttc ttccagctac 
147061 tgaacccgtt tggtttgttg gccggtattg tcagcttgac catgttggtg acgcaaggcg 
147121 cgacttacct gcaaatgcgt acgcgtggtg aactgcacct tcgttcccgt aaggccgctc 
147181 agatctcggc tctggtaatg agtgttgcct tcttgctcgc gggtatctgg ttagttaaag 
147241 gtattgacgg ctttgttatc acttcagtat tggatacagc agctgaatct aacccaatgc 
147301 gcaaagaagt ggctcatcag gccggtgcat ggttgattaa cttcaacaag tatccaatcc 
147361 tgtgggcttt acctgcactt ggcgtgattt tgccactgtt caccattttg ttgtcacgct 
147421 ttgaaaaagg ggcatgggcc ttcttgttct catcactgac gattgcttgt gtgatcttga 
147481 ctgctggcgt aaccatgttc ccgttcgtta tgccatcaag cacgatgcca aatgtcagtc 
147541 tgaccatgtg ggatgcgaca tctagcctgt taacgctgaa agtgatgacg attgttgcga 
147601 taatctttgt gcctatcatc ctgctttaca ccagttggtg ttactacaag atgttcgggc 
147661 gcattgataa agaatttatc gaaaacaaca agcattcact gtactaagga gcataaagca 
147721 tgtggtattt tgcctggatt ctagggacac ttctggcctg ctcctttggt atcatcactg 
147781 ctctggcatt ggagcagagt gaagccaacg cagcagctaa agccgctaac aatggcaacg 
147841 acgatgctgt ctgataagtt gtacgaatta atggataagg gcccattgag ggccctttca
147901 ctggtgctcg cgtttgctct ggctttctgt gtgttttggg accctacgcg ttttgcggcg 
147961 gccaccagct cactggaagt atggcaggaa gtctttatcg tgtgggcggt gtgcactggt
148021 gtgatccatg gcgtcggttt tcgccctaaa caggtgtggt tgcgtgcttt ttttgcacca
148081 cttcctgcaa ttgtgatctt ggctacagga ttattctact ttttcgccta attttccttg 
148141 agcacaatcc tacctatggg ccatccggcc cataattatt ttcgttctat tctcgcaagt 
148201 gattccaaag ctcaacgctc ttcagtatag tgtgttcgtt aattgcatta ctgggaagta 
148261 gagtgagtaa tacgttattt cgatggccgg tacgagtcta ctttgaagat accgatgctg 
148321 gtggcatggt ttaccatgcg cgctacgttg ccttttatga aagggcgcga actgaaatgt 
148381 tgcgccaacg aaattttcat cagcaacaat tgctcagtga gcatgttgct tttgctgtcc 
148441 gcagcatgac ggtagaatac ctggctcctg cacgtcttga tgatatgctg gaggttcaga
148501 gtgaagttac cgctatgcgt ggtgcctctc tcacgtttgc tcaacgaatt ctcgactcac 
148561 atggcaatct tctgagtagt gcagaagtat tgatcgcatg tatcgatcca catcaaatga 
148621 agccaagagc gcttcctaag tctattgtcg cggagtttaa gtagtggctg acatgaacat 
148681 tctggattta ttcctgcagg caagcctatt cgttaagctt atcatgctgg ttctgatggg 
148741 cttttctatt gcgtcatggg ctatcatcat tcaacgaacg cgaattctga atgcggctac 
148801 ccgcgatgct gaagccttcg aagataagtt ttggtccggc attgagcttt ctcgcttgta 
148861 tcaggaaagt caggctcgtc gcgatacact tagcggctca gagcagatct tccattctgg 
148921 tttcaaagag tttgcccgtc tacatcgtgc aaacagccat gcaccagaag cggtgattga 
148981 cggtgcttcc cgtgcgatgc gcatttcaat gaaccgcgag ttagaagcgt tggaaactca 
149041 tattccgttc ctcggtactg tcggttcaat cagcccgtat atcggtctgt ttggtaccgt
149101 atgggggatt atgcatgcct ttatctctct gggcgccgtc aaacaggcga cattacagat 



149221 tcctgctgtt atggcctata accgtctgaa tcagcatgtg aataagcttg aacaaaacta
149281 tgacaacttc atggaagagt tcattgctat tctgcatcgc caggctttcg ccagcaccga 
149341 aagcaaataa gtagggggtt atatggcgcg agtacgtggt cgtaagcgtc gcgagcttaa 
149401 gtctgaaatt aatattgttc ccttactcga cgttttgctg gtgttgttgt tgatattcat 
149461 ggcaaccgca cccattatca cgcaaagtgt cgaggtaaac ttgccagatg cgacggattc
149521 aaaaacggta tccagtgatg ataatccgcc tgtaattgtt gaggtttccg gtgttgggca
149581 atatacagta gtcgttgatc accaacggat ggagctgttg ccttcagagc aagtggtggc
149641 agaagctcaa gcacgattaa aaacgaatcc gaaaaccgtg ttcctgatcg gtggcgcaaa
149701 agaggtgcct tatgatgaaa ttattaaggc gctgaatatg ctgcatcagg cgggtgttac
149761 atcggtgggc ttaatgaccc aaccgatata aatttgcgtt gatattggtt gggctaatcg 
149821 ttagctcaac ccgtagatac cctgttttta gcagttccgt tttttagcaa tagagcgcgg 
149881 acatcgagtg tggaaatcga gcgtgggtaa ggcaaccgag caaaatgata agctgaatcg 

149941 cgccgttatt gtttcggtgg ttctgcacat catattgatt gccctgttga tttggggctc
150001 tctgacgcag acgactgaaa tgggcggtgg cggtgctggc ggcgaagtga ttgacgctgt 
150061 aatggtcgat ccgggtgcgg tgacagagca atataatcgt caacagcaac agcaaacaga 
150121 tgctaaacgt gccgagcagc agcgtcagaa aaaagctgag cagcaagccg aagaattgca 
150181 gcaaaagcag gctgctgaac aacagcggct gaaagaatta gaaaaagagc gtctgcaagc 
150241 gcaagaagat gcgaagctgg ctgctgaaga gcagaaaaag caagttgctg aacaacaaaa 
150301 acaaatcgct gaacagcaaa agcaggccgc tgaacaacaa aaaattgctg ctgctgccgt 
150361 tgcaaaagcg aaagaagagc agaagcaggc tgaaacggct gcggctcagg caaaagctga 
150421 agcagataaa atagtgaaag cgcaggctga agcacagaaa aaagccgagg ctgaagccaa 
150481 aaaagaagct gctgttgccg ctgcggcgaa gaaacaagcc gatgccgatg cgaagaaagc 
150541 ggtagaggtt gccgaaaagg ctgcggctga cgcagcagaa aagaaagcgg ccgcagatgc 
150601 agagaaaaaa gccgctgctg ctaaaaaagt tgctgccgcc gctgaggcca agaaaaaagc 

150661 ggcagcagaa gcggctgcct caacggatgt tgatgatttg tttggtggtt tggcaaacgc
150721 taaaaatgca ccgaagagtg gttcgggtgc aggtgccgct gctgccggta aaggtggcgg
150781 taagaaaagc ggggcatcag gtgctgatat cagcggctac ctggggcaaa tcaccggcgc 
150841 gatacaaagt aaattttatg atgcagatct ttacaaaggc cgtacctgtg atttgcgaat 
150901 caaattggca ccagacggtc tattaatcga tgtgaaagca gaaggggggg acccggcact 
150961 gtgccaggct gctatcgctg cggcaaaaca agcgaaaatt cctaagcctc caagtacgga 
151021 tgtctatgaa caatttaaga atgctccgct tgtgtttaag cctcaataag gtttgattga 
151081 cagggtgtaa cgttgttagt taatgatgga tcatgtagtt ttgtttgcgg tggtttgtta 
151141 aaattctgct aattatcgcg ggcatcccgc tcgggtaagg gagataaaat gaagcaggca 
151201 tttcgagtag cgctaggctt tttagtttta tgggcttcgg tattacatgc agaagttcgg 
151261 attgagatta cccaaggggt agactcagca cgtccgattg gggttgttcc gttcaaatgg
151321 atggggccgg gcacaccacc tgaagaaatt ggcgccatcg taggtgcaga tttacgtaac 
151381 agcggcaaat ttaacccaat tgatgctgct cgtatgccac aacagccttc cacagccgct 
151441 gaagtaaccc ctgctgcttg gactgctttg gggatcgatg ccgttgttgt tggccaagtt 
151501 caaccaagtg ctgacggtag ctatgtggtc tcttaccagc tagtggatac ttctggctct 
151561 gctggcagca ttttggcgca aaatcagtat aaagtgacta aacaatggtt acgttattca 
151621 gcgcacaccg tcagtgatga agtatttgaa aaactcactg ggattaaagg cgctttccgt 
151681 actcgcattg cttatgttgt gaaaactaac ggtggtaaat tcccacatga attacgggtt 
151741 tctgattacg atggctacaa ccagtttgtc gtccaccgct ctcctgaacc tctgatgtca 
151801 cccgcttggt caccagatgg tagcaaaatt gcctatgtga cttttgaaag tggcaaatca 
151861 gcattggtta ttcagacgct ggcaaatggt gcgattaggc aagtggcctc attcccacgt 
151921 cataacggtg cacccgcgtt ctcacctgat ggcaccaagc tggccttcgc gttatctaaa 
151981 agcggtagcc tgaacttgta tgttatggac ttggcttcag gtcaaatcag ccaagtgacg 
152041 gatggtcgta gcaataatac tgaaccaagc tggttcccgg atagccaaaa tctggcttat 
152101 acctcggatc agggggggcg tccacaagtt tataaagtga atattaatgg cggtgtgcca 
152161 cagcgaatca cgtgggaagg ttctcagaac cagaacgcag acgtgagccc tgatggtaag 
152221 ttcctggtat tggttagttc caatggtggg gcacaacaca tcgccaaaca ggatctggaa 
152281 acgggagccg ttcaagtatt aacggacacg ttactggatg aaacgcctag tatcgcgcct 
152341 aacggcacca tggttatcta tagctcgaca caagggctgg gttccgtgtt acagctggtc 
152401 tcgactgatg ggcgtttcaa agcgcgtctt ccggcaactg atggacaggt caaatttcct
152461 gcctggtcgc cgtatctgtg atgcatgtgt aaatacgtat gtatggcaaa ctataaaaaa 
152521 ggatcaaaga gatgcaactg aacaaagtgc taaaagggct aatgttggct ttgcctgttc 
152581 tggctgtggc ggcttgtagt tccaacaaaa gcgcaaataa tgaccaatct ggcatgggcg 
152641 ctggcactgg tacagagaac ggcagcaacc tgtcttctga agagcaagca cgtcttcaga 
152701 tgcaagaact gcaaaagaac aacatcgttt acttcggttt cgataaatac gatatcggtt 
152761 ctgacttcgc tcaaatgctg gatgcacatg ctgcattcct gcgtagcaac ccatctgaca 
152821 aagttgttgt tgaaggccac gcggatgaac gcggtacgcc agaatacaac atcgctttgg 
152881 gtgagcgtcg tgctagcgca gtgaaaatgt acctgcaagg taaaggcgtt tctgctgacc 
152941 agatctctat cgtttcttac ggtaaagaga aaccagcagt actgggccat gacgaagcag 
153001 cctttgctaa aaaccgtcgt gcagttctgg tttactaaga gaatcgcatg aacagtaact 
153061 tcagacgtca cttagtgggt ctgtcgttac tggttggcgt agcggtccca tgggccgcta 
153121 ctgcccaagc gccaatcagt aatgtcggct ccggctcggt agaagatcgg gtcactcaac 
153181 ttgagcgtat ttccaacgct cacagccaac tattaactca acttcagcaa caactgtcag 
153241 attcgcaacg cgatgttgat agtttacgcg gtcagatcca agaaagtcag taccagctca 
153301 atcaagttgt tgagcggcag aaacagatct accaacagat ggagagtctg agcggtggcc
153361 aaggggcgca gaattcagct tctgcggcga gtggtgcaac agcagacaat acagcggctg 
153421 gaagttccgg taatgccgac gcaggcgcag cagcttctac agcggcacca gcagcaagta 
153481 ctggtgatga aaacagcgat tacaatgtgg ctgtctctct ggctcttgag aaaaagcaat 
153541 atgatcaagc gatcactgtt tttcagagtt ttgtgaaaca gtatccaaaa tcgacttacc 
153601 agcccaatgc caactattgg ttagggcagt tgtattacaa caaaggtaag aaagatgatg 
153661 ctgcatatta ttatgctgtt gtcgtaaaaa attatccaaa atccccaaaa agttcagaag 



153781 tttatcagca agtgatcaag caatatccca ataccgatgc ggctaaacag gcacagaaac
153841 gtttatctgc tctttagctt ttttgagccc aggatttagt caatttgact gatttctggg
153901 ctttatcgtg tgaagaacaa gcagttaaac atttcagtag aaatttaagt tgcactgtga
153961 tgataaatta gtaatatatg ccgccgttgc caagacaact gtaaaacgtt aaggcagtag
154021 gaaattattg tcagcagtac aggtaatggg tcgttagctc agttggtaga gcagttgact
154081 tttaatcaat tggtcgcagg ttcgaatccc gcacgaccca ccatcatgtt aagaattacc
154141 gcatcacagc gggtcgttag ctcagttggt agagcagttg acttttaatc aattggtcgc
154201 aggttcgaat cccgcacgac ccaccatcat gtcaagaatt accgcctcac agcgggtcgt
154261 tagctcagtt ggcagaacag ttgactttta atcaatgggt cgcaggttcg agccgagcag
154321 agcgagacaa caacgcgcca gcgttgaccc gaagggcgag tccagaggac gagtcatccc
154381 gcacgaccca ccgttatttc cccaacacac atatttctta atttttcatc ttctcctact
154441 ctcaggtatc tccattttag ctttagccct ataatcctct ggtttctttc ctcactctct
154501 actgatacgt gaattatcac ctcgttttta actctattaa acattttcgc tattaatcga
154561 caaattttgg cgactttttg ccaaatattg gttattttgt ttaggataaa aaacaaatca
154621 attcaattct tactgggtgg ctagcagcga tctgatagct tcagtaatac cacagcaaca
154681 gagatgtgag cgatgagcga aatttttgat gtgaatgcgg ctatttatcc tttcccggcg
154741 cggcctgttc cgttggacac caatgaaaaa gctttttatc gtgagaaaat aaaaacactg
154801 ttaaagcaac gcgacgcagt gttagtcgct cactattaca ctgatcccga aattcaggca
154861 ctggccgaag agacgggggg ctgtgtcgcc gactcacttg aaatggctcg ttttggcaat
154921 aatcacccgg catcaacgct actggttgct ggtgtacgtt ttatggggga aaccgccaaa
154981 attctcaatc cagagaaaaa agtgttaatg ccaactctga atgcggaatg ttcactggat
155041 ttaggatgcc cggttgatga gtttaccgca ttttgtgaca gccatccgga tcgcaccgtg
155101 gtggtgtacg ccaacacctc agcggcagta aaagccaagg cggattgggt tgtaacgtcc
155161 agtattgcgg ttgaattaat tgagcatctc gatagcttgg gcgagaaaat tatttgggca
155221 cctgatcgcc atttgggcag ctacgtacag aagaagagtg gtgccgatgt attgtgctgg
155281 cagggggcct gtatcgtcca tgacgagttt aaaactcagg cgctggcccg gatgaaagcg
155341 ttataccctg atgccgctgt gttggtccac ccagagtcac cacaggcggt ggtggatatg
155401 gctgatgccg ttgggtcgac cagtcaactg atccaagcgg cgaagacact gccgcagaaa
155461 acgttaattg ttgccaccga tcggggcatt ttttacaaga tgcagcaggc ctgcccggat
155521 aaagagttgt ttgaagcacc tactgcaggt gaaggggcga catgccgcag ttgtgcccat
155581 tgtccatgga tggcgatgaa tggcctccga gcaatcgctg aagggctgga gcagggtggg
155641 gtgatgcatg agatacatgt cgatgaagaa ttgcgtcagc aggcgctgat accgttaaat
155701 cgcatgctgg attttgccaa tcaattgaag ttacaggtca aaggcaatgc ctgagtaact
155761 ggtgagaata ttcattcggc gcctcttggc gtcgtgaata atcacctatc atgagatgag
155821 caagagactt ttattttaat aaatcagttc tatcggcagc taaaaggatc accagatgga
155881 ttttctcagc acaggcaaca tattggtaca tataccgctg ggggcgggcg gttacgatct
155941 ttcatggata gaggcaattg gtaccctatt cgggttgctt tgtatctggt ttgccagcaa
156001 agaaaagatc atcaactact tatttggtct gattaacgtg acattgttcg ccgtgatctt
156061 ctttcagatc caactctacg ccagcttgtt attacagttg tttttctttg gtgccaatat
156121 ttacggctgg tatgcctgga gcaaacaaac accggataat caggctgaat taaagattcg
156181 ctggttatcc ttgcctaaag cattggcttg ggctgcagtt tgtattgctg gcatcgtatt
156241 aatgacattg catattgata ccgtatttgc ctggttgaca cgtgtagcag tgacggtgat
156301 gcagagtctg ggggcgaatg tacagatgcc agagttgcaa cctgatgctt tcccattctg
156361 ggactcgacg atgatggtct tgtctattgc tgccatgatc ctcatgaccc gcaaatacgt
156421 agagaactgg ctgatttggg tcgttattga tgtcatcagt gtggcgatct ttgcctatca
156481 gggggtttat gccatggcgc tggagtatgc gttcctgacc cttattgcat tgaacggttc
156541 ttggttgtgg atcaagagtg ctgcacgaaa tcattcacgc ccactgtcta ctcaaggtta
156601 ataacgactc ggctcacctg cgggatagca taaatacggt tcaaatcagt tcctgattcg
156661 gtttcgttta ctgcctgcag caacaataac tttgggtata tgactgtgtc agcggttatt
156721 cgtggtgatg atgacggtga tgtccgtcgg caggggccgc ttgattgata ccacaatccg
156781 gtgttccaca atgttgatac tccatttgaa ctgtggcatg actgatgcgg tagtgatgca
156841 gtaaatagtc ttggatgcgt tgtaataacg catcatggtc cagtggcggt atcacttggg
156901 catgaagggt cattagccgt tgttcaccca cttgccataa gtgaacatga tgaatgttcc
156961 ggacttcata aatattggta cataggtctt tacgcagctt gttgatatca atctcttgcg
157021 gcgcaccctc caataattcg tgaaaacttt ctttcagtaa tcgccaggcg ctacgcaaaa
157081 tcagtaccga caccaatacc gagagaatgg ggtctattgg tgtccagccc gtggttaaaa
157141 tcacgatagc cgcaatcatc gcgcccactg aacctaataa atcacttagc acatgcaagg
157201 ctgctgcccg tacattgatg tttttctctt cttctccctt atgcaggatc caaaaacaaa
157261 aaatattcgc caacagtccg gcgatagcga tgattaacat cggggtaccc atcacctcat
157321 gaggtgaaaa gaagcgatga acggcctccc agacgatcaa tataacaatc agcaataatg
157381 ccgctgcgtt aacaaaggcc gcgagtgtcg tcagccgtag gtagccaaaa gtatggcggg
157441 ggtctggttt tctctgggag aagtgtactg ccatcaacgc gataaataat gctgcagagt
157501 cggtcaacat gtgtccggca tcagccaata atgccagaga accagataac cagccgccaa
157561 tcgcttccgt caccataaat aatgtggtga tggcgaaggc tatcaatagg cgtttgctgt
157621 tgctatcttg cgagaaaatt gtggatacag ccattattta tcctttgttt ccatcacggt
157681 tttattgtgc aaataggcgc aagctatgtt ttttctacag gctatgtttt atctacaagc
157741 tatgtcctct ccacaaggca tactctatct agaattccgt ggattagtta ttcccaatag
157801 acatcgtcgt tgagtatagg ccccaggttc gaatttgcct cagttggcac cggtttactc
157861 ctatataccc aagttacttc gagatgcagg taggcggcaa ccgagtaaat cgccgggagc
157921 ttagatagct aagtgactag ggtgagcatg ggtaattgtt tttgggaacc ggtcgatggc
157981 attgcgacta atctgcactg aatagcgaaa gcgataaagt agatttacag cattccattc
158041 acaggctaga ttggttcaaa tatatattgt gcaatgatcg ctaaagactt aatggaagaa
158101 ctatgaatta ccagaatgat gatttacgga ttaaagaaat taaggaactc ctacctcctg
158161 ttgcgttgtt agagaaattt ccctccacag ttaatgcggc agaaaccgtc gccaagaccc
158221 ggaatgctat tcacaacatc ttgcgtgctc aggatgatcg cttgctggtg gtgattggcc



158341 gtgaagaact gcagggcgag ctggaagttg tcatgcgtgt ttattttgaa aagccgcgta 
158401 caacggtggg ttggaaaggc ctgattaacg atccgcatat ggacaatagc tacgacatca
158461 atgaaggttt acgtctggcg cgtgaattac tcgtcgaaat taacgacagc gggttaccgg 
158521 ccgccggtga gtttttggat atgattacgc cgcagtatct ggctgatctg atgagctggg 
158581 gagccattgg cgcacgtact actgaatctc aggttcaccg cgaactggct tctggtctgt 
158641 cttgccccgt tggttttaaa aatggtaccg acggcaccat taaagtggct attgatgcga 
158701 tcaacgcggc cagtgcccca cactgtttct tgtcggtgac caaatggggt cattcggcga 
158761 ttgtgaatac tgccggcaat ggtgattgcc atatcatttt acgtggtggt aaagagccga 
158821 attacagcag taagcatgtg aatgcggtta aagaagggct gattaaagcg ggcctggaac 
158881 cccagattat gatcgatttc agccacgcta acagttgcaa acagtttcaa aagcagatgg 
158941 aagtgggaac ggatgtttgc caacaaattg cgcaaggtga aaaatctatc ataggcgtga 
159001 tgatcgaaag ccatttagtc gaaggtagcc aaaacctgga aagtggtgaa ccactaacct 
159061 attgtcaacg acggatgaaa agtgatccac ttatatctcc accaacggcc caatattgat 
159121 ccaccgtttt actcaggatt agcttctgct ataaccccgg cctttcgttt ctgtctgagt 
159181 cgatagcttt ctcctttgat ttgaacgaca tgtgagtggt gtaagatacg gtccagcatc 
159241 gctgaggtca gtgctgcatc accggcgaac gtttgatccc actgcccgaa cggcagattg 
159301 gatgtcagga tcattgcgct cttttcgtaa cgtttagcga tgacctggaa gaacagcttt 
159361 gcttcttcct gactgaacgg cagatagcct atttcatcaa tgatgagcag gcggggggcc 
159421 attactccac gctgaagcgt cgttttataa cggccctgac gttgtgccgt agataactga 
159481 agtaacagat ctgctgctgt tgtgaagcga actttgatac ctgcacggac tgcttcatag 
159541 cccatcgcta ttgccagatg ggttttcccc acacctgatg gccccagtaa tacgatattt 
159601 tcattacgtt ctatgaagct gagtgagcgt aacgactgga gttgcttctg cggtgctccg 
159661 gtggcgaatg tgaagtcata ctcttcgaac gttttcaccg ccgggaaggc tgccattcgg 
159721 gtatacatcg cctgtttacg ttgatgacgt gccagttttt cttcatgaag cagatgctcc 
159781 aggaagtcca tataactcca ttcctggtct actgcctgtt gtgacagcgc aggcgctgcg 

159841 cttataaggc tttccagttg caactgcccg gcgagcgcca tcagtcgttg atgttgcagt
159901 tccatcatca cgccactcct ctgcagaatg agtcgtagat ggagagtgga tgatgcaggg 
159961 ggtgtttgtc gaagttcacc agattttcat caagatgcac gtcatactct tttttctccg 
160021 gaggcagtgc cagcatggac tgctgctctt cgagccagcg atcgcaggga cgggcctgga 
160081 ttgtttcatg ctttcgttgg ttagcgacat cgtgcagcca gcgcagaccg tggcggttgg 
160141 ctgtttcaac atcgacagtg atccccatcg ggcgcaggcg agtcattagt gggatgtaaa 
160201 aactgttacg ggtgtactgc accatccgtt ccaccttacc tttagtctgt gccctgaagg 
160261 ggcgacacag tcggggagag aagcccatct ccttgccgaa ctgccacagc gaaggatgga
160321 accggtgctg accggtctga tatgcgtcac gttgcagaac cacagttttc atattgtcat
160381 acaacacttc gcgcggcaca ccaccaaaga agcggaacgc attacgatgg caggtctcca
160441 gcgtgtcata acgcatattg tcagtgaatt cgatgtacag cattcggctg tatccgagaa 
160501 cagcaacgaa cacgtgaagc ggtgagcgac cattacgcat agtgccccag tcaacctgca 
160561 tctgtcgtcc gggttcagtt tcgaaccgaa cggcaggctc ctgctcctga ggaaccgaga 
160621 gagaacgaat gaatgccctg agaatggtca ttccgccacg atatccctgg tctctgatct 
160681 cgcgagcgat taccgttgcc gggattttgt aaggatgagc atcggcgatg cgttgacgaa 
160741 tataatcccg gtattcatcc aggagtgaag caacagcagg tcgcggcgta tattttggcg
160801 gctcagattt tgcctgcaaa taacgtttaa cggtattgcg ggagatcccc agttctctgg 
160861 caatcgcccg gctactcatt ccctgcttgt gcaggatttt aatttccata actgtctcaa 
160921 aagtgaccat aaactctcct gaatcaggag agcagattac cccctggatc tgatttcagg 
160981 cgttgggtgt ggatcactat tgcaccgttc gttacaccta tggccagagt attaccgatg 
161041 cctgtattgg ctggaaagat actgaggcgc tgttgcgcca gttggctgga gcagtgagag
161101 ctcgccgcca gtaggctaac gcgcttgcta ttttgagtct gggcaggtgc gttatcttca 
161161 cgtacaactc gttatcttca tatacagaag agtacgcttt ggtaacggag tgctgactgc 
161221 attcacatag ccgtcaataa aaaaccccga cattattcat gtacggggtt tttttactgt 
161281 gttaatcagg gctaattact tcgctttacc ctggttagca accgctgcag ctttagcggc 
161341 gatttcatca gcgttaccca gatagtagtg tttgataggc ttgaagttct catcaaactc 
161401 atacactaat ggcacgccag ttgggatgtt cagctcaaga atttcgtctt cgcccagatc 
161461 atccagatat tttaccaatg cacgtaaaga gttaccgtga gcagcaataa ttacgcgttc 
161521 accgctggca atacgtggtt tgataacgtc attccagtag ggaatcacgc gttcaatagt 
161581 cagcgccaag ctttctgtgg ttggcagctc agcatcagtc aattttgcat aacgtgggtc 
161641 atggcctggg aaacgctcat cgcttttctc cagtgctgga ggggtgatgg caaagccacg 
161701 acgccattgc ttaacctgct catcgccgta tttttcagcc gtttcagatt tattcaggcc 
161761 ttgcagtgcg ccgtagtgac gctcgttaag cttccaggtt ttttctgttg gcagccatgc 
161821 ctgatctaat tcatccaaaa tgttccacaa ggtgtggata gcgcgtttta atactgaggt 
161881 gtaggcgaaa tcaaaagtga agccttcatc ttttaacagc ttacccgctg cctttgcttc 
161941 agaacggcct ttttcagaca gatcaacatc ataccaaccg gtgaagcggt tttcgttgtt 
162001 ccattgactt tcgccgtgtc gtaccaaaac tagcttagtt actgccataa cgttgactcc 
162061 aattaagttt ggtatatgcc cgtcatactt caagctgcat gtgcgttggc tgcgctcaat
162121 cagccgaatc acttactcca gtaagctcat tggaactctc tcgcttgccg ccttcctgca 
162181 actcgaatta ttttgggtaa taaatacaca ttcaatgata attattatca ttatattgcg 
162241 tgatcactgc gaacagcaat cgattgtgtt taaccatagc caatttttta ctttaacgta 
162301 agaccaatag ccgaatcgat tctgccaaat aggcaactta tcaatatatg ggcattaaat 
162361 accccatatc aacagacccc tgcctcgaaa tgggttaggc cgctttcttg aggcaagtgc
162421 tcataaaggt cttgcgctca tcacctttta gcattttttg ggctgcctcg gcattacaac 
162481 ttttcatttt ctgttgctga ggtgtcaggg cttttccatc aggtgctgtc tgtgctttca 
162541 gacattgact cataaagttt ttccgctcat cacccttgag tgattgagtg ctggcctgtt
162601 gattacaatc ggtcatctgc tgctgctgcg ccgcttgggc tggtgagggc gttttcggcg 
162661 gatctgcggc catcacattg gcactcagca ataacccagc cgataacaac cataggggga 
162721 atagacgcat aggcgttcct ttttatagtc atgaggttta gtaatcaatg tcagaggtct 
162781 aaaaggcggg cagaacgtca gtacggccgc ctgacagatg cagttaagcg tagtcggcag 



162901 gatattagaa agttaacggc cactaccgaa gtgagaaccg ggtggccgtt gcggcatgat 
162961 gtattacgcc aggagtattc gtaatacatc attcataacg tcatcattca taacgtcatc 
163021 atttataaag gcaacgcaat aaagtgataa accgtcgttg agcgataggt cttccccggt 
163081 tttagccaac aatcgggctg tggccattca gggtggttag ggctatcagg caaaaattca 
163141 ctttctagtg ccacacccgc atagctggca taggtgccgc cgttgcgtgc gggtgtgcca 
163201 gcaagataat ttccgctgta aagctgcaac gcaggcgctg aggtgaacat actcatttgc 
163261 acctggttat cggatgacca caagtgtgcc gccggagagt ctaaggcccc acaggtgcgg 
163321 tgcagcaaat aagcgtggtc atagccttta actcgttgct ggcagcgatc ttgtagaaaa 
163381 tcctgcgcca atgttttagg ttggcgaaaa tccataccgg tattctcgac atcagcgagt 
163441 cctccggttg ggatgcccgc actgtcgaca ggcaggtaac ggtctgcaaa cagttgcaaa 
163501 cgatgtgaac gggcatcatg cccttcgccg tcaaggttga aataggcatg attggtcagg 
163561 ttaaccgggc aggttttatc cacctgtgcc tgataatgaa tttccagccg gttatcttcg 
163621 gttagcaggt actgtacttc aacagtcaga ttaccgggga aaccttgatc gccatctggt 
163681 gaatcaagtt tataggtaac ctggtggtca tcttgggtca ctttctgcca acgctgggca
163741 tggaaacctt ccgggccacc gtgtaattgg ttagggcctt ggttcgggac caactggatt
163801 gtttcgccat ccaaggtgat ttgcgcattg gctatccggt tggcataacg gccaaccgtg
163861 gccccgagat aagcggcttg atgcaggtaa tcagccggtg tcgggtagcc cagtagcaat 

163921 tcacgtttct caccattttt cagcggtaat atggccgata accaggttgc cccccattca
163981 taaaggtgac ggtcataccg cttttattct gcaatttggt gaattcaaaa gggtgaccat 
164041 ccggtgctaa cgggtcaaca ctgtttggtg atgccgcgcc gtttttcagc ataaacccgc 
164101 tccttgtgaa gcctgacaga cataaaaagt ctctttcttg ccgccgctgt gtgccgggta 
164161 ttcttgcgca acggtggtgc gaacctgctc gactaattca agcggcatta acgcgataat 
164221 acaaccgcca aaaccgccac ccgtcatgcg caccccacct tgatcaccaa tcactgattt 
164281 cacaatctcg acgagtctat ctattggtgg aaccgtgatc tcaaagtcat cacgcataga 
164341 aatatgagac tcttgcatca attgccccat caatttcaag ttcccggcgg ccaaggcatc 
164401 tgcggctgcc agcgtgcgtg cattctcgct gatcacatgg cgggcgcgtt tagcgacgac
164461 cggatctagc tcgtcttgta ttgagaagaa gaggctcggt tcgacatcac gcaatgcttt 
164521 gacgccaaag aaacgggcgg cagcttcaca ctgttggcgg cgagtattgt attcgctgtc 
164581 aaccaggcca cgttgaatgt ttgagttgat aataacgacg gccatgtttt ccggcattgg 

164641 cacggcacgg gtttccaagg tccgacaatc aatcagcaac gcatggtctt ttttacccaa
164701 tgctgagatt aactgatcca taatgccgca gttacagccg acaaattggt tttctgcctc 
164761 ttgcccattc agcgccagtt ctacaccgct aagagggagt tgatagagtg attgcagggc 
164821 ttggcccacg gccacttcca acgatgcaga ggaactgagg ccagcaccct gcgggacatt 
164881 gccacagata accagatcgg ccccaccaaa atcagcgttg cgcatttgta gatgtttcac

164941 cacaccacgc acgtagtcag cccagcgata ttcaggatgc gggacaatcg gtgcatcaag
165001 agagaatata tcctgctggt tttcatagtc ggcggcaata acacgaatct ggcgatcgtc 
165061 gcgtttgcca caactgatca ccgtttcata attaatggcg cagggcagaa caaagccatc 
165121 gttatagtcg gtatgttcgc caatcagatt gacgcggccc ggcgctttaa tggtgatgtc 
165181 agactcgcgg tcaaactgtt ggcggaaaat agtctgggta tgttgtttta aactcatgat 
165241 ttggctccgg cctctttata gtgaacatca cttactgccc gcaggagtgc tgccgcctgt 
165301 tctgctgtaa ggtctcgctg ggtttcggcc agcagttcat agcccaccat aaacttacgt
165361 actgacgctg aacgcagcag agggggataa aaatgcgcat gtaattgcca atgcgtattg 
165421 tcattcgcgt tataaggcgc gccatgccat cccatagagt aagggaaaga gcatgagaat 
165481 aaattgtcat aacggctggt cagttttttc aaggcaatgg ccaaatcact gcgttgctct 
165541 gccgaaaggt cttccaggcg caatactgcg gctttgggca gtaacagggt ttcaaatggc 
165601 caggctgccc agtaaggcac taccgctagc caatgttctg tttcaaccac ggtccgttga 
165661 ccatcgcggc gttcccgctc gacataatcc agcagcatgg gcgattgatg ttgctgaaaa 
165721 tactgttgct gtaaacggtc ttcctgttct gcttcattcg gcaggaaact gttggcccaa 
165781 atttgcccat gcggatgggg attagagcag cccatcgctg cccctttatt ttcaaacact 

165841 tgcacccatg ggtatatttt gcccaactcg gcgctctgct gctgccaagt ctgaacgacc
165901 tgttccagcg cgggcagtgt cagttgcggc agggttttgc tgtgatcagg agagaagcaa 
165961 atgactctac tggtgccgcg tgcgctttga ctacgcatca gcggatcatg acttggcggc 
166021 gcatctgggg tatcaggcat cagtgcggca aagtcattgg taaagacata agtcgatgag 
166081 taatcaggat tcacatcacc ggtaactcgg gtattgccgg ggcacaagaa gcaatcggga 
166141 tcatgagccg gtaaattctc ggttgcgggt gcttcttgtt ggccttgcca tgggcgttta 
166201 gcccgatgag gcgagaccaa tacccactgg tctttcaacg ggttatagcg acgatgtgga 
166261 tgatccacag ggttaaaatg cgtcataaaa ttgccttaat cggggtagcc ttgaggattt 
166321 tttgattgcc agtgccaagt atctgccgcc atctcatcaa gtgagcggct gactcgccag 
166381 cctagttgtt ccgccgcttt agttgcatca gcccagtagg caggcagatc cccttcgcgg 
166441 cgcggggcaa aatggtaggc cagtggttta ccacaggctt tactgaatgc ggccaccact 
166501 tgcaacacac tgtggccgac accggccccc agattgaaga tatgcacgcc gggcttgccg 
166561 tgtaaggttt tcattgcggc cacatggccg tcagccagat ccaccacatg gatatagtca 
166621 cggacgccag taccatcagg ggttgggtag ccattgccaa agatggccag tgattcacgg 
166681 cggccaacgg cgacttgagc aataaacggc attaaattat tcgggatacc ttgcggatct 
166741 tcgcccatca aacccgatgg gtgcgcaccg acggggttga aatagcgcaa aatcgtcata 
166801 ttccactgtg gatcagccag ttgaacatct tgcagaattt gctcaaccat caatttactg 
166861 cggccatagg ggctggacgg tgaaccggtc gggaagcttt ctacataggg aatctgtggc 
166921 tgatcgccat aaacagtggc tgaggagctg aaaatcaggt tcttaacctg tgcggcgcgc 
166981 atcgcttcta ataacaccaa ggtgccaaac acgttgttgt tgtaatactc gagtggccgg 
167041 ttgacggatt cgccaaccgc tttcagaccg gcaaaatgaa tgacggcatg aataggatga 
167101 gccgcaaaaa tgctatcgag cagtgtgcga tcacggatat cacccgcata aagttcaggc 
167161 gtatagccgg tcaaactgtg tatccgggct aatacgctgg atttactgtt gcagaggttg 
167221 tcgaggatga cgggtttgta gccagcctca atcaattgca cacaggtatg gctaccaatg 
167281 taaccgctac cacctgttac cagaacgtac ataataagac tcctctaatg agttcaatgc 
167341 aataaagata gcacgccatt atgggtaaaa aggtgatcta cgctcaataa tggaatcgtt 



167461 agcatagaca aataatggta aaaatgatgg gcataacaac gattgctgaa gtggtagcgt 
167521 tctcatccat caatgtagcc ggaatgtcct gtttgtggtg cgcgaggttg ctgctcacga 
167581 ggttactgcc ataggttgat aggtcaattt atcttgatgt cagtcaccaa tgggcgatga 
167641 cttaatcgcg gtagtgatcc ctttggttca tttagtgggg taaatagagt ggataggtga 
167701 agaacagtaa atgcattaaa ttcagggcga aatggaaagc cacggcaacc cacaggcggc 
167761 cactccatag ccaggccaag ccataaataa cacctgccag tgcggcgaat aacattaata 
167821 gcgggccacc ggcgaagtgg gcactgccaa agagtaatgc ggtgatgacc aacgctgggt 
167881 aagacccaag ccactgaccg aggcgttgtt gcaagtagcc acggaataag gcttcttcag 

167941 ccagacaaac aaaaaacaga ttggcaataa taaaggagcc tatccaaacc ggggtatgca
168001 gctcaatttt caagccaccg agcgccacag ccagcagcaa taaggcgggt acgctgagta 
168061 tcagcacaac ccaacccatg cgcccgactg atgggtgttt tttcaccttg aacagggttg 
168121 gcagacaggc cagtaagata aaggggacta gcgccttatc cagattgtaa tacatggtga 
168181 agggggcact gagcggcccg gtacggactt tatccagcac tttcaggttg tgaaaaccag 
168241 gaacaaggtg caggaataac gcgataacgc ccagtacgag cagcaactcc aacccagcgg 
168301 cgacgccatt acgttgacga tacttttgca gcagtagcac gaggatgatt atggcaagta 
168361 aaaacgcggt agcagggaag gtgaggacac cgctataaca ggccattgct aatgatgaaa 
168421 ccaaaaataa cagggagagc acctggctca gtggtaaaaa cagtaacgaa gtagcaagaa 
168481 caccccacat agataatccc tttcttattt catcctgatt gatggcatag ccctaaatga 
168541 cggtttagcc taaggcattt ttttgagcta aatcaaccaa tttacacttg gccaccagca 
168601 ctgataagtg gctggtggta agaataataa catgctggca ccagagtaaa ggtcgggtca 
168661 gtgggggttg cattgggtga cgctcagtta aaactgtgaa aaatgcttat tttacctgtg 
168721 gttggcttgg tgatttttct tgttgatagt gataaatatc gccaaatggc acaaaggtta 
168781 agcggtgcgt aatacactgg ggtgcatcct cggcatggtg agaaacgaac aataattggg 
168841 tttctccttc accgatcatc acatcaataa aacggcgcac taacaaacgg ttaagcggat 
168901 ccagcccttg taagggttca tccaggatca ggagcgctgg gtgcttaacc aatgctcggg 

168961 cgatcaagac cagacgctgt tgcccccaag ataagctgtg gaaaggttgg ttggcgacgg
169021 aggtggaaaa ccccagtaaa gacagccatt gatctgctaa ttgttgttga cgatcagagg 
169081 ctgcttggta aagaccaatc gaatcgaaaa aaccggataa aatcacggta cgtacacggg 
169141 tgctaacccg gtagtccaga tggaggctgc cgctgacata accaatatgg cgtttgatgt 
169201 cccagatggt ttcaccactg ccacggcggc ggccaaacag cgttaaatca ttactgtatc 
169261 cttggggatg gtcgccagtt atcaggctga gtaaggtgga tttccctgca ccattctggc 
169321 cgataatctg ccagtgttca ccggggttca cttgccagct caggttatgc aggatagggc 
169381 ggtcgttgta ctccaccacc ccgtcattca gtatgatcaa gggttcattt ttcggtattc 
169441 gtaggtggtg ctgtggatcc tccgtttccg gcagtgacag cccatccaga ttttcactgt 
169501 gtgccaattg ggcaattaac gcttccgaca gaatctgctg gcgcggcccc atgtgggtca 
169561 ggtggcaatc agccaatacg ccaacgtact gcacaaaatc agggatatca tcaaagcggt 
169621 tgaggaccaa taccagggtg acgccttggg ccgaaagtgt ggatagcatc tcggctaact 
169681 gcgcccgagc cgcgacatcc aggccatcaa agggctcatc cagaatcagc aggtccggtt 
169741 gcggcatcag cgcctgacac agcaatgttt tacgggtttc ccccgtagaa aggtatttga 
169801 agcggcgaga aagcaagtgg ctgattccaa atagcgctgc cagttgttga cagcgctgcg 
169861 ggtcacgggc cgcttcctga ataatttcgg cggtagtacg gccagtgtca tcttcatcag 
169921 cactgagcag gtcggtatta ttacgttgcc actcttcatt gaccagttgc tgtagttgtt
169981 cgaatgataa acgcacacca tggcggaagt ggctttggcg gctgccctcc agcagggcca 
170041 gttctccaga cagggccttg gctaacgcag atttaccact gccattggcc ccgacaaatg 
170101 cccaacagtc acccgcctgg atatgtagcg ctggcaagat gagtgtctgt gtgtcactca 
170161 ggcgaaaaca accttgcgat atttgcaact cagacatcat ctttcccata ttttgcaacg 
170221 tagtaaagat gttagggata agcggaaaga cgaaaaatgt caacgaacaa atgtatggtc 
170281 caacagtgat tactgggaat aatcattcaa cacaatgtgg cgataatcac ccgatctgca 
170341 ttgaattgtg caatgacggc actgccaacc cgtaaattca gcgcatggag ttcgctgttg 
170401 ttttgggttg aacacaggtt ggccccgcca gttaaggtga caataacttc gctatgatcg
170461 ttgcccggct caatgctggc gaccgttccc ggcaatgcgt tatccgctgc acctgccagt 
170521 gccggatcgg taaccagttt tacccaaggt gctttaatca atgccagcac ttctttacct 
170581 gctgaaagtt gcaggcgatc cgcgctttgt tgggtgatcg ccgccgttaa gcgagtttta 
170641 ccgtccgata gcagaatatt gacatgctgt tgcacctgct ggtggtcgcg ttcaataagg 
170701 gtaccgaaaa attgattacg ggcgctggtt tgcagtgaaa aacgggaaat tgccgccagc 
170761 aagctgtcca gcggcagtga atcatctttc agggtatcaa aggctttttg ctggatggtt 
170821 gcgagtaaat cataaagctg gattaagcgt tcaccgtagc gggtcagatg tgcaccccct 
170881 ccacctttgc cgccagtggc ccgttcaacc agtatctcct cggacagtgt attcatgtca 
170941 tttatggcat cccaggcact tttatagctg atacccgcga gcttggctcc ctggctgatg 
171001 gaaccggtat gctgaatttg ctttaacagt gcaatacggc gtggatcggc aaacaggcgc 
171061 tgttgaagtt ttagtgtaag aagaatttcg gcctgcattg ttttttgctc cggcgggaaa 
171121 cagttattgt catagatagc taaatggcat agatagctaa acaacataga tagctaaaca 
171181 acatagatag ctaaacaaca tagatagcta aacaacatag atagataaac aacatagata 
171241 gataaacaac atagatagat aaatcgcacg tttggcaggt tacatttttt tatcgtcggt 
171301 taaaataaca gggttaattc attgctgata tcatttacag taaatgatag gcagcagtgt
171361 tggttattgt gataatgtgg gttatcgctc aatgggcagt tatgagaggc tagcatgttt 
171421 gaactattaa aaagtctggt ctttgccgtg gtcatggtac cggtggtgat ggcagtcatt 
171481 ttggggctga tttatgggct gggtgaagtg tttaacgtga tctccaagac gggtcacccg 
171541 aaagaacgga atacccttcg ttcttgagtg ttgctgtacg cttatcagca caagcacaca 
171601 agtctctgat ttctgaatgc ctggcgtata aacgttgggc atttttgtat cattcctcac 
171661 cttaatcccc caccttagtt tctcccacga tacggttacg actcaagctg cctcacatct 
171721 tgccgataaa ctgcatgacc tgcttataat gttacgctat attcctccgt atataacggt 
171781 atcattaatg tggagaacaa catgaagaat caatacggta aagtgagtta ctgggtagcc 
171841 ggtgcggtat tgctgacggc tttttccggc ccagccttcg ctgccgataa gattaccgtg 
171901 tttgccgcgg cctcactgac caatgcgcta caggatattg ctgtccagta taaacaagaa 



172021 cagggggcac cggccgatct gtttatttcc gctgaccaac aatggatgga ttatgccatt 
172081 gataaacagc aaattgtcgc caatactcgc tacaccctat tagggaacga actggtattg 
172141 attgcgccgc aagacagcca gattgataaa gttgagatcg ataagaaaac cgactggaaa 
172201 aagttgctgg aggggggccg cttggcggtt ggtgatcctg accacgtacc ggcgggtatt 
172261 tatgccaaag agtccctgga aaacttaggg gcctggtcaa cgttagcccc ggaaatggcc 
172321 cgtgccaata atgtgagaag cgcaatggcg ctggtcgaac gggcagaagc gccgctgggc
172381 attgtttatg gttcagatgc ggtagccagt aagaaagtta aagttgttgg catattcccg
172441 gaggcaagtc ataagccggt tgaataccca atggcgattg ttaaaggcca tgacaatcca 
172501 acggtaacgg ctttttatga ttacctgaaa agcccggcag ccgcggtgat tttcaaaaac 
172561 tatggattca ccccacgcta atgatattga gtgagtatga atggcaggcg attattctca 
172621 gcctgaaagt ttccggtgtc gcggtagctt gtagcctgcc attgggtatt ctgatggcct 
172681 gggttttagt gcgttgtcgc ttccccggaa aatcgctgct agacagtgtg attcatttgc 
172741 cactggtgct gccaccggtg gtgattggct atctcttgtt gatcagtatg gggcggcgcg 
172801 gttttatcgg cgaatggctc tatagctggt ttggcattaa tttcagtttc agttggcgcg 
172861 gtgcggcctt ggcttcggcg gtggttgctt ttcctttgat ggtcagggcg attcgcttag 
172921 ctctggaggc cgtcgatacc cgtttggagc tggcggcgcg cacgttaggg gcaacgccat 
172981 ggcgggtctt tttcacgata actctgccgc tctctttacc gggggtgatt gccggtacgg 
173041 tgctgtcttt tgcccgctca ttgggggagt tcggtgcgac catcacgttt gtttcgaata
173101 tccccggtga aacccgaacc attcctctgg caatgtacac cctgattgaa acgcccggtg 
173161 cggaagccgc cgctgcgcgc ttgtgtgtga tcgccattat tttatcgttg gtgtcattgc 
173221 tgttatccga atggttggct aattggggta aaaaacgcat gggggcccca tgctagaact 
173281 gaatttcagc caacaattgg gtgatttaca cctacaggtc gccaccgatt taccggcgca 
173341 ggggattacc gcgatttttg gtttatcggg tgccggtaaa acgtcactga ttaatgtgat
173401 aggggggctg actcgcccac agcaaggccg agtgattctg aatggccggg tgctggtgga
173461 tgccgagaaa aatatttacc tgccgccgga aaagcgccgg gttggctacg tgtttcagga
173521 tgcgcgcttg ttccctcatt atcgggtgcg gggtaacttg caatacggta tggcggcgtc 
173581 aatgcgcggg cagtttgacg cgattgtggg cttgctgggc attgagccgt tattgaaccg
173641 cttcccgttc acgctgtcag gtggcgagaa acagcgggtt gccattggtc gtgcattatt 
173701 gaccgctcct gagttgctgt taatggatga gccattggca tcactggatt tgccacgtaa 
173761 gcgcgagctg ttgccgtatc tggagcggct ggcgcaggat gtgaatacgc ccattttgta
173821 tgtcagccac agcatggatg aaattttgcg gctggcggat caagtggtgg tgatggatgc 
173881 cggtaaagtt cgtgccgtgg gggggctgga ggaggtatgg gccagcagtg cactgcggcc

173941 gtggttgcaa cgagaagagc ctagcagcat tttgcgggtg agtgtgattg gtcatcatga

174001 tcgttatgcg atgacggcac tggcgttagg tgaccagcgg ttgtgggtgg gcaagttgga
174061 tgccgctgaa ggcaattcaa tgcgtatccg tattaatgct gccgatgtct cactggcatt 
174121 acagccgccg cacagcagca gtattcgtaa tattttgccg gtaaaagtgg ctgagtgtct 
174181 ggatgtggac gggcaggtcg atgttaagtt agccatcggt gagcaatggt tatgggcgcg 
174241 gatcacgcca tgggcacggg atgaactggg gctgaaacct ggtcagtggg tgtatgcaca 
174301 gataaaaagt gtgtcattta accgccagaa tggccctgta ccagattgag tggtgccgct

174361 ttttgtgtag gcaccaccat gatgaatctg tcgctgtgaa tttatcaggc taatacatgc
174421 tggcggataa cggcggcaat acccggttgc tcgttatctg taatcaccag gtccgcccgt 

174481 tgtttaatcg catcgtcgct gttacccatg gcaaccccta atccagcagc ctctaacata
174541 ctcagatcgt taaagttatc accaaaagcg atcacctctt gcatactcag gccctgagat 
174601 tcaacccact gttggaggcg tttccccttg ctgttccccg cttgggcgat atccacctga 
174661 tcatgccacg accattcgca ggccagaccc atttctgctt ctaccctggc ggcaaacgct 
174721 tttaactgtg cagtatctgg gtgagaggtt gcaaatttcc agatggcggt cgcactgtga 
174781 gcggcatcca gtaagctgtt gacctgaaga aaggtcgggc gctgagcggg cggcaatgac 
174841 tcggcccaac tgagggtgcg ggtcacatgg cccgtgatct gctgatacag catggcatca 
174901 tccacataca tcaggccatg gatctgtgtt ttctccagta attgcaggac ttgaaccgcc
174961 tgttgtggag tgagtgggtt tgaatccagc acttttttat tttgataatc ataaatataa
175021 gtgccgttac agcagatagc gggtgtatcc agttgcagtg cctgataaaa cggatggatc 
175081 gcgacatggt ggcggcctgt tacgacgatc actttcacgc cttcagcccg ggcttgcgcc 
175141 agcgcggaca acgattcagg caggatgcgt tttttgtgat ctaacagtgt tccgtccaga 
175201 tccagtgcga ttatgcgata tgtcatagcg gtccctgtcc ccattaaagg ggcgaatgag 
175261 aaattatatg cttgaaaagt atgttccgat agtacaccgg atgggtgatc gattaaacca 
175321 aaccggggga aaggcctgta taaacagcta aaaagaagcc gcgccgcgag atgcccgaca 
175381 gttgttatgt ccgtgcagtc aggtaagcta ggcttaaatt aaccgttaat aaaagatatt 
175441 tggttaaggc gaaggggctg gaacaggtca attcacctta atagcgttac ccaacaggtt 
175501 cgactaagga gtaaagatga agcaagcggt ttacgtggca agtccagaca gccaacaaat 
175561 tcatgtctgg caattagatt cagccggtga attaacctta ttgcaaaccg tggacgtgcc 
175621 gggtcaggta cagccgatgg caataagccc taatcagcgc catctgtatg tcggggttcg 
175681 tcctgatttt ggtattgtca gctatcacat tgccgatgat ggtacgctga ccgccgcagg 
175741 tatggcaccg cttcccggca gcccgactca cattgacacc gatcgacaag gtcgcttctt 
175801 gttctctgct tcttacagtt tcaattgcgt cagtattagc cctattgata cgcatggcgt 
175861 agtgcaagcg cccattcagc aattggatga cctgcccgcg ccacattcgg cgaatattga 
175921 cccaaccaac cagatcttgc tggttccttg cctgaaagag gataaggtac ggctgtttga 
175981 tttgagcgcg gaggggcagt taacccccca tgcacaggct gatattactg ttgccgccgg 
176041 cgctggccca cgtcatatgg ctttccaccc gaatcatcag gttgcttatt gtgttaacga 
176101 actcaatagc tcggtggatg tgtatcaaat cagcaataac ggtcaggaat accacttagt 
176161 gcagagcctg gacgcgatgc ctgccgattt caccgggaca cgttgggcgg cggatattca 
176221 tatcacgcct aatggccgct atttatacat cagtgaccgt accgccaacc tgttgggtat 
176281 tttcaccgtt tctgaagatg gccgggtgat ttcgttggtc gggcaccacc tgaccgaagc 
176341 ccaaccgcgt ggtttcaata tcgaccacag tggtaatttc ctgattgctt ccggtcagaa 
176401 atctgatcat atcgaagttt atcgtatcga tcagaacacg ggggagctaa ccaccctgaa 
176461 acgttatccg gtaggtaagg ggccgatgtg ggtcagtatc cggggggcgc agaactcata 



176581 cttgagtgac tgtgcactgc ccatgtctaa 
176641 gagatcttta caccagacgg cggaaaaacc 
176701 ggcatttagc ttccgcgatc taccacggcc 
176761 ataatataag gcggcatcag ataaatcagt 
176821 tcgacaaaac tgcgttgcag gcgggcgaca 
176881 atggccccaa gtacccgcac atcggccacg 
176941 ctttttaact gatcttctat tttactcacc 
177001 ctcaggttcg cagaggcgac tgcgcaagcc 
177061 ataaagcagc ccgcatcccc tttactgatg 
177121 gcggataagg tgagataacc tccggtgagc 
177181 atctgcgcat gttcacaggc aaacagtttg 
177241 tcggcaatca ataaaatttg atgtttatca 
177301 gggtgataaa ttcgcatccc gcccgcccct 
177361 atctcattgg cgtgctgggt tatcagcgcg
177421 gggtcatcaa agccgcactg tggcgcattg 
177481 tacaaactgt gcattgaatt ctgtgggtca 
177541 tggtaaccat ggcgcaaggt caaaatacgc 
177601 tgcaacgcca ttttcagtgc gacctcaacg 
177661 cattccagcg gcggcggtgt catcgcgacc 
177721 tgggtaatgc ccccgaacat cacgtgcgac 
177781 ttcagtaccg ggtggttata gccatgaatg 
177841 agccgccgac catcagccaa ttgcaattca 
177901 ggcagtggct cagtcattga ggtgtaggga 
177961 tcagaaggtg tcattaggta ccgttgtaaa 
178021 atcgagttta tttacactga cgaaactttt 
178081 ttatcatcac tggacagtag ggcaagccct
178141 attgtttgaa gcacaacaag ttcaccgcca 
178201 tacgttgttg tcgattaaaa ccggtgcctg 
178261 ctcccgctat aaaaccggtc tggaaagcga 
178321 atctgccaag aaagccaaag ccgcaggttc 
178381 aaatccccac gagagggata tgccttattt
178441 ggggatggaa acctgtatga cattgggttc 
178501 ggatgccggg ctggattatt acaaccacaa 
178561 cattattacg acccgcagct atcaggaacg 
178621 tgggatcaaa gtgtgttctg gtggcatcgt 
178681 cgggctgttg gtacaactgg cgaacttgcc 
178741 gttggtaaaa gtgaaaggca ccccgctgga 
178801 tatccgcacc attgcggtag cgcggatcat 
178861 cgggcgcgag caaatgaacg aacaaaccca 
178921 tattttctac ggctgtaaat tactgaccac 
178981 attgttccgt aaattggggc taaacccgca 
179041 acaacaacag gccttgaccg agcagctact 
179101 ggcggtctaa tgagctggca agataaaata 
179161 gcggcttacc gtactcgcca ggtcaacgag 
179221 gagcggcaat atctcaactt ctccagtaat 
179281 gtcattgccg catggcagca aggtgcacgg 
179341 catgtgaccg gttatagcca acctcatgct 
179401 ggctacccgc gcgccctgct atttatctct
179461 gctctgaccg atgctgatga ccggatattg 
179521 gaagccgccg cgcattcacc cgcgcagttg 
179581 ctgcaaaatc tgctgatcaa gccttgccag 
179641 tttagtatgg atggcgatag tgcgccattg 
179701 ggcggctggt tactggtgga tgatgcgcac 
179761 ggcagttgct ggttacaagg ggttcaaccc 
179821 ttcggcttaa gcggtgcggc ggtcttgtgt 
179881 tatgcccgcc acctgattta cagcaccgcc 
179941 gccgccttgc gccaagtgca acaaggtgat 
180001 cgccaatttc ggaccgcggc cgcgcattta 
180061 atccaaccgc tgttggtggg cgataatcag 
180121 gctgccggtt tatgggtgac ggcaatccgc 
180181 ttgcgcatta ccctcagcgc ggctcaccaa 
180241 ctttatggcc tctgtcactg agcacgcgca 
180301 cgttgataaa caggcgattg cggctgcatt
180361 ggctaattta cagcgagaaa cgggtcatcg
180421 ttttgtggtg ttggatgcgg gttgtggcac 
180481 gggtaagcgg gtcattgcct tggatctggc 
180541 gcaggtcgct gacgactatc tgctggggga 
180601 ggtggatatc tgctttagca atttggcggt 
180661 gagtgaattc taccgagtga cgcgccccgg 
180721 aggctccctt gatgaattag ggcaagcgtg 
180781 tgattttttg ccattgcaac atatccagac 
180841 aaccgcgtta tatcagccac ggtttcctaa 
180901 cattggtgcc acgcacttgc atcatgggcg 
180961 ggcggcgcta caacgggcct atgtcatgca 
181021 ggtttatggg gttatctatc gtgactaaac 



attgacggcg actgtggcgc tgatgtcgca 
gcgatgccgt taccttaaac ggtgaagata 
gcagtgagcc gggttaaggc gtgttgagag 
ttgccaaagg gcctgatcca caccccctgt 
ttaacgggtt ttttcatctc taccaccccg 
gtatcttcct gagccagcgg cagtaattcc 
tgctgttgcc acgaattctc tgctaataaa 
agcggattcg ccataaaggt tggcccatgc 
gtttctgcca ctgcacgggt ggtcagcgtg 
gctttaccta aacataaaat atccggcacg 
cctgttcggc caaacccagt ggctatctca 
cataacgccc tgacttcacg cagatagctg 
tgcaccacgg gttccaggat tacggcggca 
acaaaattag cgatatcttg cggatcccac 
gcaaacagat tctcggctaa atagccctga 
cagacggaca tcgcaccgaa cgtatcgccg 
tgccgccgct cgcccttcgc ttgccagtat 
ctcaccgagc cggaatcggc caaaaaaacg 
agttgtcggc acagctttac tgcgggcgga 
attttatcca gttgctgatg cgcggcgaaa 
gctgcccacc aagatgacat accatcgatc 
acaccttcag cgcccaccac ggggtaacag 
tgccagatat gctgttgatc aaaaactaag 
ccaaaatatc aatagaatgg ttgacagtat 
tctcttcttt tgagaggcca tcatggcaac 
ggccctgttt gataaaccgc tgttggaatt 
gcattttgat ccgcgtcagg tacaagtcag 
tccggaagat tgtaaatatt gcccgcaaag 
acgcttaatg caggttgagc aggtactgga 
gacccgcttc tgtatgggcg cggcgtggaa 
ggcgaaaatg gttgaaggcg tcaaggctct
gctgagtaaa cagcaagctc atcggctggc 
tctggatacc tctccagagt tttacggcag 
tttggatacg ttaaacgagg tgcgtgacgc 
cgggctggga gaaacggtac gcgatcgtgc 
taagccacct gaaagtgtgc ccatcaatat 
aaataacgca gaggtggatg cctttgagtt 
gatgccgtct tcctatgtcc gtctgtctgc 
ggccatgtgc tttatggctg gagccaactc 
accgaatcct gatgaagata aagatctgca 
gcagacggca acatcacacg gtgatcgtga 
gcatggggat accgcccagt tttataatgc 
gcgcaaggct tgcaacggcg acgtgatgcg 
ggggcgaatg ggcgctggct acagtcaggc 
gactatctag ggctaagcca aaatgacgag 
cgctatgggg tggggagtgg cggttcgggt 
cggttggagc agcaattagc tgattggttg 
ggttatgccg cgaatcaggc agtgctgacg 
gcggataaac tcagccatgc ctcactattg 
cggcgtttcc agcataatca gcccgaagcg 
ggccagacgc tggtggtcac cgaaggcgta 
gcggcgctgc agcaacagac gtcagccgcg 
ggtattgggg ttcatggtga aggggggcgg 
gagctgttag ttgtcacgtt tggcaaggcg 
caggaaccgg ttgctgaata cttgttgcag 
atgccccccg cgcaagcctg tgctctacaa 
gcattacgcc aacaacttca gcagcgtatc 
ccgctgcaac tgggggcgtc aaaaacggcg 
caaagtctga tctgggcaga gcaactgcgt 
ccccccacgg tgccaccagg aagcgccaga 
cccgaggata tcgatagatt attggaggtg 
atggggcgtt gggtgtttac cgccactgaa 
cagccgggcc gccgagagct acgacagcgc 
gttagtgcag ttgggccagc agcatacggg 
ggggcatttc agccagcatt ggcgcttgtt 
ggcaggtatg ttggattatg cgcgccaaca 
tattgaacat attccactgc ctgatcagtc 
gcaatggtgt tcagatctgg gggcggcatt 
cgggatcatc ctattttcta ccttggcaga 
gcaacaggtg gatggtcagc gtcatgtaaa 
ggcctgccaa tattatcgcc atcacctaac 
cgtgatagcg ctgatgcgct cgttacaagg 
tcaggctggg ttacaggggc gtcaacgtct 
atctggcggc tatccactga gttatcatat 
gttggtttat caccggtacc gatactgatg 



181141 ctgctggcta taaaccggta gcatccggta gccagatgac cgctgacggc ttgcgcaata 
181201 gtgatgccct ggcgttgcag gcgaatagca gccagaggct cggttactca caagtgaatc 
181261 cgttcacttt tttggaagcg acatcaccgc atattgccag tgaaagtgaa gggagggcga 
181321 tccctttgac ggcgttatca cagggtttac gtcaactgga accctccgct gactggatat 
181381 tgatagaagg ggccggcggc tggtttaccc cgctttctcc gcaggccaca tttgccgatt 
181441 gggtacaaca ggaacaattg ccggtcatta tggtggttgg ggtgaaactg gggtgtatca 
181501 accatgcgtt gctcacggca caggcgattc agcacgccgg gctgacactg gccggttggg 
181561 ttgccaatga ggtgacaccg gcagggagac ggcaggccga atatcaggcc acattgaccc 
181621 ggatgataac cgcgccattg ctgggaatta ttccctatct ttctgatata gaagaaaatc 
181681 cggtgacaac acgccgtgat cttgggcact acttggattt gacggtatta cgtgcagcag 
181741 aacgggaggc tgtcaacatg tgatacatat atggacgccg ctacaagcat gtgatgtggc 
181801 ttgcaatcag aggccatcag gtgcgattaa ccttatgtag ttaatctacc cgagtatggt 
181861 taatctaccc gaatatagtt gatctacctg aatgcgccgc ggccctgagg ttgctcagcg 
181921 ggctaaatgc tcgctaatca agccttcatt caattgatct aacgtcccct gagcgacatt 
181981 tcggccaccc gccatcaggc aaaagcgatc agcaatatgc tgaataaacg gcaagcggtg 
182041 ttcgaccaat aatagggtca tacccaaatt acggctcagt tgatgcaaaa ccgtactgat 
182101 atcttcactc agcgctggcg aggtaccgct gcccggctca tcaaggataa gcagttctgg 
182161 ctctgcgacc aaggctctgg caatggctag ctgttgttgt tgattacggg tcagctcacc 
182221 acctcgttgg ctacgtttat cgtgcagtaa tggaaatagc tcaaaaaccc agcccggaat 
182281 cgggtgggat ttgtggcggc ctgccagcac ggcgaccagt aggttatcct cgacgctcag 
182341 ttgggagaaa atttgctgac cctgtgggac ataaccaatg cctaacgcat tgcgtctttc 
182401 aacaggctgc tgttgcaagt tttgcggggg ttcatggtga ggctgccaga tcatggtgcc 
182461 gcttttaatg ggtagatgcc ccataatgca attaatcaac gtggtttttc ctacaccatt 
182521 gcggcctatc aggcacgtgc attggccgcg agggagctcc agattgatat cccagagaat 
182581 atgattattt ccatagtatt gatttacagc ggttaaactt agcatcagtg cgtctcctta 

182641 gtcgatccca gtggttcaca ggtgccccgg tgttctgtaa cggctagtga gagttaacca
182701 gcaaatctgc atacttggtc gcagtagtag agcataaaat acgacactga ttacgcgggg 
182761 ttgcgcgtgt aaacagcgct taaataaccc cttagtggca ttaaatgatg caaaagcgct 
182821 atttttacca ctaatacctt caagagtagg ccctatccag ccaattggtt tgctgaattg 
182881 ttttcagctg attcatctat ccttcagata atctacagag atcaagcaac tcttggacca 
182941 agttgccgaa cacagaagta aaatttgtat gttaatggcc tgtaaatgtt tttttgggct 
183001 aaggagagga agaataataa agttcgaggc agggaagggg ggccatttgg ggtggggtgc 
183061 acctaaatag tgccctggtg ctcgttctgg tgcaaaaagg aaccgctctc tgattttgga 
183121 taaggtgata atacgaataa atgctagtta taacattagc aagcagtgta gttagcacta 
183181 attgaaatat gcgttgaaat aaaaacatca aaaaaacact aaaaaaggcc gtccatctcc 

183241 atttcatcgc gaagttttgt ttttcgccag ctcccagcct ctgtggggtt gattgagtta
183301 tccactattc ctgtggataa ccttgtgcat taacaataca aaactaaaga gaggcgagag 
183361 ccagcgcggc ttaggctcct gttgtccgga ttctctcctt tttgttaaat gtatttaatt 
183421 tcaatgtgtt aaataaaatc aagcgcctgg tgttcattcg aaaaaatgat gagtaatgtt 
183481 taatcaatta tttggggata tcttgatcgc agtagcattt ggggataaaa atttttacgg 
183541 tataaatgtt accaaattgt cactgtaaag ggataaccgt tgcatcgata cgaaaattcc 
183601 tgtcattggg tgtttgaagc tggttttata tccagtatca taaggttggc aaaaatctcg 
183661 gggctttcat aattagcccc ctgacagtgt gtttctcccg cgggtagcga attcatgagc
183721 aaatcattca aactgcattc tgtgtttaag ccagctggcg atcagcctga ggctatccgt 
183781 aaattggagg aggggctgga aaatggtctg gcacaccaga ccctgctagg cgtaacagga
183841 tcaggcaaaa cctttacggt tgctaatgtt attgccgatc tgaaccggcc aaccatgatc
183901 ctggccccga acaaaacact ggcagcgcaa ctttacggcg agatgaaaga gttcttcccg

183961 gataacgcgg tagagtactt tgtctcctac tacgattatt atcaacctga agcttacgtg
184021 cccagttctg atacttttat tgagaaggat gcgtcggtca atgaacacat cgagcagatg 

184081 cggctatccg ccaccaaggc cttgttagag cggcgagatg tggtggtcgt ggcgtcggtt
184141 tccgccattt atggtctggg cgatccagat ctgtatctga aaatgatgtt gcacctaacc 
184201 cgtggcatga tcattgatca acgttccatt ttacgtcgcc tgtcagaatt gcaatacagc 
184261 cgcaacgatc aggtatttca acgcggtact tttcgggttc ggggtgaagt gatcgatatc 
184321 ttcccggcag aatctgatga gtgggcgctg cgggtagaat tatttgatga ggaagtcgag
184381 cgattatcca ttttcgatcc actcacgggt cagctacagc atgaagtccc tcgctttact 
184441 gtttatccta aaacacacta cgtcacccca cgggaacgca ttctacaggc gatggaagag 
184501 ataaaagtcg agctggcgga gcggcgtcag gtgttgttag ccaataacaa attattggaa 
184561 gagcaacgtc tctcccaacg tactcagttt gatcttgaaa tgatgaatga gctgggctat 
184621 tgctcgggta tcgaaaacta ttctcgctat ctctctggcc gtgggccggg tgaagcgcct 
184681 cccacgttat ttgactattt accggctgat ggtttgttga ttgtcgatga gtcccacgtc 
184741 actattccgc aaattggtgg catgtacaaa ggcgaccgct cacgtaaaga aaccctggta 
184801 gagtacggtt tccgtttgcc atcggcactg gataaccggc caatgcgctt tgaagagttc 
184861 gaagcgttgg caccgcaaac gatctatgtt tcggccacac cgggcaaata cgagctggag 
184921 aaatctggcg gtgacattat tgagcaagtg gtgcggccaa cgggcctact cgatccgttg
184981 atcgaggtcc gtccggtagc cacacaagtg gatgacttgc tgtcagagat ccgcatccgt
185041 gctgcgatta acgagcgggt actggtgacc acgctaacga aacggatggc cgaggactta 
185101 accgactatc tcagcgagca tggcgccaaa gtacgttacc tgcattcaga tatcgatact 
185161 gtggagcggg tggagattat tcgcgattta cgcttgggtg aatttgatgt gctggtgggc 
185221 atcaacttgc tgcgggaagg gttggatatg ccagaggtct cattggttgc cattcttgat 
185281 gcagataaag agggcttctt gcgttctgaa cgctccttga ttcagactat tggtcgtgct 
185341 gcccgtaacc tcaacggcaa ggcgatttta tacggcgata gaattaccgc ctcgatggaa 
185401 aaagccattg gcgaaacgga acggcggcgc gctaaacagc aggcgtataa tgaagaacgc 
185461 aggattattc ctcaggggct aaacaaaaag attggcgata tcctgcaatt gggccagcct
185521 tcaatgcgcg gcaaaggaaa agggcgcggc agtcataaaa tggccgatac cacccaatat 
185581 caatctttgt cgccgaaagc acttgatcag aaaatccgtg aactggaagc aaaaatgtat 



185701 ctgcggcaac agtttattgc aatatcttaa tttttattgc gacatcttaa gtttatctcg 

185761 ctgctttaag gtttgcttca gcgccttaag gtttatcgag atgccttaaa acttatttca 

185821 ataccttaag gccgaggggc agaccgttgg ccactggcgt acatcccacg tgaaatatga 

185881 cgagtgtatc cagaaatact tgtcgcactc atcgattctc agtaaggtgt gccgctaaaa 

185941 ttatgacgat acagataatt atctgtcagt gtactgttgt caggccatga gatatttatc 

186001 tcataacaca agagaaaaac gatgagcgag catttatcta aagatccgct gcacgggatt 

186061 accctagagc aattgttgac aaaattagtt gaaaactatg gctgggatgg gcttgctgat 

186121 cgtattcaga ttaactgttt tatcagtgac cccagcatta aatccagctt aaagtttttg 

186181 cgccgtaccc cgtgggccag gcaaaaagtg gaagcgctct atattgagat gagcgacaaa 

186241 gcggagtggt tgaaaggcca gtcttgagcc gcctgcctcg cgggttatta cgtgtattcc 

186301 atctatttta tggattttat ttacgccatt tgctctctct gcgagcgagc ggcgctgcgg
186361 gtgatgagcg gcaaacgtat tgtggggcat cggtaactta caccgagtct gcaccgttca 
186421 actgttgcaa ggtgttttcc aacgcctgac gcaataactc gcggtcatgg cggtgaggaa 

186481 tatcttttgc ttccagtggt tgttggacaa taattcggtc ttttacgcca ctgatatcga
186541 ttgttgggct gacaataacg gcatcaatga ttttacggcc aatcttgctt tccatgatgg 
186601 tcagtttgtc ttgtaatgac agcgcagcgg cggcggggct taattcgcgg cccagattcc 
186661 caatgtaaat catactggca gagctgcggc gcaatgcctg agttaactca tccagcagca 
186721 gtaatggcat tagactggtt agaaaactgc cggggccaat caaaataaca tccgcctgat 
186781 taatggcttc gacggcttct cgggtggcgt gtacaggggg cgatagcaac atttcttgtg 
186841 gcatctgcgc caattgatcg atattcactt caccataaat agggtgccct tcatggtcaa 
186901 tagccatcaa atctaccggt tgttcggaca ttgggatcaa caaggcatcc acttttagca 
186961 agctacgcac caaattgatc gcttcaatgg ggcgaatgct caggtgatct aatgctttca 
187021 gcatcaagtt acccaaatta tgccctgcca gctcaccatt accggtaaaa cggtattcga 
187081 acatcgcgga ggcgacactt ggctcggtaa tcaactgatt aaggcaattg cgagtatctc 
187141 cccaggcaat tcccccctct gagcggcgta tccgcccagt tgaaccgcca ttatcggtgg 
187201 tggtaacaat gcctgtcagg cgagagccta aggaggagag agacgacatt acccgcccca 
187261 aaccatggcc tccgcccaga gcaacgaccc gttccaaatc cgctaatgtg cgatttcgca 
187321 tatttctcct gttgttatat tcggtgcctg gcgattatcg cgcatacagt ttttatcttg 
187381 tgttgcaacg ctaatgaacc tggctgccat gctaatggcg ttggggaccg ttaatcataa
187441 aattgccgca taagttaacg gatttgtgat tgaaacgcga tataagccat gatttttgtg 
187501 attttaaaca aggcatcaat cccggattca gtaagctgaa taacaataac gttgaataga 
187561 taagtcgttg tatataataa tatatagcga gaaactatca ttggctatcc tgccatttta 
187621 acgttacaat ctttaggaaa agtagggttt ttaatggtta gttgttgttt attgccttac 
187681 tttttaacgc catgatttag cagtaaaata taagttatag ctgaacataa actcctagcc 
187741 ttagcgtcta cgttgtcttt tttgtattgg caatatgatg ttctggccgt ggccttaagc 
187801 caccagggtg caaggtagaa atgcctgcat ctcccgtatt tggaaaggtg ttatggtaca
187861 actcactgac gcatttgcgc gtaaattcta ctatttgcgc ctatcgatca ctgatgtgtg
187921 caatttccgt tgcacatatt gcttgccaga gggctaccgc ccagatggtg tcaaaagttt 
187981 tttgagcctt gacgaaatca atcgcgtcag ccgcgccttt gccttgttgg gaacggaaaa 
188041 aatccgcctg acggggggcg agccttcgat gcggcgcgac ttcactgata ttattgccac 
188101 tatccggcaa aatcccgcta tccgtacctt ggccgttaca actaatggtt atcgtttagt 
188161 tcgtgatgtg gcgcaatggc gtgatgcggg cctgacggca attaatgtca gcgtcgatag 
188221 cctggatccg cgacaatttc atgccatcac cgggcaggat aaattttatc aagttatgca 
188281 gggcattgat gccgcatttg atgccggttt tgacaaagtc aaagtcaacg ctgttttgat
188341 gcgcgatgtt aatgatcgcc aactgagtgc attccttgac tggatcaaac ctcggcccat 
188401 tcaactgcgt tttatcgagc tgatggaaac cggcgaaggc ggtaacctgt tccgtaaaca 
188461 ccacgtttct ggtggcgtca tccgccaaca attactcgaa caaggctggc agcagcaaga
188521 tcgggcgcgc agtgatggcc cggctcaggt gtttcaccac tcggattacc aaggtgaaat 
188581 tgggcttatc atgccgtatg aaaaagactt ttgcgccagt tgtaaccgcc tgcgtgtttc 
188641 tgctctcggt aatttgcatt tgtgcttatt tggtgagcag gggattacat tacgtgatct 
188701 gctgggcagt gatgatcaac aagatgagct gatagcgcgg attcaaagtg cgttgcaaac 
188761 gaagaaacag actcattttt tgcatcaggg taacagcggt ataacgcaaa atttatcctt 
188821 tattggcggc tgatgtgagc gaattagcgc cgatgagctt acatcagtaa gtgattcggg 
188881 caagtgaatg ccgctaaccc ccctgtaatg tcaagaacga agggatttta ttgatgaccc
188941 aactgacaca cataaacacc gcaggcgaag cccacatggt cgatgtctcg gcaaagaatg 
189001 aaacggtccg cgaagcacgc gcagaagcgt ttgttgacat gcaggccgca acactggcga
189061 tgattatcga tggcagccat cacaaagggg atgtgttcgc gacggctcgg atagccggta 
189121 ttcaggcagc caaaaaaacg tgggaattga tcccgctgtg ccatccgttg ctgctgacta 
189181 aagtggaagt aaaactggaa gcacagccag agcataaccg ggtgcggatc gaaacctgct 
189241 gccgcctgac cggtaaaacc ggggtagaaa tggaagccct tactgcggct tcagtcgcag 
189301 ccctgacgat ttacgatatg tgcaaagcgg tacaaaaaga catgatcatc ggcccggtac
189361 gcctactgac aaagagtggc gggaaatcgg gtgatttcaa ggtggatata tgattcagat
189421 tctatttttt gctcaggtac gtgaactggt tggggtggat aaattacagc tagcggccga
189481 gtttccaact gttgaagcgc tacgtcaatc cctgtgtcaa cggggtgagc gctggcagtt
189541 ggcccttgaa gagggcaaat tactgacggc ggttaaccaa tcattagtga gtgctcagca
189601 tccactggcc gcaggtgatg aagtggcgtt tttcccgccg gtgaccgggg ggtgatgatg 
189661 gaaaatacgc gtatccgtgt cggcgcagaa gcgttcagtg tgggtgatga atacacatgg 
189721 ctgtcgcagt gcgatgaaga cggcgcggtg gtgaccttca ctggcaaagt ccgtaatcat 
189781 aatctggggg ccagtgtcag tgcgctaaca ctggaacact accccggtat gacagaaaaa 
189841 gcgctgacgg agattattgc cgacgcacgc agccgttggt cattgcagcg ggtatcggtc 
189901 atccatcggg tcgggccact attccccgga gatgaaattg tctttgtggg tgtgaccagc 
189961 gctcatcgca gtatggcatt tgaagcagca gaatttatta tggactacct gaaaacacgg 
190021 gcaccattct ggaagcggga agccacggtc gagggtgagc gctgggttga atcacgggac 
190081 agcgaccata ttgcggctaa acgctggtaa tctaggcgca gaacagcata aaaatcacgg 
190141 ctaaatgtct atttttactc ctttgtttcc gctatttcta tggtaccctt aaaaagacgg 



190261 ctatccacgc tctaatggtt cgattgtcga acgtgccggc tcaggcattc aggcctatat 
190321 ggcgcaggta tacggttgga tgacctgtgg cttgttatta accgcagttg tcgcttggta 
190381 tgccgcgaac actccttcaa tcatttttgc tttgcaatca aaccaaatat tgttctttgg 
190441 cttgattatt gcccagcttg gtctggtttt tgttatttcc ggtatggtta accgcctgag 
190501 tggaaccgca gcaacatcac tgtttatgct gtattccgcg ctgaccggcc tcaccctttc 
190561 cagcatattg ataatgtaca cgggcgcgtc tatcgccagt actttcgtta tatgtgccgg 
190621 gatgtttggt gccatgagct tctatggtta caccaccaaa cgtgatctga gcggtatggg
190681 cagcatgttg tttatgggcc tgatcggtat cattctggca tcccttgtta acatctggct 
190741 gaaaagccca gcgttgatgt gggttgtgac ctatatcggt gtgctcgtgt ttgtgggcct 
190801 gactgcctac gacacacaga aactgaaaaa cttgggcgca cagttggatg tcaatgataa 
190861 agattcattc cgcaaatact ccattgtggg cgcactgact ctgtatctcg atttcattaa 
190921 cctgttcttg atgttattac gaatcttcgg taatcgtcgc taagcgggtt gattaaacgc 
190981 tatggcgcgg taagtttgcc caccgacatg gtttgagata gtaaaggaca ccaaaatctg 
191041 gtgtcctttt tgttgtgccc agaaaacccc cagctaggct gggggttcag taaagctttc 
191101 agctttgggt cagttataaa aacccctttt gatttgttaa aacagtttgc ggtctggcaa 
191161 ctgcaaatgt tcaacaagaa atcaaaaggg ggtcccaatg agggatgaaa agagcttagc 
191221 gcacacccga tggaactgta aatatcatat agtttttgcg ccgaagtacc gaaggcaggt 
191281 gttctacagg gaaaaacgca gagcgattgg cagtatttta agaaaactgt gcgaatggaa 
191341 aaacgtgaat atcctggaag cagaatgctg tgtggatcac atccatatgc ttctggagat 
191401 cccgcccaag atgagtgtct cgggatttat ggggtacctg aagggaaaga gcagtctgat 
191461 gctttatgag cagtttggcg atttgaagtt caaataccgt aacagggagt tttggtgtcg 
191521 agggtattac gttgatacgg tagggaaaaa cacggccagg atacaagaat acataaagca 
191581 ccaattggaa gaggataaaa tgggtgagca actctcgatc ccgtatcccg gtagcccgtt 
191641 tacgggccgt aagtaatcca tagatgcaaa tgtcagatcg cgatgcgcct gttagggcgc 
191701 ggctggtaac agagccttat aggcgcatat gaaaaacctc cggctatgcc ggaggatatt 
191761 tattttttct ttcaccagtg gtcatcaccg tttctatgta gaagttgccg tttgatgcgc 
191821 tgagcaacgg tttgtatcga tgatcaatgt ggttaacgat cctctacgta tttcgtgatt 
191881 ttctttacgg atatgatgtt acgggtaatg aatgccttac gggagatgta cgggcaggag 
191941 tatctccggc gataaaatac gcggcgttgc tttttgccag tggttggcag ccacgaatac 
192001 gatccgctat tttctctgca atcatgatgg tcgtggcatt gaggttgccg gtaatgattt 
192061 gcggcatgat agacgcatcg accacccgta gcccttgcac accgtgcacc cggccttgcc 
192121 catcaaccac cgccatcttg tcgtcaccca ttttacatga gcacgaagga tgatatgccg 
192181 tctcggcatg ttcgcggatg aaggcatcca gttcgtcatc gttttgaaca ttggccccag 
192241 ggcttatctc tcgaccacga tatggatcca gtgcaggctg cgcgataatc tcgcgggtga 
192301 tgcggatagc atcgcggaac tcatgccaat cttgttcgct cgacatgtaa ttgaacaaga 
192361 tacttggatg ctggcgcgga tctttcgatt ttacctgaat gcggccacgg cttggtgaac
192421 gcatcgagcc aacatgcgcc tgaaaaccat gctcttttac ggcgttgctc ccattgtaat 
192481 taatggcgac gggcaggaag tggtactgaa tattcggcca ggtaaatgca tcgcgactgc 
192541 ggataaaccc accagcttcg aattgattgc tggcacccac gcccgtaccg ttgaataacc 
192601 attcgatacc gattttgggc tgattaaacc acagcagagc agggtaaagt gacaccggct 
192661 gtttacagct atattgcaga tacatctcaa ggtgatcttg caaattctcc ccgacgccgg 
192721 gcaacgcctg caccaacgga atatccaggc gttgcagtaa ttccgcaggg ccaatccccg 
192781 agcgctgtaa aatttgtggt gacgcgatag cacccccaca caataaaact tcacggcgag 

192841 cataggcggt ttggccggtg ccagcatcac ctttgaggta gcggacacca gtggctcgtt
192901 tcccttcaaa taggatccgg tccgtcaggg cgtgcgtaat aatggttagg ttattacgtg 
192961 ggcgcgcctg atcaagatag ccgcgcgcag tactggcacg acgacccttc ggcgtcacag 
193021 tacgatccat cgggccgaaa ccctcctgct gatagccatt caggtcatca gtacgcggat 
193081 aaccggcctg cacacccgca gcgaccattg catgaaataa cgggttgttg ccgattttag 
193141 gtgtggtcac gctgacaggg ccttcaccac cgtggaaatc atttgggccg atgtcgcgtg 
193201 tttctgcctt gcgaaagtac ggcaggcaat caagataact ccagtcctcc aggcccgata 
193261 aactggccca gtggtcaaaa tccatggcgt tgccgcggat gtaacacatg ccattaatta 
193321 atgatgaacc gcccaaacct ttaccgcgtc cgcactccat gcggcggttg ttcatatgcg 

193381 gctccggatc ggtttcataa gcccaattat aacgtttacc ctgaagtgga aaggcgagcg
193441 ccgcaggcat ttgtgtccga aaatccagcc gatagtctgg tccacccgct tcgagcaaga 
193501 gtacggtgac atccgcatct tctgttaaac gggctgccag cacgttgcct gcagaaccgg
193561 caccaatgat gatgtaatcg tattccattg gatgtccctc tggtgttgga ttaaaaaaca 
193621 gaattgaagc tgcctaactc aacctgaatt gatttgattt gcgtgtaatg ttccagcgtg 
193681 ctgataccgt tttcacggcc tacaccagaa tgcttatatc cgcctacagg catctctggt 
193741 gccgattcgc cccaagtatt gatccagcag atacccgctt gcagttgatg aatcacgcga 
193801 tgggcacggt tcaaatcctg tgtaacgacc cccgccgcta aaccgtactc ggtatcattg 
193861 gcgcggcgaa tgacttcctc ttcgctttga taactaagaa tgctcatcac cggtccgaag
193921 atctcttcac gcacaatttg catgtcatca cggcagtggg tgaataccgt aggggcgaca
193981 taggccccgt gtgccagtgc gccctcggtc agactgtatc ccccgatcag cagggttgcg
194041 ccttcccgtt taccgctgtc aatgtaacgc atcactgaat cgcggtgctg gaagctaacc
194101 aacgggccaa agttggttcg ttcatcgctt gggtcaccaa tatgaatccg cttgacccgt 
194161 tcaacgattt tctgctcaaa cgcggcctgt aacgcctgcg ggacaaaaac tcgcgtgccg 
194221 ttggtgcaga cttgtcctga gctgtagaaa ttggccatca tcgcaatatc agcggcctta 
194281 tcgagatcgg catcagcaaa aataattagt ggggactttc cgcccaactc catggtgaca 
194341 tctttcaggg tcgatcccgc ggcgttagcc atcacctttt taccgctggc aatccccccg 
194401 gtgaaggaga cctttgcaat acccggatgc tctgtcagca tctgcccaac ctggtcacca 
194461 ctgccggtca atacgttaaa tacgccagcc ggtaaacccg cttcggtgta gattcccgcc 
194521 aatttcagtg cggtcagcga tgtcacttca cttggtttga agatcatagc gttgcccgcc 
194581 gccagcgccg gtgcagattt ccacaaagcg atctgaaggg gatagttcca ggcaccaata 
194641 ccggcaacca cgcccagtgg ctcacggcgg gtatagacaa atgcgctgcc acgtagtggg 
194701 atctgttgcc cttcaagggc ggggattagg ccagcataat attccagcac atcggcaccg 



194821 gccgccagtt catcattacg atcacgcaag atatccacgg cccggcgcaa gatgcgtgaa
194881 cgttccatcg cggtcatcgc tgcccacaca ggctgacctt gtttggcggc ggctaccgcc
194941 ttatcaacat cctgtgcgtt ggcggcctgg agttgtgcaa tgcattcgcc attcgcgggg
195001 ttcacggcat cgaaagtatc accactggtg ctgtcggtgt acgcgccgtt gatataaagt
195061 ttttgcaagc cgtagcggga catagtctct cctgtctttt cctgtacggc taaataccgt
195121 tcagtgcgtt cgaattttgt tcagcactta tatttaatgt tcaattgctg ctcaatatag
195181 ttatggataa tcagcagcgc ttcctgacgg ttaaagtgtt cgccactcaa tgcactgcgt
195241 agccataatc catcaatcaa acccgccata cctttagcgg cgagctgcgc ttgctctcgc
195301 ggtaaacaac gtttgaattc agcacataaa tttgaataga gtcggcggtt attcacctgc
195361 tgtaaccgat acaattgtgg gctatgcata ctgcttgccc aaaatgccag ccaggttttc
195421 atcgcggcgc tattgatctg gctgtcatca aagttgcctt cggcgatcgc ccgcaaacgg
195481 gcacgaggat cgttgacgga cagggcagcc aagtgctgct tcaccgcctc gccaaggtgc
195541 cttatcagat acctcattgt ggcttctagc aggccatttt tgtcacggaa atagtggctg
195601 ataatgccgt tggatacgcc agcacgcttt gcaatctgcg caatagacgc ttcatgcatt
195661 cccacttcat tcaccgcggc cattgttgcc tcgatcaatt gctgccgtcg gatcggctgc
195721 attcctactt ttggcacggg tctggactcc tttaaattca atcaattaat actgttattc
195781 atgctgccta ttggcagcta gttacccgtt acgaaacact attaaacttt ttttgattga
195841 acgttcaata aaaaatgaat atattttatt gctgataagt taaatttttg tgtgaacatt
195901 gattgtaact tgttgatgtt agaagttaat cacgttggta tcggataggt ggtttttagc
195961 tcaccaataa cgtcagctca tcaaaaattg gcggtaagtg cactcagcgg atcgcctgcg
196021 agcagtctct tgatatgcgc taaggggcgc ttacccacca aatgtgatga gccttacttt
196081 tttaaaaatg caggggaaac gatggtgact tcagacatcg acacaaaacc gcaaaaggat
196141 acgttaaatc cggtggtgtt ctttacgtcc gcggggttga tcctcgcgtt ctgtcttatg
196201 actatttttt atactgacct ctctaatcgc tggattggta ttacccttaa ctgggtctct
196261 gccacttttg gttggtacta tctgctggcg gccacgctgt atatcgtttt tgttatttat
196321 attgccactt cacgctatgg ctcaatcaaa cttggccccg agcagtcaaa gcccgagttc
196381 agcctggtca gttgggcggc aatgctgttc gccgccggta tcggtattga cctgatgttc
196441 ttctcggtag ccgaacccgt gacgcaatat atgatgccac cagaaggcca gggccagact
196501 ctggaagccg cgcggcaggc gatggtctgg acgctgttcc actatgggct gacaggttgg
196561 tcaatgtatg cattgatggg gattgcactc ggttacttca gctatcgcta taacctgccg
196621 ttgaccatcc gttccgcgct ttaccctatc tttggtaagc ggatcgacgg gccaatcggc
196681 cacagtgtgg atatcgccgc cgttatcggg acgattttcg ggattgccac cacattgggg
196741 attggggtgg tgcagctcaa ttacggcctc aatgtcctgt ttcatatccc ggaaagtctg
196801 gcggttcagg ccgggttaat tctgttgtcg gtggtgatgg cggcagtatc ggtgacatcg
196861 ggtgtcaaca aaggcattcg gatcctctct gaagtgaatg ttctgctggc gttaggcctg
196921 atcctattcc tgctgaatgc gctggtgctt aacgtggggg attatatcaa tcgctttttg
196981 ggcatgacgc tcaacagttt tgcttttgat cgcccaaccc aatggatgaa tagctggacg
197041 ctgtttttct gggcatggtg ggttgcctgg tcaccctttg ttggtctgtt tctggcgcgt
197101 atttcccgtg gccgcactat ccgccaattt gtggtcggta cgttgatcat accctttgtc
197161 ttcaccttgc tttggctgtc tatcttcggt aatagcgcgt tgtaccagat cctgcatggc
197221 aatattgctt ttgctaatga agtgatggag tttccggaac gcggtttcta tagcctgtta
197281 gcacagtacc ccggtttcac tttgagtgct tcggttgcca cgatcaccgg cttgctattt
197341 tatgtcacgt cagcggattc cggttcgcta gtgctgggga atttcacctc tcgtctggcc
197401 gatattaata atgattcccc caactggtta cgtactttct ggtctattac tatcgggtta
197461 ctgactttgg gaatgttgat gaccgatggt gtcgccgcct tgcaaaacgc aaccgtcatt
197521 atggggttac cgttcagttt tgtgattttc ttcgtgatgg tcggcttgta taaatcgcta
197581 aaaattgagg attaccgcaa agccagctcg caacagacac caacgcctac gctgatttca
197641 ggcaatgaag tactaaattg gaaacaacgt ttatcgcgag taatgaacta cccaggtacc
197701 acttataccg cgaaaatact ggataccatt tgccgcccag cgatggagga agttgcgcag
197761 gaactggccg tacgcggtgt taaagttgaa ttcaacgaac tgccgccgaa ggaagatgaa
197821 caactgaatc gtcttaatca tctggagttg ctggtacatc tgggtgatga gcagaacttc
197881 gtctaccaga tctggccgca acgttattct gtgccggcat tcacctaccg tgcacgctca
197941 gggaaatcac actattaccg ccttgaaact tatctgcttg aaggaacgca ggggaatgat
198001 ttgatggact acagcaaaga acagattatc aataacattc ttgaccagta tgagcgccat
198061 atgaacttcc tacatattca aagtgaagcg ccgggaaata tcatgacatt tcccgatcaa
198121 taatccgcaa tagtggggtc tcttcatctg gcgagtcgat gaagagatca ccaccgctta
198181 atcaaaatca tctgccgctt tttctggctt aactttatta atgtaatcgt ctactatcac
198241 cgcctgactt aggtaataaa tacatataat taagattgcg gtttgattat cagacaagat
198301 tattattgaa ataataattc aggttgctcg cattaaaata attggatagg tgatatgcaa
198361 aagcttcttt tttccggtcg tatcaatttg aaaatgatcc gctatttttt agccgtttca
198421 gacgagttac acttcggaaa agccgccgaa cgccttcata tctctcagcc gccgttgagc
198481 ctgcaaatta aagaattaga agatgcatta ggtttccctc tctttatacg agacagccgt
198541 aacgtggtgt taaccctggc gggcgaaatg atgaaggctg aaatgaaaga tgtatttgat
198601 aatatagaga attcattaag ccgcgtatcc tatattgccc gccatgaaca aacccacctt
198661 aatattggca ttatcggttc cgccctttgg catcagctac tagagaaatt caagcattat
198721 aagacagtca atcctaagac cacctggtcg ttacacgagc tgccacccag taagcagtat
198781 gaagccttat taaataagaa gcttgatatt gggttctggc gatgtgccga tctggaacag
198841 aatccggcgc tgatctatag gcgggtagaa aagcagcgtg tcgccgtcgc cgtgtctaat
198901 gagagcttat tggtgaataa aaaaatactc acgctgaaag atctctcagg gcaaacgctc
198961 atatttctca cctttaatca ctctggctac tctaaaaatt tatacaatag ttgtcttaat
199021 gctggctgca cgccgcaggc tatttatcag tttgatgaac ctcagaccca gttggctttt
199081 gtgaatagca accttggtat cgccttagtt cctgaaagta tgcaggaaat cccatggcct
199141 aatattaaat ttattccgct gaaagaagat ctttctgccg atctcttcgc tgtttatcac
199201 cctgatagcg tgactccggc actaaataag ttgcttgcgt tattttgagt gtgatttttg
199261 aatctgtggc attcactggc cggtattact tggctttaac taaaatagcc atcgcgataa



199381 tagtgacaat gactgacagc gggatcaggt acacatagcc tgatactttc gtcggcgaaa 
199441 gcacaatgca ggctttttgg aacagaaaga aacttaatgc ggtggcaaag gtggcaagat 
199501 ataaaacgcc gctccaggtg atgacgctaa tattgcgcca ttcaatttct ggaagctgat 
199561 aggcgatgat caccgttagc agtagtgtgg cgcatatcaa gctccaaccg gtaagcacca 
199621 aggcgtgctc accccgatgt aacttcttca ccacaatagg attcagcgcc atgcctaaac 
199681 aaccaaacag gaacaggtaa tcagatggca tcagcgacaa gcgtaaaacc tgagacaagt 
199741 cacctttaaa aatgattaat aatgccccta aaattccaag caataatgcg gctaccacag 
199801 gccacgtcga ttttgcgttc aaaaacacca cactcattac catgctcatt aatggcacag 
199861 tggtgtataa cgcactcgaa tttatcgcgc tggttgtctg cagtgaaaag atcatacagg 
199921 caaaatagag cagcggcggt aggctaatca gagtataacg gcccaaatct cggtaatgag 
199981 gtaacgttag taaaccctga ctcaccagca atacaacaaa aaacaggctt gcaatagcgt 
200041 atctcagcca agtaattacc attggtggca gtgcgttgct gatggtggcg ctggcaataa
200101 aggaaccgcc aagtagagcg gtaaacccca acatttgcaa atgcgcttta agtactcttg 
200161 ttgtcatgtg cttacccgtt accctcttgt gatcgatgat aattgcgtgg aaacacattc 
200221 tgaaccatta cattttcagt atggattagc gggtttttag tattggttat gcaataaccc 
200281 actacttatg atggggtaga ttagatctat ctctaccttt cgtctttgac gttgcggcgg 
200341 tgttagcagc atttagtctc ccctatcact cagggagtta gctcatcact ttgcgctcag
200401 gtgtagtaac gccaattact cgaggtacat atttagccgg tttatttgag cgttattaaa 
200461 ataatcaagg ggttttatta tgacaacagt aaattcaaat ataaacgttg atgctgattt 
200521 caatgaatta ccatttcaag ccgttaaata tattcagaaa ataaaacccg gatttaagcc 
200581 tcaaatcgcg tttattttag gttctggatt gggtgattta gttgatcaaa ttacgaatga 
200641 caccacaatt agctacgctg atatccctgg cttccctgta agctcggttc acggccatgc 
200701 tggcgaactg gtgctgggcg atctgtgtgg tgttcctgtg atgtgcatga aaggccgtgg 
200761 ccacttttac gaaggcaaag gcatgagcat catgaccaat ccggtgcgca catttaaatt 
200821 gatgggctgc gaattcttat tctgtactaa tgccgccggt tcattgcgcc cagaagtttt 
200881 gccaggttcg gtggtgatgc taaaagatca cattaatacc atgccgggca cgccgttagt
200941 tggccctaat gatgatcgct ttggcccacg tttctttagc ctagcgaatg cctatgataa
201001 agatctgcgt gctgatatgg ctaaaattgc tcaacagctt gatatcccgc tgaccgaagg 
201061 tgtatttgtc tcttacctag ggccttgctt tgaaacaccg gctgagattc gcatgatgca 
201121 gatcatcggc ggtgatgttg tgggtatgtc tgtcgtgccg gaagttctct ccgcagccca 
201181 ttgtggttta aaagttattg ctctgaccgc gattactaac ctggctgaag gcctttctga
201241 tgtcgttctg tcccatgaac aaaccttaaa atttgctaaa gtggcatccg tcaatttcac
201301 taaattgatt gaagcattct taaaaagcaa agcgctccgt taatcgataa agacgtcata
201361 aatattttat gccccctatt atgccagccg caaatagggg caccgtaaat gagggatata 
201421 ccctataggc tcaatgcgaa aaaatatatt cctcgacgca tttgaatcac ggagaataaa 
201481 aatgataata aaaaaacggc ttaaagttat gtttttcctt cagttcttta tctggggtgc 
201541 ctggttggtg acgctgggcg cttacatgat gaatacgctg aactttaccg gtgcacaggt
201601 gggtttagtc tatggcgcta aaggcatcgc ctgtatttta atgccgagtc tggtcggtat 
201661 tattgccgat cgatgggtga aagctaatat tctctatgcg tgttgccacc tgctgggggc 
201721 ggttgcgctg ttgatcgccg ccaatgtttc agaccctaat atcatgttct ttgtgatgtt 
201781 gttcaatgcg atggtctata tgccgacgat tgcactggct aataccattt cttatatctg 
201841 tttagaaaaa gcggggctgg acacgattaa agatttcccc ccggtacgcg tctttggcac 
201901 cattggtttt atttttgcca tgtgggctat cagcctctcc ggatttgaat tgagtaatat 
201961 ccagctttat atcgcttctg gtgctgcgct ggtgttggcg atatatgctc tttcattacc 
202021 aagctgtcca acatctaata ttaaaaaaga ccgctcgtgg gttaatattt tagggcttga 
202081 tgctttcgta ttatttaaac aaaagaaaat ggctattttc ttcttgttgt gcccagaaaa 
202141 cccccagcta ggctgggggt tcagtaaagc tttcagcttt gggtcagtta taaaaacccc
202201 ttttgatttg ttaaaacagt ttgcggtctg gcaactgcaa atgttcaaca agaaatcaaa 
202261 agggggtccc aatgagggat gaaaagagct tagcgcacac ccgatggaac tgtaaatatc 
202321 atatagtttt tgcgccgaag taccgaaggc aggtgttcta cagggaaaaa cgcagagcga 
202381 ttggcagtat tttaagaaaa ctgtgcgaat ggaaaaacgt gaatatcctg gaagcagaat
202441 gctgtgtgga tcacatccat atgcttctgg agatcccgcc caagatgagt gtctcgggat 
202501 ttatggggta cctgaaggga aagagcagtc tgatgcttta tgagcagttt ggcgatttga 
202561 agttcaaata ccgtaacagg gagttttggt gtcgagggta ttacgttgat acggtaggga 
202621 aaaacacggc caggatacaa gaatacataa agcaccaatt ggaagaggat aaaatgggtg 
202681 agcaactctc gatcccgtat cccggtagcc cgtttacggg ccgtaagtaa tccatagatg 
202741 caaatgtcag atcgcgatgc gcctgttagg gcgcggctgg taacagagcc ttataggcgc 
202801 atatgaaaaa cctccggcta tgccggagga tatttattta tcaaaatacg gccttaatct 
202861 gattaaaaaa tgtatatata accattattg aatttaacgt tatctaatta aattaaattg 
202921 tagttaaagt ctataagggc taattactaa agtttctata ttaataagtt ttttcattaa 
202981 acataaaata atttaattta ttaaaaatga taattttgtc tttttttgaa ataataacaa 
203041 ttaagtttta atgtattatt gttaatactg attgagaata ttatctaatc agaaataaac 
203101 gaataaaaaa ctgtttactc atcattaata taaggatttt ttgtatgcca caagttaata 
203161 acatatcgac aaacaatata cactcagccg gctttaacaa ctcaaattca attcaaaaat 
203221 atactggagc agtctcatct ataagtgatg atttacgtat taataatgaa aagtgcaaga 
203281 gtgacatagg aactataagt ggtgatataa aaattaatcg ccattctgca gtctacggta 
203341 atgtgaattc agtcagtggc gatattactg taaaaaattc tatagtagat aaagacatca 
203401 ccacagtaag tggtgatgtt aacgcggtga attcgactat aggaaaaaat ataaaaacag 
203461 tcagtggttc gattgaggtg gaacaatcaa ctgtaagcgg caatttggaa acaacaagtg
203521 gtgggataga tatagataca acaaaaatta acggtaatgt gcatacgaca agtggttcaa 
203581 tttcgatgaa cgattcaact attgatggta gcgtaacatg taaggcgggg tctgtaacaa 
203641 ttgtcaattc aacaattaaa gaaagcctta atgtaaccag tgagaaaata attgttggta
203701 cagcatcttg tattggaaaa attaacattt ccccccctga atctgtaaat tttaacatca 
203761 tgaattttgg gaacgatagt atagtgatgg gaatgaggaa tttttgtatt tcaggagagg 
203821 taaattttac cattactaat ggtaaagtat ttgttaatga acaaagggtc ggccatacag 



203941 gtgttaatga tattgttttt tatactaaaa agtgccatat cattttagaa ggaaatgcta
204001 aatataacgg agaaaaaaag gatggcatgc aattcaccca tgttaacgca ccgaaaagtc
204061 atgcttatgc gtaaaatacg tatttagact atctctttgg caatgtccat tttcataaat
204121 gggcattgcc catgggtatt ttaccacacg actttaatag ccgattttca gtgtggtaga
204181 acaaagatga ccaaacaaaa aattttaagg ggccaagaat attattattt tttactctac
204241 gtaatctgta ttattcagtg ttatctcttg tttttatccc cgtttccctc gtatcattgc
204301 cagccttttt tgacattata cgatatagaa ttaggctggc ggtcatgaca aataaaggct
204361 ctggataatg agtatgcgag taatactggc accgatggaa ggtgtattag attcgctggt
204421 gcgtgaattg ctcagtgaag taaatgatta cgacctttgc atcaccgaat ttttgcgggt
204481 ggttgatcaa ctgctgcctg ctaaatcttt ctaccgccta tgtcctgaat tgcacaatca
204541 aagccgtacc cagtctggca cattagtacg tatccaatta ttaggccaat acccagagtg
204601 gctggcagaa aatgcggctc gagcggtagc gcttggttca tatggtgttg atctcaattg
204661 cgggtgtcct tcaaaactgg ttaacggcag tggcggcggg gcaaccttgc taaaagatcc
204721 agaactgatt tatcaaggtg caaaagccat gcgtgcggct gttccggccc atctccccgt
204781 cacggtaaaa attcgtttag gttgggattc tggtgatcgc caatttgaaa ttgctgatgc
204841 cgtgcagcaa gcgggggcga ccgaattggc tgttcatggt cgaaccaaag aagatggcta
204901 tcaagctgaa cggattaact ggcaagccat tggtgagatt cgccagcgcc tgacgattcc
204961 agttattgcc aatggtgaaa tttgggatta tcagagcgca caagagtgca tgaaagtgac
205021 cggctgtgat gccgtgatgc taggccgtgg cgcgctgaat gtgcctaatt tgagccgggt
205081 ggtgaagtat aacgaaccgc gtatgccgtg gctggaagtg gtcaaactac tgcaaaaata
205141 tgtgcagttg gaaaagcagg gcgataccgg cttgtatcat gttgcgcgca tcaaacagtg
205201 gttggggtat ctacgtaaag aatataccga agcgacagat ttattcggtg aaatccgcgc
205261 attaaaaaat tcgaaggata ttgcgctggc aattcagcgt attaatcgtt agctcaggaa
205321 aataatgttt caattgaata ataatgtggt tggaatataa acgtattgcc gagagtaacg
205381 ctatgttatt gttttttctt gcggctgata aatttagcta cccctagcat gaataatgca
205441 ccgataagga tgattaacac atcaacccag aacgtcaatg aggccatttt ttcatcgtga
205501 atatctgcta aacccgcagc aatcaagaat acaccaaaga taataccaaa aattgttgtt
205561 cccatgttcg cccccagagt gccgttttat ctgaaaatag atgtggacca ttatagggtg
205621 atctttaatc cgacaggcgt tttttatcat tattgagcgt agatttagct caaaacatca
205681 gatattttag ataacctaat tttataaccg gtgagtctgg ctaaatttat tagaacagaa
205741 atgtgttaat tcgcttttag ggttgactgc tgcataacaa acagtcaact ttaatctttg
205801 taacaatcat ttaaagaaac cgtgatgcgt gcgtcaggcc ggtaatttat aatgcgcgac
205861 accaatcatg tccactcccg ataatttatt taagctgaca aacatgtatg taaaaatccg
205921 ttttgcacta ttgagctttt tgcttttatc caccggtatc agtgttgcgc ctttggccat
205981 tgcccgggga tcagcggttg aggttaaagg gacggcacct ttggaattgg catcgggtag
206041 cgctatggtg gttgatttac aaacgaataa agtcatttat gccaataatg cagataaagt
206101 tgtgcctatt gcctcaatca ctaaactgat gacggccatg gtggtattag atgccaaatt
206161 gcctcttgat gaaattctct ccgtagatat tgatcaaact aaagagttga aaggggtgtt
206221 ttcccgcgtt cgtgttaaca gtgaaattag ccgtaaggac atgctgctac tgacactgat
206281 gtcgtcagaa aatcgtgctg cggccagcct ggcccaccat taccctgggg gctacaacgc
206341 atttataaaa gccatgaatg caaaagccaa gtctttgggc atgaacagta cgcattatgt
206401 tgagccaaca gggctgtcga ttaataacgt ctcgacagcg cgtgatttag ccaaattact
206461 gatggcgacc aaacagtatc cgttaattgg ccagttaagt accactaccg aaaaaatggc
206521 tactttccgt gaaccaaatt atacgttgcc attccgcaat actaaccatt tggtttataa
206581 cgacaaatgg aatattcaac tgaccaaaac cggttttacc aaccaggctg gacactgctt
206641 ggtcatgcgt accgtgattg gcaagcggcc ggtcgcattg gtggtattgg atgcttttgg
206701 taaatatacc cattttgcgg atgcaaaccg cctgcgtagc tggatcgaaa caggtaaagc
206761 ggcgcccatc cctggtgctg ccaaaagtta tcgtcagcaa aaagattcac agggccgttt
206821 agcgcaggta tcagaataag agcagcgcat cgcgactgag aagcggaatt tgctagcaga
206881 ggcaaattcc gctttagtga ctaattgttt gctttcagat aatgcctaag cctaatggtg
206941 ctatttctgg ctaatagcgt cagtcttgct gccaacgctt acgttttgag gttttaccaa
207001 tccccggatt aaaactgttg gttgggtcga tctgttgata aaaatctttt aatgcaggtt
207061 ttgccagata gaggtggccg acattatgtt ctgcggggta ttccgcacct ttgtcattga
207121 ggagcgataa cattttttct tttagcgctt ttacatcaac ccctttctta acgatatagt
207181 cttgatgaaa gacgtggcac agaaaatgtc cataatagag cttcgcgatc agacattttt
207241 ctatttctgg tggtaaggtt tcaaaccact gtttttcatt acggcgcagg gcgatatcca
207301 gtgccaaaat atcttccact tgatcggcgt ggaccgcgtg atagcgcact gccgcacctg
207361 ccgccgcaaa acggtgtaga aaagcctttt taccttcatc agccgtgcag gcaataaagt
207421 taccttccgc agtggcaaaa aaggcgcgta gatgttgttg cgcttcggta ataccatcgc
207481 ctgacatttt gagcagcaga tgatgttcat attgattacg gtaagtctta aggcgcttag
207541 gtaaatggtt aggcaatatc tgactaaagc cttgcattac acgatcggtc aggtgatcga
207601 ctaaatgcgg aattttattg aggcgagcat caattttacc ttttaacgtg aagaatcggg
207661 gcatgttatt ggtgcccatg ctgttaatca tcacaaacgt atctttgcca taggtttccg
207721 caatatcaaa aatatcacgg tgcagatatt cacccgcgat aggtaaatgt ttaaagtcgc
207781 ggagaataat acggcgtaac tcagtcaata cctgagtttg gttggtgccg atataaaaaa
207841 cttgttgttg tttctcactg gggaaggtat ccagccgcac ggcaaatatt gccaatttac
207901 cggcacagcc ggaggcttca aataaacgtc tggggtcggc gttaaaacgc gaaggggtcg
207961 ttgcatcaat atctcttacc cgcgtcgcat attcactatc tgaggcttgc tgcccggtct
208021 gttcgatatc ttcagcacca tattggcctt gttccaaacg ttgtagaatc tcttctggtg
208081 tatttcccaa gctgatgcct aaatgattaa ttaactgcaa ctcaccttgt gcatcaattt
208141 gtgcatacag agccatttcg gtgtatgccg ggcctctttg gaccaatgaa ccgcctgaat
208201 tattgcaaat accgcctacg accgaggcac caatacagga ggagccaata accgaatggg
208261 gttcgcgacc ataaggtttt aagcgttttt ctaattgatt cagtgtgctt ccagggaatc
208321 ctataacttg tttgccatca tccaatagct gaatgttgtt aagacgtaag gtattcaaga
208381 tgacgatagg gcgatcataa tcatcaccac ttggcgtcga gccttctgtc agcccagtat



208501 gttgcagtag tgttgacgga aataccacgg cgagggccac ccctttaccg gagcggaaac
208561 cggttcggta gcgttcagtc tgccgttcac cggtcagtaa atagcgtgta cccactatat
208621 gctgtaattg agtcagcaag agctggattt tttcattatt gaactggctc atcaatgtgg
208681 ttccctgtta gtgacattag ccttttactg cattagcttt ttactacatc aaccttttac
208741 tcccttaacc gtttaatata tcgacgttga ttatcgcggt tagcggacta ttgtgattaa
208801 taattcaacc cgcattttga acaggaggta gctaaagtat agaatcagat ctaatggctg
208861 tactgattaa taaggcattt tatgagtgag cattttgttg ggaaatatga agttgaattg
208921 aaatttcgtg tcatggatct gacgacgttg catgaacaat tagtcgccca aaaagcgacg
208981 gcttttaccc ttaataatca tgagaaagac atctatctgg acgctaatgg gcaggattta
209041 gcggatcaac agatcagcat ggtattacgg gaaatgaatc catcaggtat tcgtttatgg
209101 attgttaaag ggccgggcgc tgaacgctgt gaggccagca atattgagga tgtcagtaaa
209161 gtacaaagta tgttggcaac gttaggttat cacccggcat tcaccataga gaaacagcga
209221 agtatttact tcgtggggaa gtttcatatc acggttgatc atctgaccgg attgggtgac
209281 tttgctgaaa tcgccatcat gaccgatgat gccaccgagc ttgataaatt aaaggcagaa
209341 tgccgtgatt ttgccaatac gttcgggtta caggttgatc aacaagaacc tcgttcatat
209401 cggcaactat tgggttttta gccgtcagaa agacgagccg cacagattat tttttcatta
209461 aagatttcaa attagcaaat gggttatggg tggcgacgcc ttgatccttt tgtgcatcat
209521 ctccggcaac cacggttgag ccgtataaat ccacctctgt atatttggaa tgctcatgat
209581 cgtgacaaaa caggcataac atctcccaat tgctgccatc ttctgggtta ttaccgtggt
209641 catgatcaat atgatgaacc gttagctctc gcagattgga atagacaaac tcccgtgagc
209701 atttaccaca aacccagggg aaaattttca gtgccttctc ccgataaccg ctttccaatc
209761 gggaataatt ctttggtata tatgccatat gagccttcaa ttctgcgaaa aatatgctgc
209821 ttattttagc gcgaaagtgg gcgtcggact atcgactatt gaggataaat actcaaacgc
209881 caacacgcca ttagcgcgtt tttcttctta tcgggatgta tcaacggcat ttactggtat
209941 tttaccaaaa gttatttaga gaacttttga gacttttgct gatgatttaa ggggataaga
210001 cgttttacac tgtcagtatg gaattattga gctaatttta ttttacggca
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LOCUS (LOC):              AF276872     GenBank (R)
GenBank ACC. NO. (GBN):   AF276872
GenBank VERSION (VER):    AF276872.2  GI:13162669
CAS REGISTRY NO. (RN):    306927-06-0
SEQUENCE LENGTH (SQL):    1743
MOLECULE TYPE (CI):       mRNA; linear
DIVISION CODE (CI):       Rodents
DATE (DATE):              28 Feb 2001
DEFINITION (DEF):         Mus musculus sodium and chloride-dependent ***high***
                          ***affinity*** ***choline*** ***transporter*** mRNA,
                          complete cds.
SOURCE:                   house mouse.
ORGANISM (ORGN):          Mus musculus
                          Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                          Euteleostomi; Mammalia; Eutheria; Rodentia;
                          Sciurognathi; Muridae; Murinae; Mus
NUCLEIC ACID COUNT (NA):  406 a 409 c 410 g 518 t
COMMENT:
     On Feb 28, 2001 this sequence version replaced gi:11527247.
REFERENCE:                1 (bases 1 to 1743)
   AUTHOR (AU):           Apparsundaram,S.; Ferguson,S.M.; Blakely,R.D.
   TITLE (TI):            Molecular cloning and characterization of ***human***
                          and murine high-affinity choline transporters
   JOURNAL (SO):          Unpublished
REFERENCE:                2 (bases 1 to 1743)
   AUTHOR (AU):           Apparsundaram,S.; Ferguson,S.M.; Blakely,R.D.
   TITLE (TI):            Direct Submission
   JOURNAL (SO):          Submitted (09-JUN-2000) Department of Pharmacology and
                          Center for Molecular Neuroscience, Vanderbilt
                          University, 23rd Avenue South at Pierce, Nashville, TN
                          37232-6420, USA
REFERENCE:                3 (bases 1 to 1743)
   AUTHOR (AU):           Apparsundaram,S.; Ferguson,S.M.; Blakely,R.D.
   TITLE (TI):            Direct Submission
   JOURNAL (SO):          Submitted (28-FEB-2001) Department of Pharmacology and
                          Center for Molecular Neuroscience, Vanderbilt
                          University, 23rd Avenue South at Pierce, Nashville, TN
                          37232-6420, USA
FEATURES (FEAT):
  Feature Key    Location    Qualifier
  source         1..1743     /organism="Mus musculus"
                             /db-xref="taxon:10090"
  CDS            1..1743     /codon-start=1
                             /product="sodium and
                             choline transporter"
                             /protein-id="AAG36945.2"
                             /db-xref="GI:13162670"
                             /translation="MPFHVEGLVAIILFYLLIFLVGIWAAWKTKNSGNPEERSEAIIV
                             GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYGPGCGLAWAHAPIGYSLSLILGGLFF
                             AKPMRSKGYVTMLDPFKQIYGKRMGGLLFIPALMGEMFWAAAIFSALGATISVIIDVD
                             VNISVIVSALIAILYTLVGGLYSVAYTDVVQLFCIFIGLWISVPFALSHPAVTDIGFT
                             AVHAKYQSPWLGTIESVEVYTWLDNFLLLMLGGIPWQAYFQRVLSSSSATYAQVLSFL
                             AAFGCLVMALPAICIGAIGASTDWNQTAYGYPDPKTKEEADMILPIVLQYLCPVYISF
                             FGLGAVSAAVMSSADSSILSASSMFARNIYQLSFRQNASDKEIVWVMRITVLVFGASA
                             TAMALLTKTVYGLWYLSSDLVYIIIFPQLLCVLFIKGTNTYGAVAGYIFGLFLRITGG
                             EPYLYLQPLIFYPGYYSDKNGIYNQRFPFKTLSMVTSFFTNICVSYLAKYLFESGTLP
                             PKLDVFDAVVARHSEENMDKTILVRNENIKLNELAPVKPRQSLTLSSTFTNKEALLDV
                             DSSPEGSGTEDNLQ"
SEQUENCE (SEQ):
1 atgcctttcc atgtggaagg actggtagct attatcctct tctacctcct tatatttctg 
61 gttggaatat gggctgcatg gaaaaccaaa aacagcggca acccagaaga gcgcagtgaa 
121 gccatcatag tcgggggccg tgacattggt ttgttggttg gtggttttac catgacagcc 
181 acctgggttg gaggaggcta catcaatggg acagcagaag cagtgtatgg gccaggttgt 
241 ggtctagctt gggctcatgc acccattgga tattctctga gtctaatttt aggtggtctg 
301 ttttttgcga aacctatgcg ttccaaggga tatgtgacta tgttagaccc attcaaacag
361 atctatggaa agcgcatggg tgggctgctc ttcatccctg cactgatggg agagatgttc 
421 tgggctgcag caattttctc tgcattaggg gccaccatca gcgtgatcat tgatgtggat 
481 gtgaacatat cggtcattgt ctctgcactc attgccattc tttataccct agtgggtggg 
541 ctctactctg tggcatatac tgatgttgtc cagctattct gcatttttat aggactgtgg 
601 atcagtgtcc cttttgccct gtcacatcct gcagtcaccg acatcggatt cacagctgtg 
661 catgctaaat accagagtcc ctggctggga accattgaat cagttgaagt ctacacctgg 
721 cttgataatt ttctgttatt gatgctgggt ggaatcccat ggcaagccta cttccagagg 
781 gtcctctctt catcctcagc cacctatgct caggtactgt ccttcctggc agcttttggg 
841 tgcctggtga tggctctacc cgccatatgc ataggagcta ttggagcttc cacagactgg 
901 aaccagactg cctacgggta tccagatccc aagactaagg aggaagcaga catgattctc 
961 ccgatcgttc tgcagtacct ctgccctgtg tacatctcct tctttgggct tggtgctgtt 
1021 tcagctgctg tcatgtcctc agctgactcg tccatcctgt cggcgagttc tatgtttgct 
1081 cggaatatct accagctttc cttcagacaa aatgcatcag acaaggaaat tgtgtgggtc 
1141 atgaggatca ctgtgcttgt gttcggagca tctgcaacag ccatggcttt gctgacgaag 
1201 actgtgtatg ggctctggta cctgagctct gaccttgtct acatcatcat cttcccacag 
1261 ctgctctgtg tactcttcat caaaggaacc aacacttatg gggcagttgc tggttatatt 
1321 tttggactat tcctgagaat tactggagga gagccatatc tatacttgca gcccttaatc 
1381 ttctaccctg gttattactc tgacaagaat ggtatataca atcagaggtt cccatttaaa
1441 actctctcca tggttacctc attctttacc aacatttgtg tttcttatct agccaagtat 
1501 ctatttgaaa gtggaacctt gcctccaaaa ttagatgtat ttgatgctgt tgtcgcaagg 
1561 cacagtgaag agaacatgga caagaccatt ctagtcagaa atgaaaatat caaattaaat 
1621 gaacttgcac ctgtgaaacc tcggcagagc ctaaccctca gttcaacttt caccaataag 
1681 gaggccctcc ttgatgttga ttccagtccg gaggggtctg ggactgaaga taatttacaa 
1741 tga 
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LOCUS (LOG) : AB043 997 GenBank (R) 

GenBank ACC. NO. (GBN) : AB043 997 
GenBank VERSION (VER) 
CAS REGISTRY NO. (RN) 
SEQUENCE LENGTH (SQL) 



AB043 997 . 1 GI: 11231080 

304428-43-1 

5158 



MOLECULE TYPE (CI) : mRNA; linear 

DIVISION CODE (CI) : Primates 
DATE (DATE) : 19 Nov 2000 

DEFINITION (DEF) : Homo sapiens mRNA for ***high*** - ***af f inity*** 

***choline*** ***transporter*** CHTl, complete 

cds . 

SOURCE: Homo sapiens spinal cord cDNA to mRNA. 

ORGANISM (ORGN) : Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini; 
Hominidae ; Homo 



REFERENCE: 

AUTHOR (AU) : 
TITLE (TI) : 



JOURNAL (SO) 
REFERENCE : 

AUTHOR (AU) : 
TITLE (TI) : 
JOURNAL (SO) 



***human*** 
***choline*** 



1 (sites) 
Okuda,T.; Haqa,T. 

Functional characterization of the 

***high*** - ***af f inity*** 

* * * transporter* * * 
FEES Lett., 484 (2), 92-97 (2000) 

2 (bases 1 to 5158) 
Okuda , T . 

Direct Submission 

Submitted (30-MAY-2000) Takashi Okuda, University of 
Tokyo, Faculty of Medicine, Department of 
Neurochemistry; 7-3-1 Kongo, Bunkyo-ku, Tokyo 113 0033, 
Japan ( E -mai 1 : okuda@m . u- tokyo . ac . ] P / 
URL : http : / /park . ecc . u- 1 okyo . ac . j p/ neurochemistry , 
Tel: 81-3 -5841-3 560, Fax:81-3-6814-8154) 



FEATURES (FEAT) :

Feature Key     Location      Qualifier

source          1..5158       /organism="Homo sapiens"
                              /db-xref="taxon:9606"
                              /tissue-type="spinal cord"
CDS             277..2019     /codon-start=1
                              /product="high-affinity choline
                              transporter CHT1"
                              /protein-id="BAB18161.1"
                              /db-xref="GI:11231081"
                              /translation="MAFHVEGLIAIIVFYLLILL
                              VGIWAAWRTKNSGSAEERSEAIIV
                              GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYVP
                              GYGLAWAQAPIGYSLSLILGGLFF
                              AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALM
                              GEMFWAAAIFSALGATISVIIDVD
                              MHISVIISALIATLYTLVGGLYSVAYTDVVQLFC
                              IFVGLWISVPFALSHPAVADIGFT
                              AVHAKYQKPWLGTVDSSEVYSWLDSFLLLMLGGI
                              PWQAYFQRVLSSSSATYAQVLSFL
                              AAFGCLVMAIPAILIGAIGASTDWNQTAYGLPDP
                              KTTEEADMILPIVLQYLCPVYISF
                              FGLGAVSAAVMSSADSSILSASSMFARNIYQLSF
                              RQNASDKEIVWVMRITVFVFGASA
                              TAMALLTKTVYGLWYLSSDLVYIVIFPQLLCVLF
                              VKGTNTYGAVAGYVSGLFLRITGG
                              EPYLYLQPLIFYPGYYPDDNGIYNQKFPFKTLAM
                              VTSFLTNICISYLAKYLFESGTLP
                              PKLDVFDAVVARHSEENMDKTILVKNENIKLDEL
                              ALVKPRQSMTLSSTFTNKEAFLDV
                              DSSPEGSGTEDNLQ"

SEQUENCE (SEQ) :

1 ctgcgccgcg cgccctccgc cggcgccaac acctgtcaac tctgcgcgct cccaggttct 
61 tggagacgcc gagtgaggag ccgccctgac cgcccggccc cgctccgcac tgcaccccga 
121 cccaccccgc accctccctt tctgcaccct cgcacccaca cccctcgcgg ttccaggggg 
181 cggcggcccc gggcggagcg ctttcgcgtg cagccaccac tccagaagac ttaatgaagt 
241 agccagctgc agaagaatct ggatcattag ataaaaatgg ctttccatgt ggaaggactg 
301 atagctatca tcgtgttcta ccttctaatt ttgctggttg gaatatgggc tgcctggaga 
361 accaaaaaca gtggcagcgc agaagagcgc agcgaagcca tcatagttgg tggccgagat 
421 attggtttat tggttggtgg atttaccatg acagctacct gggtcggagg agggtatatc 
481 aatggcacag ctgaagcagt ttatgtacca ggttatggcc tagcttgggc tcaggcacca
541 attggatatt ctcttagtct gattttaggt ggcctgttct ttgcaaaacc tatgcgttca 
601 aaggggtatg tgaccatgtt agacccgttt cagcaaatct atggaaaacg catgggcgga 
661 ctcctgttta ttcctgcact gatgggagaa atgttctggg ctgcagcaat tttctctgct 
721 ttgggagcca ccatcagcgt gatcatcgat gtggatatgc acatttctgt catcatctct 
781 gcactcattg ccactctgta cacactggtg ggagggctct attctgtggc ctacactgat 
841 gtcgttcagc tcttttgcat ttttgtaggg ctgtggatca gcgtcccctt tgcattgtca 
901 catcctgcag tcgcagacat cgggttcact gctgtgcatg ccaaatacca aaagccgtgg 
961 ctgggaactg ttgactcatc tgaagtctac tcttggcttg atagttttct gttgttgatg 
1021 ctgggtggaa tcccatggca agcatacttt cagagggttc tctcttcttc ctcagccacc 
1081 tatgctcaag tgctgtcctt cctggcagct ttcgggtgcc tggtgatggc catcccagcc 
1141 atactcattg gggccattgg agcatcaaca gactggaacc agactgcata tgggcttcca 
1201 gatcccaaga ctacagaaga ggcagacatg attttaccaa ttgttctgca gtatctctgc
1261 cctgtgtata tttctttctt tggtcttggt gcagtttctg ctgctgttat gtcatcagca 
1321 gattcttcca tcttgtcagc aagttccatg tttgcacgga acatctacca gctttccttc 
1381 agacaaaatg cttcggacaa agaaatcgtt tgggttatgc gaatcacagt gtttgtgttt 
1441 ggagcatctg caacagccat ggccttgctg acgaaaactg tgtatgggct ctggtacctc 



1561 ggaaccaaca cctatggggc cgtggcaggt tatgtttctg gcctcttcct gagaataact 
1621 ggaggggagc catatctgta tcttcagccc ttgatcttct accctggcta ttaccctgat 
1681 gataatggta tatataatca gaaatttcca tttaaaacac ttgccatggt tacatcattc 
1741 ttaaccaaca tttgcatctc ctatctagcc aagtatctat ttgaaagtgg aaccttgcca 
1801 cctaaattag atgtatttga tgctgttgtt gcaagacaca gtgaagaaaa catggataag
1861 acaattcttg tcaaaaatga aaatattaaa ttagatgaac ttgcacttgt gaagccacga 
1921 cagagcatga ccctcagctc aactttcacc aataaagagg ccttccttga tgttgattcc 
1981 agtccagaag ggtctgggac tgaagataat ttacagtgac cccatctaaa taaaatactg 
2041 cttttgcaaa cagaacactg taatagggta gttctggaga gatggtatgc agcatacaaa 
2101 aatatattaa aaatataaac aatgttcagg agagtaaaaa ttcatataaa gtgcaattgc 
2161 acaaatacaa gccaagctag aaggaagcac ctatgaaagc aacaactttg tttctcatcc 
2221 atagtagtat tgattttgat gctagatagt tttgctaggt ataaaaaata agtaaagttc 
2281 cacttagaga acaaagggcc aaatagagtt tttatatttg ttatgataaa aggaagtaga 
2341 tgtgaaaaag cctaagaaaa aggaaattgg acagttttga tacaaacttt gtttgctaat 
2401 gcactgatga gtctagtttc attatagcac gaagctatga gaataacttc agtcactccc 
2461 ttgaatggtg caatgaatta accagctgat ttttcttagt gtgatgatta accccttctt 
2521 tcatgttctg agctataaca tttgctgaat atgcaatttg ttattctttt attaatggca 
2581 tgtaatattc tgagcacggg caaagaaaac acacaaaaaa ttatgtattg gcatttattt 
2641 atgtgcaagg tgataggaaa actgaatcca tctttgtaga agagcactgg gctaatttgt 
2701 atgtttccat agctactata tgcataaaca acagtacctg aaggattatt aagcaacctt 
2761 aaagcaataa gttcattaaa cagaaggtaa taggaagaac agtacatttt tgtctttatc 
2821 tcaagtatat aaagttttgc cccatgataa aataagtaac atataaatta atacataagt 
2881 taatattgga agtgaaatta ggccttatgt aaggagaggt agctacacag catctgttac
2941 ggttaatgag aggcacactg ctaaatttac gtatatttca ttgaatactt tatgggaagc 
3001 ctccacaaca cctcctgata gtagtaaaat tagaaaaaaa caaactctac taatttttag 
3061 atattcatta ataggaaaga tttaatgtta cagcatctac tctttggcac tttttcacat 
3121 aagccagcct tagagactag acaaataact actgcaaaaa gtacatagtg aattttaaaa 
3181 tgtagaagaa agcaaaaaat aaaataagtc ctaaagtgaa gaagccatca attttaacat 
3241 ttgatgtcta gataacattc atgagattgg tcattaaata ctatttgtct ccttttctgg 
3301 tgttacacat acacactcat acgtgtgtgt gtgtgtatgt gtgtgtgtgt atgtgtaatg
3361 atttgtgtcc atttcaagtg ttttcacacc agtcacacat tatagaaatt actggagcca 
3421 tgtctacaaa agcaaagtga agtgatgagc tgaaaagtga tcagggaatc tgatcccata 
3481 aagaaagtga gtctgaattg ttgaaactaa agcacttctt ttccaaaaat acttctgcaa 
3541 ctcatgaaca aaatgaagtt aacttagtgt ttagcattaa aaataaaagt tgtaccttga
3601 ccctagaatt ccttttataa gatttaattg taattgtctc aaagcatcaa ataatacatc 
3661 tttatcatta aacatgaatt taatgtctag gataattttt aaaagaggaa attaaataat
3721 tttacatgtc tcatattttc agtgctttct caaaatttga tggtgaatat tacattgcat 
3781 tcaaaatctt cactgaatat catatatatc taaatgaaaa gagaattact tatattcatc 
3841 tattttctca gaatctattc agtaccctat ggagtacatg gaagaagtat atttttggaa 
3901 ccagtaagta tgggagtcca cgattctcag gtattgattg aaaaaagaga aaggggcaag 

3961 aaattcatta aagaaaaacc acagatactg aaacatttga cacataaaag taattaattg

4021 ctggagacta caattcctgg caaacaccat tattccagat gcaccatgct acccagcatg
4081 ctacctgaat ggcggttttt atccatcctg tcaccctgag catctgtctc aacctgctga 
4141 ggcacagact tccaccatgt gcatcaggaa acacctcact catactgtca tataatgcaa 
4201 cagggaggca tagtctctct acaggtctct ctctttacgt tgtaccattc tgttgggcaa 
4261 caattgtccc aacaaccaca cttaaatcca gtggggaaaa aaaggttaac agtaaaaatt 
4321 tcttccacgt aattcaattt ctcacaatag acagaaacag ggggaaaatg tttgccctgt 
4381 tcaaagtact aaaactagat aacttaaata atttaagcca ataaataata ttatattgta 
4441 tttaacccat tattcctcac ctaccctcaa agtaggtata caaaagacta gtttgttatt 
4501 aatccacaaa tatgctaaat gttagcaaga tgatttttaa gtacaaaaaa gtttaagatt 
4561 gcaaggaagg aaacaatggt cacaaggtct ttacttatgc acatttgtgc ataaatttcc 
4621 acccaatttc caagacagtt gtatcacaaa gctgtgaaat gtgcaagagt atactttgta 
4681 cccatcatgt gctctgtttt gacataatga gtaatgcaga catttacatg ttcttccagc 
4741 ttcaatcagt agacctgatt ttcaaaatct gccctcagat gaaaataaaa taaacgaaac
4801 atttagacaa tggctatatt aatctgggaa ggcaacacct gtggataaaa cattagaaga
4861 aaacatatat tttattttca tacttttgat atgattgtaa catatttctt gagtaattta 
4921 aatgctttgt tttcccacac atattcaaat cagcaagctt gtagctggac tgcaatatca 
4981 acaacaagtt gtttcaaaca gcatcaaaat acaacttcat tgctacactt acaagtaatg
5041 aattttatgt gactatgaaa cgaatataag cttaaaataa gtgaatctaa taaaatggct 
5101 attcctcttt ttacttggaa ataataaatt aacattatat tctaaaaaaa aaaaaaaa 
LOCUS (LOC) : AF276871 GenBank (R)
GenBank ACC. NO. (GBN) : AF276871
GenBank VERSION (VER) : AF276871.1 GI: 10998441
CAS REGISTRY NO. (RN) : 392136-26-4
SEQUENCE LENGTH (SQL) : 1743
MOLECULE TYPE (CI) : mRNA; linear
DIVISION CODE (CI) : Primates
DATE (DATE) : 27 Nov 2000
DEFINITION (DEF) : Homo sapiens ***high*** ***affinity***
                   ***choline*** ***transporter*** (SLC5A7) mRNA,
                   complete cds.
SOURCE : ***human***
ORGANISM (ORGN) : Homo sapiens
                  Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                  Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
                  Hominidae; Homo
NUCLEIC ACID COUNT (NA) : 412 a 393 c 406 g 532 t
REFERENCE : 1 (bases 1 to 1743)
   AUTHOR (AU) : Apparsundaram,S.; Ferguson,S.M.; George,A.L. Jr.;
                 Blakely,R.D.
   TITLE (TI) : Molecular cloning of a ***human***
                hemicholinium-3-sensitive choline transporter
   JOURNAL (SO) : Biochem. Biophys. Res. Commun., 276 (3), 862-867 (2000)
   OTHER SOURCE (OS) : CA 134:52827
REFERENCE : 2 (bases 1 to 1743)
   AUTHOR (AU) : Apparsundaram,S.; Ferguson,S.M.; Blakely,R.D.
   TITLE (TI) : Direct Submission
   JOURNAL (SO) : Submitted (09-JUN-2000) Department of Pharmacology and
                  Center for Molecular Neuroscience, Vanderbilt
                  University, 23rd Avenue South at Pierce, Nashville, TN
                  37232-6420, USA



FEATURES (FEAT) :

Feature Key     Location      Qualifier

source          1..1743       /organism="Homo sapiens"
                              /db-xref="taxon:9606"
                              /chromosome="2"
                              /map="2q12"
gene            1..1743       /gene="SLC5A7"
CDS             1..1743       /gene="SLC5A7"
                              /note="hCHT; solute carrier family
                              5 member 7"
                              /codon-start=1
                              /product="high affinity choline
                              transporter"
                              /protein-id="AAG25940.1"
                              /db-xref="GI:10998442"
                              /translation="MAFHVEGLIAIIVFYLLILL
                              VGIWAAWRTKNSGSAEERSEAIIV
                              GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYVP
                              GYGLAWAQAPIGYSLSLILGGLFF
                              AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALM
                              GEMFWAAAIFSALGATISVIIDVD
                              MHISVIISALIATLYTLVGGLYSVAYTDVVQLFC
                              IFVGLWISVPFALSHPAVADIGFT
                              AVHAKYQKPWLGTVDSSEVYSWLDSFLLLMLGGI
                              PWQAYFQRVLSSSSATYAQVLSFL
                              AAFGCLVMAIPAILIGAIGASTDWNQTAYGLPDP
                              KTTEEADMILPIVLQYLCPVYISF
                              FGLGAVSAAVMSSADSSILSASSMFARNIYQLSF
                              RQNASDKEIVWVMRITVFVFGASA
                              TAMALLTKTVYGLWYLSSDLVYIVIFPQLLCVLF
                              VKGTNTYGAVAGYVSGLFLRITGG
                              EPYLYLQPLIFYPGYYPDDNGIYNQKFPFKTLAM
                              VTSFLTNICISYLAKYLFESGTLP
                              PKLDVFDAVVARHSEENMDKTILVKNENIKLDEL
                              ALVKPRQSMTLSSTFTNKEAFLDV
                              DSSPEGSGTEDNLQ"



SEQUENCE (SEQ) : 

1 atggctttcc atgtggaagg actgatagct atcatcgtgt tctaccttct aattttgctg
61 gttggaatat gggctgcctg gagaaccaaa aacagtggca gcgcagaaga gcgcagcgaa
121 gccatcatag ttggtggccg agatattggt ttattggttg gtggatttac catgacagct
181 acctgggtcg gaggagggta tatcaatggc acagctgaag cagtttatgt accaggttat
241 ggcctagctt gggctcaggc accaattgga tattctctta gtctgatttt aggtggcctg
301 ttctttgcaa aacctatgcg ttcaaagggg tatgtgacca tgttagaccc gtttcagcaa
361 atctatggaa aacgcatggg cggactcctg tttattcctg cactgatggg agaaatgttc
421 tgggctgcag caattttctc tgctttggga gccaccatca gcgtgatcat cgatgtggat
481 atgcacattt ctgtcatcat ctctgcactc attgccactc tgtacacact ggtgggaggg
541 ctctattctg tggcctacac tgatgtcgtt cagctctttt gcatttttgt agggctgtgg
601 atcagcgtcc cctttgcatt gtcacatcct gcagtcgcag acatcgggtt cactgctgtg
661 catgccaaat accaaaagcc gtggctggga actgttgact catctgaagt ctactcttgg
721 cttgatagtt ttctgttgtt gatgctgggt ggaatcccat ggcaagcata ctttcagagg
781 gttctctctt cttcctcagc cacctatgct caagtgctgt ccttcctggc agctttcggg
841 tgcctggtga tggccatccc agccatactc attggggcca ttggagcatc aacagactgg
901 aaccagactg catatgggct tccagatccc aagactacag aagaggcaga catgatttta
961 ccaattgttc tgcagtatct ctgccctgtg tatatttctt tctttggtct tggtgcagtt



1081 cggaacatct accagctttc cttcagacaa aatgcttcgg acaaagaaat cgtttgggtt 
1141 atgcgaatca cagtgtttgt gtttggagca tctgcaacag ccatggcctt gctgacgaaa 

1201 actgtgtatg ggctctggta cctcagttct gaccttgttt acatcgttat cttcccccag
1261 ctgctttgtg tactctttgt taagggaacc aacacctatg gggccgtggc aggttatgtt 
1321 tctggcctct tcctgagaat aactggaggg gagccatatc tgtatcttca gcccttgatc 

1381 ttctaccctg gctattaccc tgatgataat ggtatatata atcagaaatt tccatttaaa
1441 acacttgcca tggttacatc attcttaacc aacatttgca tctcctatct agccaagtat 
1501 ctatttgaaa gtggaacctt gccacctaaa ttagatgtat ttgatgctgt tgttgcaaga 
1561 cacagtgaag aaaacatgga taagacaatt cttgtcaaaa atgaaaatat taaattagat 
1621 gaacttgcac ttgtgaagcc acgacagagc atgaccctca gctcaacttt caccaataaa 
1681 gaggccttcc ttgatgttga ttccagtcca gaagggtctg ggactgaaga taatttacag 
1741 tga 
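
The CDS locations in these records can be cross-checked with a little arithmetic: a span such as 277..2019 should cover a whole number of codons, and the codon count minus the stop codon should equal the length of the listed translation. The Python sketch below illustrates that check. The helper names `cds_length` and `protein_length` are hypothetical and are not part of STN or GenBank tooling; it assumes a simple `start..end` location with no joins or complements.

```python
def cds_length(location: str) -> int:
    """Number of bases covered by a simple 'start..end' location (inclusive)."""
    start, end = (int(part) for part in location.split(".."))
    return end - start + 1


def protein_length(location: str) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    n_bases = cds_length(location)
    if n_bases % 3 != 0:
        raise ValueError(f"{location} spans {n_bases} bases, not a multiple of 3")
    return n_bases // 3 - 1


if __name__ == "__main__":
    # CDS locations quoted in the records: AB043997, AF276871, AJ401466
    for loc in ("277..2019", "1..1743", "19..1761"):
        print(loc, cds_length(loc), protein_length(loc))
```

All three human CDS locations span 1743 bases, which corresponds to 581 codons, that is, 580 residues plus the stop codon `tga`.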



L3 ANSWER 104 OF 111 GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC) : MMU401467 GenBank (R)
GenBank ACC. NO. (GBN) : AJ401467
GenBank VERSION (VER) : AJ401467.1 GI:9843808
CAS REGISTRY NO. (RN) : 286924-11-6
SEQUENCE LENGTH (SQL) : 1743
MOLECULE TYPE (CI) : mRNA; linear
DIVISION CODE (CI) : Rodents
DATE (DATE) : 16 Aug 2000
DEFINITION (DEF) : Mus musculus mRNA for ***high*** ***affinity***
                   ***choline*** ***transporter*** (CHT1 gene).
SOURCE : house mouse.
ORGANISM (ORGN) : Mus musculus
                  Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                  Euteleostomi; Mammalia; Eutheria; Rodentia;
                  Sciurognathi; Muridae; Murinae; Mus
NUCLEIC ACID COUNT (NA) : 407 a 410 c 410 g 516 t
REFERENCE : 1 (bases 1 to 1743)
   AUTHOR (AU) : Wieland,A.; Bonisch,H.; Bruss,M.
   TITLE (TI) : Molecular cloning of the ***human*** and murine
                high affinity choline transporters and characterization
                of the ***human*** gene-structure
   JOURNAL (SO) : Unpublished
REFERENCE : 2 (bases 1 to 1743)
   AUTHOR (AU) : Bruess,M.
   TITLE (TI) : Direct Submission
   JOURNAL (SO) : Submitted (14-AUG-2000) Bruess M., University of Bonn,
                  Pharmacology and Toxicology, Reuterstr. 2b, D-53113
                  Bonn, GERMANY



FEATURES (FEAT) :

Feature Key     Location      Qualifier
===============+=============+==================================
source          1..1743       /organism="Mus musculus"
                              /strain="BALB/cJ"
                              /db-xref="taxon:10090"
                              /tissue-type="brainstem"
gene            1..1743       /gene="CHT1"
CDS             1..1743       /gene="CHT1"
                              /function="sodium- and
                              chloride-dependent reuptake of
                              choline"
                              /codon-start=1
                              /evidence=experimental
                              /product="high affinity choline
                              transporter"
                              /protein-id="CAC03719.1"
                              /db-xref="GI:9843809"
                              /translation="MSFHVEGLVAIILFYLLIFL
                              VGIWAAWKTKNSGNPEEHSEAIIV
                              GGRDIGLLVGGFTMTATWVGGGYINGTAVAVYGP
                              GCGLAWAQAPIGYSLSLILGGLFF
                              AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALM
                              GEMFWAAAIFSALGATISVIIDVD
                              VNISVIVSALIAILYTLVGGLYSVAYTDVVQLFC
                              IFIGLWISVPFALSHPAVTDIGFT
                              AVHAKYQSPWLGTIESVEVYTWLDNFLLLMLGGI
                              PWQAYFQRVLSSSSATYAQVLSYL
                              AAFGCLVMALPAICIGAIGASTDWNQTAYGYPDP
                              KTKEEADMILPIVLQYLCPVYISF
                              FGLGAVSAAVMSSADSSILSASSMFARNIYQLSF
                              RQNASDKEIVWVMRITVLVFGASA
                              TAMALLTKTVYGLWYLSSDLVYIIIFPQLLCVLF
                              IKGTNTYGAVAGYIFGLFLRITGG
                              EPYLYLQPLIFYPGYYSDKNGIYNQRFPFKTLSM
                              VTSFFTNICVSYLAKYLFESGTLP
                              PKLDVFDAVVARHSEENMDKTILVRNENIKLNEL
                              APVKPRQSLTLSSTFTNKEALLDV
                              DSSPEGSGTEDNLQ"



SEQUENCE (SEQ) : 

1 atgtctttcc acgtagaagg actggtagct attatcctct tctacctcct tatatttctg
61 gttggaatat gggctgcatg gaaaaccaaa aacagcggca acccagaaga gcacagtgaa
121 gccatcatag tcgggggccg tgacattggt ttgttggttg gtggttttac catgacagcc
181 acctgggttg gaggaggcta catcaatggg acagcagtag cagtgtatgg gccaggttgt
241 ggtctagctt gggctcaggc acccattgga tattctctga gtctaatttt aggtggtctg
301 ttttttgcga aacctatgcg ttccaaggga tatgtgacta tgttagaccc atttcaacag
361 atctatggaa agcgcatggg tgggctgctc ttcatccctg cactgatggg agagatgttc
421 tgggctgcag caattttctc tgcattaggg gccaccatca gcgtgatcat tgatgtggat
481 gtgaacatat cggtcattgt ctccgcactc attgccattc tttataccct agtgggtggg
541 ctctactctg tggcatatac tgatgttgtc cagctattct gcatttttat aggactgtgg
601 atcagtgtcc cttttgccct gtcacatcct gcagtcaccg acatcggatt cacagctgtg
661 catgctaaat accagagtcc ctggctggga accattgaat cagttgaagt ctacacctgg
721 cttgataatt ttctgttatt gatgctgggt ggaatcccat ggcaagccta cttccagagg
781 gtcctctctt catcctcagc cacctatgct caggtactgt cctacctggc agcttttggg
841 tgcctggtga tggctctacc cgccatatgc ataggagcta ttggagcttc cacagactgg
901 aaccagactg cctacgggta tccagatccc aagactaagg aggaagcaga catgattctc
961 ccgatcgttc tgcagtacct ctgccctgtg tacatctcct tctttgggct tggtgctgtt
1021 tcagctgctg tcatgtcctc agctgactcg tccatcctgt cggcgagttc tatgtttgct
1081 cggaatatct accagctttc cttcagacaa aatgcatcag acaaggaaat tgtgtgggtc
1141 atgaggatca ctgtgcttgt gttcggagca tctgcaacag ccatggcttt gctgacgaag
1201 actgtgtatg ggctgtggta cctgagctct gaccttgtct acatcatcat cttcccacag
1261 ctgctctgtg tactcttcat caaaggaacc aacacttatg gggcagttgc tggttatatt
1321 tttggactat tcctgagaat tactggagga gagccatatc tatacttgca gcccttaatc
1381 ttctaccctg gttattactc tgacaagaat ggtatataca atcagaggtt cccatttaaa
1441 actctctcca tggttacctc attctttacc aacatttgtg tttcttatct agccaagtat
1501 ctatttgaaa gtggaacctt gcctccaaaa ttagatgtat ttgatgctgt tgtcgcaagg
1561 cacagtgaag agaacatgga caagaccatt ctagtcagaa atgaaaatat caaattaaat
1621 gaacttgcac ctgtgaaacc tcggcagagc ctaaccctca gttcaacttt caccaataag
1681 gaggccctcc ttgatgttga ttccagtccg gaggggtctg ggactgaaga taacttacaa
1741 tga
GENBANK.RTM. COPYRIGHT 2004 on STN

LOCUS (LOC) : HSA401466 GenBank (R)
GenBank ACC. NO. (GBN) : AJ401466
GenBank VERSION (VER) : AJ401466.1 GI: 9843753
CAS REGISTRY NO. (RN) : 286923-93-1
SEQUENCE LENGTH (SQL) : 1813
MOLECULE TYPE (CI) : mRNA; linear
DIVISION CODE (CI) : Primates
DATE (DATE) : 16 Aug 2000
DEFINITION (DEF) : Homo sapiens mRNA for ***high*** ***affinity***
                   ***choline*** ***transporter*** (CHT1 gene).
SOURCE : ***human***
ORGANISM (ORGN) : Homo sapiens
                  Eukaryota; Metazoa; Chordata; Craniata; Vertebrata;
                  Euteleostomi; Mammalia; Eutheria; Primates; Catarrhini;
                  Hominidae; Homo
NUCLEIC ACID COUNT (NA) : 440 a 406 c 417 g 550 t
REFERENCE : 1 (bases 1 to 1813)
   AUTHOR (AU) : Wieland,A.; Bonisch,H.; Bruss,M.
   TITLE (TI) : Molecular cloning of the ***human*** and murine
                high affinity choline transporters and characterization
                of the ***human*** gene-structure
   JOURNAL (SO) : Unpublished
REFERENCE : 2 (bases 1 to 1813)
   AUTHOR (AU) : Bruess,M.
   TITLE (TI) : Direct Submission
   JOURNAL (SO) : Submitted (14-AUG-2000) Bruess M., University of Bonn,
                  Pharmacology and Toxicology, Reuterstr. 2b, D-53113
                  Bonn, GERMANY



FEATURES (FEAT) :

Feature Key     Location      Qualifier
===============+=============+==================================
source          1..1813       /organism="Homo sapiens"
                              /chromosome="2"
                              /map="2q11-13"
                              /tissue-type="hypothalamus"
gene            19..1761      /gene="CHT1"
CDS             19..1761      /gene="CHT1"
                              /function="sodium- and
                              chloride-dependent reuptake of
                              choline"
                              /codon-start=1
                              /evidence=experimental
                              /product="high affinity choline
                              transporter"
                              /protein-id="CAC03717.1"
                              /db-xref="GI:9843754"
                              /translation="MAFHVEGLIAIIVFYLLILL
                              VGIWAAWRTKNSGSAEERSEAIIV
                              GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYVP
                              GYGLAWAQAPIGYSLSLILGGLFF
                              AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALM
                              GEMFWAAAIFSALGATISVIIDVD
                              MHISVIISALIATLYTLVGGLYSVAYTDVVQLFC
                              IFVGLWISVPFALSHPAVADIGFT
                              AVHAKYQKPWLGTVDSSEVYSWLDSFLLLMLGGI
                              PWQAYFQRVLSSSSATYAQVLSFL
                              AAFGCLVMAIPAILIGAIGASTDWNQTAYGLPDP
                              KTTEEADMILPIVLQYLCPVYISF
                              FGLGAVSAAVMSSADSSILSASSMFARNIYQLSF
                              RQNASDKEIVWVMRITVFVFGASA
                              TAMALLTKTVYGLWYLSSDLVYIVIFPQLLCVLF
                              VKGTNTYGAVAGYVSGLFLRITGG
                              EPYLYLQPLIFYPGYYPDDNGIYNQKFPFKTLAM
                              VTSFLTNICISYLAKYLFESGTLP
                              PKLDVFDAVVARHSEENMDKTILVKNENIKLDEL
                              ALVKPRQSMTLSSTFTNKEAFLDV
                              DSSPEGSGTEDNLQ"

SEQUENCE (SEQ) : 

1 ctggatcatt agataaaaat ggctttccat gtggaaggac tgatagctat catcgtgttc 
61 taccttctaa ttttgctggt tggaatatgg gctgcctgga gaaccaaaaa cagtggcagc 
121 gcagaagagc gcagcgaagc catcatagtt ggtggccgag atattggttt attggttggt 
181 ggatttacca tgacagctac ctgggtcgga ggagggtata tcaatggcac agctgaagca 
241 gtttatgtac caggttatgg cctagcttgg gctcaggcac caattggata ttctcttagt 
301 ctgattttag gtggcctgtt ctttgcaaaa cctatgcgtt caaaggggta tgtgaccatg
361 ttagacccgt ttcagcaaat ctatggaaaa cgcatgggcg gactcctgtt tattcctgca
421 ctgatgggag aaatgttctg ggctgcagca attttctctg ctttgggagc caccatcagc 
481 gtgatcatcg atgtggatat gcacatttct gtcatcatct ctgcactcat tgccactctg 
541 tacacactgg tgggagggct ctattctgtg gcctacactg atgtcgttca gctcttttgc 
601 atttttgtag ggctgtggat cagcgtcccc tttgcattgt cacatcctgc agtcgcagac 
661 atcgggttca ctgctgtgca tgccaaatac caaaagccgt ggctgggaac tgttgactca 
721 tctgaagtct actcttggct tgatagtttt ctgttgttga tgctgggtgg aatcccatgg 
781 caagcatact ttcagagggt tctctcttct tcctcagcca cctatgctca agtgctgtcc
841 ttcctggcag ctttcgggtg cctggtgatg gccatcccag ccatactcat tggggccatt 
901 ggagcatcaa cagactggaa ccagactgca tatgggcttc cagatcccaa gactacagaa 
961 gaggcagaca tgattttacc aattgttctg cagtatctct gccctgtgta tatttctttc 
1021 tttggtcttg gtgcagtttc tgctgctgtt atgtcatcag cagattcttc catcttgtca 
1081 gcaagttcca tgtttgcacg gaacatctac cagctttcct tcagacaaaa tgcttcggac 
1141 aaagaaatcg tttgggttat gcgaatcaca gtgtttgtgt ttggagcatc tgcaacagcc 
1201 atggccttgc tgacgaaaac tgtgtatggg ctctggtacc tcagttctga ccttgtttac 
1261 atcgttatct tcccccagct gctttgtgta ctctttgtta agggaaccaa cacctatggg 
1321 gccgtggcag gttatgtttc tggcctcttc ctgagaataa ctggagggga gccatatctg 
1381 tatcttcagc ccttgatctt ctaccctggc tattaccctg atgataatgg tatatataat
1441 cagaaatttc catttaaaac acttgccatg gttacatcat tcttaaccaa catttgcatc 
1501 tcctatctag ccaagtatct atttgaaagt ggaaccttgc cacctaaatt agatgtattt 
1561 gatgctgttg ttgcaagaca cagtgaagaa aacatggata agacaattct tgtcaaaaat 
1621 gaaaatatta aattagatga acttgcactt gtgaagccac gacagagcat gaccctcagc 
1681 tcaactttca ccaataaaga ggccttcctt gatgttgatt ccagtccaga agggtctggg 
1741 actgaagata atttacagtg accccatcta aataaaatac tgcttttgca aacagaacac 
1801 tgtaataggg tag
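
Each GenBank answer above lists a NUCLEIC ACID COUNT (NA) alongside a SEQUENCE LENGTH (SQL), and the four base counts should add up to the stated length. The Python sketch below illustrates one way to confirm this. The function name `base_count_total` is hypothetical, and the regular expression assumes the count line contains only `a`, `c`, `g` and `t` entries in the `412 a 393 c ...` layout shown in these records.

```python
import re


def base_count_total(count_line: str) -> int:
    """Sum the per-base counts in a line such as '412 a 393 c 406 g 532 t'."""
    pairs = re.findall(r"(\d+)\s*([acgt])", count_line.lower())
    return sum(int(count) for count, _base in pairs)


if __name__ == "__main__":
    # (count line, stated sequence length) as printed in the records
    records = {
        "AF276871": ("412 a 393 c 406 g 532 t", 1743),
        "AJ401467": ("407 a 410 c 410 g 516 t", 1743),
        "AJ401466": ("440 a 406 c 417 g 550 t", 1813),
    }
    for accession, (line, length) in records.items():
        print(accession, base_count_total(line) == length)
```

For all three records the counts sum to the stated sequence length.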

L3 ANSWER 106 OF 111 MEDLINE on STN 

AN 2003541411 MEDLINE

DN PubMed ID: 12827517 

TI Organic cation transporters. 

AU Koepsell H; Schmitt B M; Gorboulev V 

CS Institut fur Anatomie und Zellbiologie, Bayerischen Julius-Maximilians - 



Hermann@Koepsell.de

SO Reviews of physiology, biochemistry and pharmacology, (2003) 150 36-90. 
Ref: 390 

Journal code: 0434624. ISSN: 0303-4240. 
CY Germany: Germany, Federal Republic of 
DT Journal; Article; (JOURNAL ARTICLE) 

General Review; (REVIEW) 

(REVIEW, ACADEMIC) 
LA English 
FS Priority Journals 
EM 200404 

ED Entered STN: 20031119 

Last Updated on STN: 20040501 
Entered Medline: 20040430 

L3 ANSWER 107 OF 111 SCISEARCH COPYRIGHT 2004 THOMSON ISI on STN

AN 2004:343483 SCISEARCH 

GA The Genuine Article (R) Number: 802CT

TI Expression of ***high***-***affinity*** ***choline***

***transporter*** CHT1 in the ***human*** leukemic T cell line
MOLT-3

AU Fujii T (Reprint) ; Okuda T; Haga T; Kawashima K 

CS Kyoritsu Coll Pharmaceut Sci, Dept Pharmacol, Tokyo 1058512, Japan; Univ 
Tokyo, Fac Med, Dept Neurochem, Tokyo 1170033, Japan; Gakushuin Univ, Fac 
Sci, Inst Biomol Sci, Tokyo 1718588, Japan 

CYA Japan 

SO JOURNAL OF PHARMACOLOGICAL SCIENCES, (10 MAR 2004) Vol. 94, Supp. [1], pp. 
202P-202P . 

Publisher: JAPANESE PHARMACOLOGICAL SOC, EDITORIAL OFF, KANTOHYA BLDG

GOKOMACHI-EBISUGAWA NAKAGYO-KU, KYOTO, 604, JAPAN. 

ISSN: 1347-8613. 
DT Conference; Journal 
LA English 
REC Reference Count: 0 

L3 ANSWER 108 OF 111 SCISEARCH COPYRIGHT 2004 THOMSON ISI on STN 

AN 2004:105139 SCISEARCH 

GA The Genuine Article (R) Number: 753PR

TI Characterization of the ***human*** ***high***-***affinity***

***choline*** ***transporter*** 
AU Haga T (Reprint) ; Okuda T; Kaitsuka C; Okamura M; Osawa C; Nishiyama N; 

Yamada H; Nakamura T; Kobayashi Y 
CS Gakushuin Univ, Inst Biomol Sci, Tokyo 171, Japan; Univ Tokyo, Fac Med, 

Tokyo 113, Japan; Kyorin Univ, Fac Med, Tokyo, Japan 
CYA Japan 

SO JOURNAL OF NEUROCHEMISTRY, (DEC 2003) Vol. 87, Supp. [1], pp. 43-43. 

Publisher: BLACKWELL PUBLISHING LTD, 9600 GARSINGTON RD, OXFORD OX4 2DG,

OXON, ENGLAND. 

ISSN: 0022-3042. 
DT Conference; Journal 
LA English 
REC Reference Count: 0 

L3 ANSWER 109 OF 111 USPATFULL on STN 
AN 2004:63743 USPATFULL

TI Invertebrate choline transporter nucleic acids, polypeptides and uses 

thereof 

IN Raming, Klaus, Leverkusen, GERMANY, FEDERAL REPUBLIC OF 

PI US 2004048261 A1 20040311

AI US 2002-241784 A1 20020911 (10)

DT Utility 

FS APPLICATION 

LN.CNT 2653 

INCL INCLM: 435/006.000 

INCLS: 435/069.100; 435/320.100; 435/348.000; 530/350.000; 536/023.500;
800/008.000 
NCL NCLM: 435/006.000 

NCLS: 435/069.100; 435/320.100; 435/348.000; 530/350.000; 536/023.500; 
800/008.000 

IC [7] 

ICM: C12Q001-68 

ICS: A01K067-033; C07H021-04; C07K014-705 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L3 ANSWER 110 OF 111 USPATFULL on STN 



TI 18F-labeled choline analogs

IN DeGrado, Timothy R., Durham, NC, UNITED STATES

Coleman, R. Edward, Durham, NC, UNITED STATES 
Baldwin, Steven W., Durham, NC, UNITED STATES
Price, David T., Greenwood, LA, UNITED STATES 
Orr, Matthew D., Durham, NC, UNITED STATES 
Wang, Shuyan, Durham, NC, UNITED STATES 

PI US 2002061279 A1 20020523

US 6630125 B2 20031007

AI US 2001-844674 A1 20010430 (9)

PRAI US 2000-200347P 20000428 (60) 

US 2000-231303P 20000908 (60) 

DT Utility 

FS APPLICATION 

LN.CNT 1576 

INCL INCLM: 424/001.890 
NCL NCLM: 424/001.890 
IC [7] 

ICM: A61K051-00 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 



L3 ANSWER 111 OF 111 USPATFULL on STN 
AN 1999:85438 USPATFULL 

TI Decahydroquinoline-based anti-cholinergic agents

IN Efange, S. Mbua Ngale, Plymouth, MN, United States 

Parsons, Stanley M., Santa Barbara, CA, United States
PA Regents of the University of Minnesota, Minneapolis, MN, United States 

(U.S. corporation) 
PI US 5929087 19990727 

AI US 1997-826830 19970408 (8) 

DT Utility 
FS Granted 
LN.CNT 857 

INCL INCLM: 514/314.000 

INCLS: 514/278.000; 546/017.000; 546/154.000; 546/158.000; 546/164.000 
NCL NCLM: 514/314.000 

NCLS: 514/278.000; 546/017.000; 546/154.000; 546/158.000; 546/164.000 
IC [6] 

ICM: A61K031-47 

ICS: C07D401-04 
EXF 546/164; 546/154; 546/158; 546/17; 514/314; 514/312 
CAS INDEXING IS AVAILABLE FOR THIS PATENT. 
STN INTERNATIONAL LOGOFF AT 10:22:35 ON 21 MAY 2004 



